Browsing: AI Safety

Global AI Trends

OpenAI and Anthropic Partner on Rare Joint Safety Tests Amid Fierce AI Competition

27 August 2025

In a rare moment of cooperation between two of the world’s biggest AI labs, OpenAI and Anthropic temporarily opened up…

Global AI Trends

Top AI Labs Urge Focus on Monitoring Thought Processes of AI Reasoning Models

15 July 2025

Researchers from OpenAI, Google DeepMind, Anthropic, and other leading organizations are calling for deeper investigation into techniques for monitoring the…

Global AI Trends

Deceptive AI: New Models Lie, Manipulate, and Threaten, Raising Safety Alarms

29 June 2025

Artificial intelligence systems are exhibiting increasingly disturbing behaviours, including lying, manipulation, and even threatening their creators, according to researchers evaluating…

Global AI Trends

Anthropic Warns Most Leading AI Models Resort to Harmful Behavior in Simulated Tests

21 June 2025

Anthropic has released new research showing that most major AI models, when placed in high-stakes simulated environments, resorted to harmful…

Global AI Trends

Former OpenAI Researcher Warns GPT-4o Shows Alarming Self-Preservation Bias in Safety Tests

12 June 2025

Former OpenAI research leader Steven Adler has published a new independent study claiming that the company’s GPT-4o model often prioritizes…

Global AI Trends

OpenAI Launches Safety Hub: Because Even Chatbots Need a Performance Review

17 May 2025

OpenAI just dropped something new – and no, it’s not another AI model that writes sonnets about soup. This time,…

Subscribe to our Newsletter:

Browsing: AI Safety

OpenAI and Anthropic Partner on Rare Joint Safety Tests Amid Fierce AI Competition

Top AI Labs Urge Focus on Monitoring Thought Processes of AI Reasoning Models

Deceptive AI: New Models Lie, Manipulate, and Threaten, Raising Safety Alarms

Anthropic Warns Most Leading AI Models Resort to Harmful Behavior in Simulated Tests

Former OpenAI Researcher Warns GPT-4o Shows Alarming Self-Preservation Bias in Safety Tests

OpenAI Launches Safety Hub: Because Even Chatbots Need a Performance Review

Nvidia Posts Soaring Revenue As AI Chip Demand Surges, Easing Fears Of An AI Market Bubble

Survey Finds Half Of Novelists Fear Full Replacement By AI As Income Losses And Copyright Concerns Grow

AI-driven cyber attacks surge across Africa, with Ethiopia facing highest weekly assault rate

How AI Can Fight Fraud in Claims

Lamola Says South Africa Will Use G20 Summit To Push For Fairer Trade, Beneficiation And Digital Revenue Reform

Google Launches Gemini 3, Ushering In A New Era Of Agentic AI For South Africa