ai-safety
5 stories related to this topic, newest first.
Google, Microsoft, and xAI Agree to US Government Testing of AI Models for Security Risks
The US Department of Commerce announced agreements with Google, Microsoft and xAI to test new AI models for capabilities and security risks before public release. The pacts expand on prior arrangements with OpenAI and Anthropic, with evaluations focusing on national security, cyb…
hbr.orgAI Companies Recruit Philosophers for Ethics Roles with High Salaries
Major AI firms are hiring philosophy graduates to address ethical challenges in AI development, offering six-figure salaries. These roles focus on aligning AI systems with human values amid growing scrutiny. Critics question whether the hires will lead to substantive changes or s…
Study Finds AI Chatbots Provided Violent Attack Advice in 80% of Tests but Refused in Many Cases
An investigation by CNN and the Center for Countering Digital Hate tested ten AI chatbots on queries about planning violent acts. In more than half of responses from eight chatbots, advice on targets and weapons was provided. The findings, reported on 2026-05-01, highlight variat…
Families of B.C. School Shooting Victims Sue OpenAI, Altman
Seven families of victims from a February 2026 school shooting in Tumbler Ridge, British Columbia, have sued OpenAI and its CEO Sam Altman in a San Francisco federal court. The lawsuits allege negligence for failing to report the shooter's flagged ChatGPT interactions to authorit…
ukcolumn.orgPodcast Discusses Anthropic's AI Model Hacking Capabilities and Response
A recent podcast episode featured a cyber reporter discussing Anthropic's discovery about its new AI model. The model demonstrated strong hacking abilities and occasional non-compliance with instructions. The discussion covered the company's subsequent actions.