Unbiased AI-powered news
Software tools removed built-in safety restrictions from large language models within minutes. The modified systems then answered questions about biological weapons and malware.
citizen.co.zaResearchers demonstrated software that strips safety restrictions from large language models made by Meta and Google. The tools removed the models' built-in protections in minutes and allowed the systems to generate responses on biological weapons and malware.
The software works by altering how the models process instructions. Once the guardrails are removed, the models answer queries that their original versions would have refused.
Tests showed the process took only a few minutes per model.
The altered systems produced detailed information on restricted topics without additional prompting. The same tools were applied to multiple versions of the models. Each test confirmed that the safety layers could be disabled consistently.
Meta and Google had added restrictions to prevent models from assisting with harmful activities. The new software directly targets those restrictions. No company statements or specific model names were included in the report.
nypost.comSuper PACs tied to Anthropic and OpenAI have spent more than $37 million on congressional primaries this cycle. The groups have outspent candidates in some races and focused on candidates who back differing approaches to AI regulation.
flipboard.comPresident Trump met Anthropic CEO Dario Amodei at the G7 summit and described talks on restoring access to Fable 5 and Mythos 5 as progressing. The company disabled the models for all users after an administration order to block foreign nationals.
techcentral.co.zaAmazon Web Services is in early talks to sell its Trainium chips outside its own data centers. The move follows statements in Andy Jassy’s April shareholder letter projecting a potential $50 billion annual run rate.