Unbiased AI-powered news
A 2023 research paper by Anthropic employees explored how AI models handle discrimination related to race and gender. The paper discussed potential overcorrections in model responses to address historical injustices, based on experiments with human input. It highlighted challenges in training AI on human-generated text while mitigating biases.
Substrate placeholder — needs reviewFox News reported on a 2023 paper by researchers at Anthropic, an AI company, that examined discrimination in AI models. The paper, co-authored by Amanda Askell and four others, analyzed how large language models respond to prompts involving race and gender.
one experiment, a 175 billion parameter model showed a 3% discrimination against Black students compared to White students in a baseline condition. With additional human input and training, the model shifted to favor Black students by 7%. The paper noted that such overcorrections might be applied in contexts aiming to address historical injustices against marginalized groups, if compliant with local laws.
The authors stated that language models trained on human text often reflect harmful stereotypes and discrimination present in that data.
The paper observed that models can be steered to avoid bias through natural language instructions requesting unbiased responses. Anthropic has positioned its AI model Claude as focused on ethical behavior, with its constitution aiming for the model to exhibit skill, judgment, nuance, and sensitivity in decision-making.
Amanda Askell, a philosopher at Anthropic, works on finetuning and AI alignment to make models more honest and to develop positive character traits. She previously held a similar role at OpenAI.
recently withheld its latest model, Mythos, due to its effectiveness in identifying cyber vulnerabilities. Earlier this year, the company clashed with the Department of War over restrictions on using its technology for lethal operations. A federal judge blocked a ban on Anthropic's technology for Department of War use.
Anthropic markets Claude as an ethical AI option amid ongoing debates about AI ethics and applications. The U.S. military reportedly used Claude in an operation to capture Venezuelan leader Nicolás Maduro, where it handled requests autonomously and generated documentation.
nypost.comSuper PACs tied to Anthropic and OpenAI have spent more than $37 million on congressional primaries this cycle. The groups have outspent candidates in some races and focused on candidates who back differing approaches to AI regulation.
flipboard.comPresident Trump met Anthropic CEO Dario Amodei at the G7 summit and described talks on restoring access to Fable 5 and Mythos 5 as progressing. The company disabled the models for all users after an administration order to block foreign nationals.
techcentral.co.zaAmazon Web Services is in early talks to sell its Trainium chips outside its own data centers. The move follows statements in Andy Jassy’s April shareholder letter projecting a potential $50 billion annual run rate.