Major Publishers and Author File Copyright Lawsuit Against Meta Over AI Training Data
Five major publishers and author Scott Turow filed a class-action lawsuit against Meta and CEO Mark Zuckerberg in Manhattan federal court, alleging illegal use of copyrighted books and articles to train the Llama AI model. The suit claims Zuckerberg personally authorized the infringement. Meta vowed to fight the case, citing fair use precedents.
VarietyFive major publishing houses and author Scott Turow sued Meta Platforms and CEO Mark Zuckerberg in Manhattan federal court on Tuesday, alleging the company illegally used millions of copyrighted works to train its Llama artificial intelligence language models.
The class-action complaint accuses Meta of reproducing and distributing the materials without permission or compensation, with full knowledge of violating copyright law. Plaintiffs include Elsevier, Cengage, Hachette Book Group, Macmillan, and McGraw Hill, alongside Turow, whose works appear among those allegedly infringed.
The lawsuit, filed on May 5, 2026, details how Meta and Zuckerberg allegedly torrented millions of copyrighted books and journal articles from pirate sites and downloaded unauthorized web scrapes of much of the internet. These materials were then copied multiple times to develop the multibillion-dollar Llama system, which generates responses to human prompts.
Authors published by the suing companies include Scott Turow, James Patterson, Donna Tartt, former President Joe Biden, Yiyun Li, and Amanda Vaill.
Yiyun Li and Amanda Vaill were among the Pulitzer Prize winners announced on Monday. The publishers allege Meta pirated a range of works, from textbooks and scientific articles to novels such as The Fifth Season by NK Jemisin and The Wild Robot by Peter Brown.
The plaintiffs asked the court for permission to represent a larger class of copyright owners and seek an unspecified amount of monetary damages.
The complaint states that Zuckerberg personally authorized and actively encouraged the infringement. It describes the actions as one of the most massive infringements of copyrighted materials in history, driven by Meta's effort to build a functional generative AI model.
““Defendants reproduced and distributed millions of copyrighted works without permission, without providing any compensation to authors or publishers, and with full knowledge that their conduct violated copyright law.” — Plaintiffs' complaint Meta issued a statement on Monday vowing to fight the lawsuit aggressively. A Meta spokesperson said AI powers transformative innovations, productivity, and creativity for individuals and companies, and courts have found that training AI on copyrighted material can qualify as fair use. The company denied any wrongdoing in response to the allegations.”
““AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use.” — Meta spokesperson Maria Pallante, president of the Association of American Publishers, stated that Meta’s mass-scale infringement isn’t public progress, and AI will never be properly realized if tech companies prioritize pirate sites over scholarship and imagination. Her comment came in a statement following the lawsuit's filing. The case opens a new front in ongoing legal battles between creators and AI developers over copyright issues. In 2025, Anthropic agreed to pay $1.5 billion to settle a class-action suit initiated by thriller novelist Andrea Bartz and nonfiction writers Charles Graeber and Kirk Wallace Johnson. A final approval hearing for that settlement is scheduled for next week. Multiple lawsuits have targeted companies like Meta, OpenAI, and Anthropic, with cases often hinging on whether AI training constitutes fair use by creating transformative content. Last year, the first judicial rulings on such matters diverged, and Anthropic became the first major AI firm to settle, avoiding potentially higher damages. The publishers' suit references Meta's history, including Zuckerberg's testimony during a Senate judiciary committee hearing in Washington DC on January 31, 2024. While that hearing focused on broader tech issues, the complaint ties it to the company's approach to innovation. Authors and publishers have pursued similar actions in recent years, with dozens of plaintiffs including news outlets and visual artists alleging infringement. The New York Times has sued OpenAI and Microsoft on related grounds. These disputes center on the balance between technological advancement and intellectual property rights, with outcomes likely to shape AI development practices. The lawsuit alleges Meta followed its motto 'move fast and break things' by prioritizing speed over legal compliance in the AI arms race. Plaintiffs claim this led to unauthorized copying on an unprecedented scale, affecting works across genres and disciplines.”
Key Facts
Story Timeline
6 events- 2026-05-05
Five publishers and author Scott Turow filed a class-action lawsuit against Meta and Mark Zuckerberg in Manhattan federal court.
6 sourcesfortune.com · Variety · The Washington Post · The Guardian - 2026-05-04
Meta issued a statement vowing to fight the lawsuit aggressively.
2 sourcesfortune.com · The Guardian - 2026-05-04
Pulitzer Prize winners including Yiyun Li and Amanda Vaill announced.
1 sourcefortune.com - 2025
Anthropic agreed to pay $1.5 billion to settle a class-action suit by authors Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson.
3 sourcesfortune.com · The Guardian · Variety - 2024-01-31
Mark Zuckerberg testified during a Senate judiciary committee hearing in Washington DC.
1 sourceThe Guardian - Next week (from 2026-05-06)
Final approval hearing scheduled for Anthropic settlement.
2 sourcesfortune.com · The Guardian
Potential Impact
- 01
Broader implications for AI industry practices on copyright and fair use in training models.
- 02
Influence on ongoing cases against other AI firms like OpenAI and Microsoft.
- 03
Potential financial damages for Meta if the lawsuit succeeds, similar to Anthropic's $1.5 billion settlement.
- 04
Potential shifts in publishing industry negotiations with tech companies over AI data use.
- 05
Possible class expansion to include more copyright owners, increasing scope of claims.
Transparency Panel
Related Stories
naturalnews.comBrockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trial
OpenAI President Greg Brockman detailed a heated 2017 confrontation with Elon Musk during testimony in the federal trial Musk v. Altman. He described Musk storming around a table and grabbing a painting after rejecting shared control proposals. The lawsuit seeks $150 billion in d…
Trump Administration Explores Government Review of AI Models Before Public Release
The Trump administration is discussing measures to vet advanced AI models for safety and security risks prior to their release, marking a potential shift from its previous hands-off stance on AI regulation. Officials are considering an executive order to establish a working group…
thenation.comPublishing Houses, Scott Turow Sue Meta Over AI Training Data Copyright
Five major publishing houses and author Scott Turow filed a class action lawsuit against Meta and CEO Mark Zuckerberg, alleging the company illegally used millions of copyrighted books and journal articles to train its Llama AI model. The suit, filed in federal court in Manhattan…