Substrate
technology

Major Publishers and Author File Copyright Lawsuit Against Meta Over AI Training Data

Five major publishers and author Scott Turow filed a class-action lawsuit against Meta and CEO Mark Zuckerberg in Manhattan federal court, alleging illegal use of copyrighted books and articles to train the Llama AI model. The suit claims Zuckerberg personally authorized the infringement. Meta vowed to fight the case, citing fair use precedents.

fortune.com
Variety
The Washington Post
The Guardian
The Verge
NPR
6 sources·May 5, 7:08 PM(1 hr ago)·3m read
|
Major Publishers and Author File Copyright Lawsuit Against Meta Over AI Training DataVariety
Audio version
Tap play to generate a narrated version.

Five major publishing houses and author Scott Turow sued Meta Platforms and CEO Mark Zuckerberg in Manhattan federal court on Tuesday, alleging the company illegally used millions of copyrighted works to train its Llama artificial intelligence language models.

The class-action complaint accuses Meta of reproducing and distributing the materials without permission or compensation, with full knowledge of violating copyright law. Plaintiffs include Elsevier, Cengage, Hachette Book Group, Macmillan, and McGraw Hill, alongside Turow, whose works appear among those allegedly infringed.

The lawsuit, filed on May 5, 2026, details how Meta and Zuckerberg allegedly torrented millions of copyrighted books and journal articles from pirate sites and downloaded unauthorized web scrapes of much of the internet. These materials were then copied multiple times to develop the multibillion-dollar Llama system, which generates responses to human prompts.

Authors published by the suing companies include Scott Turow, James Patterson, Donna Tartt, former President Joe Biden, Yiyun Li, and Amanda Vaill.

Yiyun Li and Amanda Vaill were among the Pulitzer Prize winners announced on Monday. The publishers allege Meta pirated a range of works, from textbooks and scientific articles to novels such as The Fifth Season by NK Jemisin and The Wild Robot by Peter Brown.

The plaintiffs asked the court for permission to represent a larger class of copyright owners and seek an unspecified amount of monetary damages.

The complaint states that Zuckerberg personally authorized and actively encouraged the infringement. It describes the actions as one of the most massive infringements of copyrighted materials in history, driven by Meta's effort to build a functional generative AI model.

“Defendants reproduced and distributed millions of copyrighted works without permission, without providing any compensation to authors or publishers, and with full knowledge that their conduct violated copyright law.” — Plaintiffs' complaint Meta issued a statement on Monday vowing to fight the lawsuit aggressively. A Meta spokesperson said AI powers transformative innovations, productivity, and creativity for individuals and companies, and courts have found that training AI on copyrighted material can qualify as fair use. The company denied any wrongdoing in response to the allegations.

“AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use.” — Meta spokesperson Maria Pallante, president of the Association of American Publishers, stated that Meta’s mass-scale infringement isn’t public progress, and AI will never be properly realized if tech companies prioritize pirate sites over scholarship and imagination. Her comment came in a statement following the lawsuit's filing. The case opens a new front in ongoing legal battles between creators and AI developers over copyright issues. In 2025, Anthropic agreed to pay $1.5 billion to settle a class-action suit initiated by thriller novelist Andrea Bartz and nonfiction writers Charles Graeber and Kirk Wallace Johnson. A final approval hearing for that settlement is scheduled for next week. Multiple lawsuits have targeted companies like Meta, OpenAI, and Anthropic, with cases often hinging on whether AI training constitutes fair use by creating transformative content. Last year, the first judicial rulings on such matters diverged, and Anthropic became the first major AI firm to settle, avoiding potentially higher damages. The publishers' suit references Meta's history, including Zuckerberg's testimony during a Senate judiciary committee hearing in Washington DC on January 31, 2024. While that hearing focused on broader tech issues, the complaint ties it to the company's approach to innovation. Authors and publishers have pursued similar actions in recent years, with dozens of plaintiffs including news outlets and visual artists alleging infringement. The New York Times has sued OpenAI and Microsoft on related grounds. These disputes center on the balance between technological advancement and intellectual property rights, with outcomes likely to shape AI development practices. The lawsuit alleges Meta followed its motto 'move fast and break things' by prioritizing speed over legal compliance in the AI arms race. Plaintiffs claim this led to unauthorized copying on an unprecedented scale, affecting works across genres and disciplines.

Key Facts

Lawsuit Filing
Filed on May 5, 2026, in Manhattan federal court by five publishers and Scott Turow against Meta and Zuckerberg.
Allegations
Meta allegedly torrented millions of copyrighted works from pirate sites to train Llama AI, with Zuckerberg personally authorizing infringement.
Meta Response
Meta stated it will fight aggressively, claiming AI training on copyrighted material qualifies as fair use.
Prior Settlement
Anthropic paid $1.5 billion in 2025 to settle similar claims.
Affected Authors
Includes James Patterson, Donna Tartt, former President Joe Biden, Yiyun Li, Amanda Vaill, NK Jemisin, and Peter Brown.

Story Timeline

6 events
  1. 2026-05-05

    Five publishers and author Scott Turow filed a class-action lawsuit against Meta and Mark Zuckerberg in Manhattan federal court.

    6 sourcesfortune.com · Variety · The Washington Post · The Guardian
  2. 2026-05-04

    Meta issued a statement vowing to fight the lawsuit aggressively.

    2 sourcesfortune.com · The Guardian
  3. 2026-05-04

    Pulitzer Prize winners including Yiyun Li and Amanda Vaill announced.

    1 sourcefortune.com
  4. 2025

    Anthropic agreed to pay $1.5 billion to settle a class-action suit by authors Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson.

    3 sourcesfortune.com · The Guardian · Variety
  5. 2024-01-31

    Mark Zuckerberg testified during a Senate judiciary committee hearing in Washington DC.

    1 sourceThe Guardian
  6. Next week (from 2026-05-06)

    Final approval hearing scheduled for Anthropic settlement.

    2 sourcesfortune.com · The Guardian

Potential Impact

  1. 01

    Broader implications for AI industry practices on copyright and fair use in training models.

  2. 02

    Influence on ongoing cases against other AI firms like OpenAI and Microsoft.

  3. 03

    Potential financial damages for Meta if the lawsuit succeeds, similar to Anthropic's $1.5 billion settlement.

  4. 04

    Potential shifts in publishing industry negotiations with tech companies over AI data use.

  5. 05

    Possible class expansion to include more copyright owners, increasing scope of claims.

Transparency Panel

Sources cross-referenced6
Confidence score97%
Synthesized bySubstrate AI
Word count664 words
PublishedMay 5, 2026, 7:08 PM
Bias signals removed3 across 3 outlets
Signal Breakdown
Loaded 3

Related Stories

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trialnaturalnews.com
ai1 hr agoUpdated

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trial

OpenAI President Greg Brockman detailed a heated 2017 confrontation with Elon Musk during testimony in the federal trial Musk v. Altman. He described Musk storming around a table and grabbing a painting after rejecting shared control proposals. The lawsuit seeks $150 billion in d…

The New York Times
Wired
New York Post
BBC News
Business Insider
+3
9 sources
Trump Administration Explores Government Review of AI Models Before Public ReleaseShealeah Craighead / Wikimedia (Public domain)
technology1 hr agoUpdated

Trump Administration Explores Government Review of AI Models Before Public Release

The Trump administration is discussing measures to vet advanced AI models for safety and security risks prior to their release, marking a potential shift from its previous hands-off stance on AI regulation. Officials are considering an executive order to establish a working group…

FO
The New York Times
Semafor
Politico
CBS News
+6
12 sources
Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyrightthenation.com
ai5 hrs agoFraming55Framing risk55/100Rewrite inherits negative framing of Meta's actions through loaded verbs and phrases, with lede misdirection centering on lawsuit filing over core infringement allegations.Click to jump to full framing analysis

Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyright

Five major publishing houses and author Scott Turow filed a class action lawsuit against Meta and CEO Mark Zuckerberg, alleging the company illegally used millions of copyrighted books and journal articles to train its Llama AI model. The suit, filed in federal court in Manhattan…

fortune.com
The Washington Post
Financial Times
NPR
4 sources