Substrate
ai

Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyright

Five major publishing houses and author Scott Turow filed a class action lawsuit against Meta and CEO Mark Zuckerberg, alleging the company illegally used millions of copyrighted books and journal articles to train its Llama AI model. The suit, filed in federal court in Manhattan, accuses Meta of bypassing licensing deals and sourcing materials from pirate sites.

fortune.com
The Washington Post
Financial Times
NPR
4 sources·May 5, 7:08 PM(59 min ago)·2m read
|
Publishing Houses, Scott Turow Sue Meta Over AI Training Data Copyrightthenation.com
Audio version
Tap play to generate a narrated version.

Five publishing houses and bestselling author Scott Turow sued Meta and its CEO Mark Zuckerberg on Tuesday, accusing the company of illegally using millions of copyrighted works to train its AI language system Llama. The class action lawsuit, filed in the United States District Court for the Southern District of New York in Manhattan, alleges that Meta reproduced and distributed these materials without permission or compensation to authors and publishers, despite knowing the conduct violated copyright law.

In the suit. The plaintiffs claim Zuckerberg personally authorized and actively encouraged the infringement, following Meta's motto of 'move fast and break things' by drawing upon books and journal articles for Llama. According to the complaint, Meta knowingly copied copyrighted materials from pirate websites such as LibGen and Anna's Archive to train various iterations of the model.

The suit argues Meta willfully bypassed legal licensing markets to gain an advantage in the AI arms race. Authors published by the suing companies include Turow, James Patterson, Donna Tartt, former President Joe Biden, Yiyun Li and Amanda Vaill. Yiyun Li and Amanda Vaill are two of the Pulitzer Prize winners announced on Monday.

U.K. Jemisin's The Fifth Season, and Lemony Snicket's Who Could That Be at This Hour?. The class represented by Turow encompasses all legal or beneficial owners of registered copyrights for any book with an International Standard Book Number (ISBN) or journal article with a Digital Object Identifier (DOI) or International Standard Serial Number (ISSN).

The plaintiffs seek statutory damages, a permanent injunction against Meta to stop further use of their works, and an order requiring the company to destroy all infringing copies of the materials. I. has been created with stolen words, and it is shameful that these violations were undertaken by one of the richest corporations in the world.

The complaint details how Meta briefly considered licensing deals with major publishers but changed its strategy in April 2023. The question of whether to license or pirate was escalated to Zuckerberg that month, after which Meta's business development team received verbal instructions to stop licensing efforts. Meta vowed to fight the lawsuit aggressively in a statement on Monday.

Nkechi Nneji, a public affairs director for Meta, stated to NPR that AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use. S.

District Court Judge Vince Chhabria dismissed a copyright infringement lawsuit against Meta, stating the court had no choice but to grant summary judgment to Meta on the claim that the company violated copyright law by training its models with plaintiffs' books, as the plaintiffs did not present enough evidence of harm.

The suit comes amid a wave of litigation over AI training practices. 5 billion to settle a class action suit initiated by thriller novelist Andrea Bartz and nonfiction writers Charles Graeber and Kirk Wallace Johnson. A final approval hearing for the Anthropic settlement is scheduled for next week.

U.S. District Court Judge William Alsup stated last June that the use of the books at issue to train Claude and its precursors was exceedingly transformative.

Key Facts

Lawsuit filed by five publishers and Scott Turow
Accuses Meta of using millions of copyrighted works from pirate sites to train Llama AI without permission or compensation.
Zuckerberg allegedly authorized infringement
Meta shifted from licensing to pirating strategy in April 2023 after escalation to CEO, per complaint.
Specific works cited in suit
Includes Presumed Innocent by Turow (1987), Impact by Preston, The Wild Robot by Brown, The Fifth Season by Jemisin, and Who Could That Be at This Hour? by Snic
Meta's response
Company states AI training on copyrighted material qualifies as fair use and will fight aggressively, citing court precedents.
Prior AI copyright cases
Anthropic settled for $1.5 billion in 2025; Meta won dismissal in June 2025 for insufficient evidence of harm.

Story Timeline

6 events
  1. 2026-05-05

    Pulitzer Prize winners Yiyun Li and Amanda Vaill announced; Meta issues statement vowing to fight upcoming lawsuit aggressively.

    2 sourcesfortune.com · NPR
  2. 2026-05-06

    Five publishing houses and Scott Turow file class action lawsuit against Meta and Mark Zuckerberg in Manhattan federal court.

    3 sourcesfortune.com · NPR · unattributed
  3. 2023-04

    Meta escalates licensing vs. pirating decision to Zuckerberg, leading to instructions to halt licensing efforts.

    1 sourceNPR
  4. 2025

    Anthropic agrees to $1.5 billion settlement in class action suit by authors including Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson.

    2 sourcesfortune.com · NPR
  5. 2025-06

    U.S. District Court Judge William Alsup rules Anthropic's use of books to train AI as transformative.

    1 sourceNPR
  6. 2025-06

    U.S. District Court Judge Vince Chhabria dismisses copyright lawsuit against Meta for lack of evidence of harm.

    1 sourceNPR

Potential Impact

  1. 01

    Bypassing licensing markets may deter future publisher deals with AI companies, slowing industry collaborations.

  2. 02

    Broad class definition may expand to thousands of authors and publishers, increasing Meta's legal exposure in copyright disputes.

  3. 03

    Settlement precedent from Anthropic's $1.5 billion deal may pressure Meta toward negotiation rather than prolonged litigation.

  4. 04

    Ruling on fair use could clarify AI training boundaries, affecting other tech firms like those developing similar models.

  5. 05

    Potential statutory damages and injunction could force Meta to destroy AI training data and halt Llama development using disputed materials.

Transparency Panel

Sources cross-referenced4
Framing risk55/100 (moderate)
Confidence score90%
Synthesized bySubstrate AI
Word count522 words
PublishedMay 5, 2026, 7:08 PM
Bias signals removed2 across 2 outlets
Signal Breakdown
Loaded 2

Related Stories

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trialfrance24.com
ai4 hrs ago

Brockman Testifies on Heated 2017 Dispute with Musk Over OpenAI's For-Profit Shift in Federal Trial

OpenAI President Greg Brockman detailed a heated 2017 confrontation with Elon Musk during testimony in the federal trial Musk v. Altman. He described Musk storming around a table and grabbing a painting after rejecting shared control proposals. The lawsuit seeks $150 billion in d…

The New York Times
Wired
New York Post
3 sources
Anthropic Launches AI Agents for Finance Amid Investments and Pentagon Exclusionthehindu.com
ai4 hrs ago

Anthropic Launches AI Agents for Finance Amid Investments and Pentagon Exclusion

Anthropic introduced 10 new AI agents designed to automate routine tasks in the financial sector, such as building models and preparing pitches. Major tech firms reported significant profit gains from their stakes in the company, while the Pentagon announced deals with other AI p…

Bloomberg
The New York Times
Fortune
Business Insider
Defense News
5 sources
Apple Plans to Let Users Select Third-Party AI Models for iOS 27 Features This FallCrisco 1492 / Wikimedia (CC BY-SA 4.0)
ai2 hrs agoUpdated

Apple Plans to Let Users Select Third-Party AI Models for iOS 27 Features This Fall

Apple intends to introduce greater flexibility in its AI ecosystem by allowing users to choose third-party models for tasks in iOS 27, iPadOS 27, and macOS 27. The updates, expected this fall, will enable integrations with models from providers like Google and Anthropic. This mov…

IN
TechCrunch
The Verge
Reuters
4 sources