Settlement Rejected Filed 2024-08 Updated August 20, 2025

Authors v. Anthropic

1.5B settlement rejected by judge.

Parties:
Andrea Bartz et al. vs. Anthropic
Court:
U.S. District Court, Northern District of California (Judge William Alsup)
Claims:
Copyright infringement: Anthropic used copyrighted books from The Pile dataset to train Claude witho...
Damages:
$1.5 billion (settlement offered, rejected)

Authors v. Anthropic — The $1.5 Billion AI Training Case

Case Summary

Multiple authors including Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson sued Anthropic in August 2024 for using their copyrighted books to train the Claude AI model. The books were included in "The Pile," a dataset that inadvertently contained copyrighted works. Anthropic offered a record $1.5 billion settlement, which the judge rejected.

Timeline

DateEvent
Aug 2024Authors file lawsuit
Early 2025Judge Alsup rules purchased books = fair use
Mid 2025Alsup rules The Pile usage = NOT fair use
Aug 2025Anthropic offers $1.5B settlement
Late 2025Judge Alsup rejects settlement terms
2026Renegotiation ongoing

Key Legal Issues

The Dual Fair Use Ruling

Judge Alsup made a critical distinction:

Fair Use (Purchased Books):

  • Anthropic legally bought physical books and digitized them
  • Use was transformative (training, not reproduction)
  • AI does not reproduce books verbatim
  • Result: Fair use

NOT Fair Use (The Pile):

  • Books were included without license or purchase
  • Dataset creators had not obtained permission
  • Some works remained after copyright concerns were raised
  • Result: Copyright infringement

Why the Settlement Was Rejected

Judge Alsup rejected the $1.5B settlement because:

  • ~$3,000 per author was deemed insufficient
  • Release terms were too broad (would prevent future claims)
  • Opt-out process was unnecessarily complicated
  • Insufficient disclosure about which specific works were used
  • Terms would be forced "down the throat of authors"

Significance

The Numbers

  • $1.5 billion — largest copyright settlement offer in U.S. history
  • ~500,000 authors affected
  • ~$3,000 per author (deemed insufficient)

What This Establishes

1. Unlicensed use of copyrighted works for AI training CAN be infringement

2. Purchasing books and digitizing them for training MAY be fair use

3. The source/provenance of training data matters legally

4. Courts will scrutinize settlement terms to protect creators

5. AI companies face real financial liability for training data choices

Current Status

SETTLEMENT REJECTED — Parties are renegotiating revised terms that address Judge Alsup's concerns. New proposal expected in 2026.