Major publishing houses sue Meta and Mark Zuckerberg over AI copyright

0
1
Advertisement

Alexandra Alter

Five major publishers – Hachette, Macmillan, McGraw Hill, Elsevier and Cengage – and bestselling novelist Scott Turow have filed a class-action copyright infringement lawsuit against Meta and its founder and chief executive, Mark Zuckerberg.

The complaint, which was filed Tuesday morning (US time) in the US District Court for the Southern District of New York, accuses Meta and Zuckerberg of illegally using millions of copyrighted works to train their artificial intelligence program Llama, and of removing copyright notices and other copyright management information from those works.

Meta chief executive Mark Zuckerberg.AP

The lawsuit asserts that Meta’s engineers relied on pirated books and journal articles to train the program by downloading unlicensed copies through websites like Anna’s Archive, an open source search engine for piracy sites including LibGen and Sci-Hub. The suit also claims that “Zuckerberg himself personally authorised and actively encouraged the infringement”.

“AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use,” a Meta spokesperson, Dave Arnold, told The New York Times. “We will fight this lawsuit aggressively.”

Advertisement

The plaintiffs argue that Meta’s AI program poses a threat to the livelihoods of writers and publishers because the technology can be used to quickly produce AI-generated copycat books and to summarise the plot and themes of copyrighted books in such great detail that readers don’t have to buy them.

“These AI-generated books are already flooding the world’s largest book marketplace, Amazon, in volumes that materially displace human-authored works,” the complaint states. The filing cites several authors whose works the plaintiffs claim were used to train Llama, including V.E. Schwab, N.K. Jemisin, Lemony Snicket and Turow.

Some of the evidence cited in the complaint purportedly comes directly from Llama. When asked to produce a travel guide in the style of the writer Becky Lomax, Llama rapidly produced “a convincing rendition of Lomax’s local insider voice,” the complaint says. Then, when asked how it was able to reproduce Lomax’s style so accurately, Llama allegedly replied, “While I don’t have personal interactions with Becky Lomax, I’ve been trained on a vast amount of text data, including her published works.”

Llama is also able to summarise books in detail. When asked to give a synopsis of Turow’s Presumed Innocent, Llama confirmed that it had “been trained on a digital version of the book, which allows me to access and analyse its content,” according to the complaint.

Advertisement

In an email to the Times, Turow said Meta’s use of pirated works amounted to “shameless, damaging and unjust behaviour.”

“I find it distressing and infuriating that one of the top-10 richest corporations in the world knowingly used pirated copies of my books, and thousands of other authors, to train Llama, which can and has produced competing material, including works supposedly in my style,” Turow wrote.

Attorney-turned-author Scott Turow.AP

By producing “knock-offs and imitations” of authors’ works, Meta’s AI program could “dilute the overall market for literary works,” the plaintiffs argue.

“These outputs are similar enough to copyrighted works – in subject matter, plot details, sequencing of events, character names and traits, or other creative choices – that they replace the original work for many readers or consumers,” the complaint says.

Advertisement

The lawsuit is the latest effort by authors and publishers to rein in tech companies’ use of copyrighted works to train their large language models.

Writers have brought lawsuits against tech companies including OpenAI, Anthropic, Google and xAI for the companies’ unauthorised use of their work. Last northern autumn, Anthropic agreed to pay a $1.5 billion ($2 billion) settlement to writers whose books had been used to train its AI program.

(The Times has sued OpenAI and Microsoft, as well as Perplexity, accusing the companies of copyright infringement of news content related to AI systems. The companies have rebutted the claims.)

Authors have challenged Meta in court before. In June 2025, a judge ruled in Meta’s favour, finding that the plaintiffs had not presented enough evidence that Meta’s AI product would create “market dilution” by producing a flood of AI-generated books.

Advertisement

The lawsuit filed on Tuesday against Meta brought together trade and education publishers, academic publishers of scientific and medical journals and a bestselling author of legal thrillers. The plaintiffs are seeking an order requiring Meta to destroy all illegally acquired copies of works copyrighted by the plaintiffs that Meta used in AI training and to “cease all unlawful activities,” as well as requesting any “further relief as the court deems proper”.

“We’re focused on a much more sustainable AI landscape – something that’s transparent and fair and participatory and has guardrails against harm for authors and publishers,” said Maria A. Pallante, president and chief executive of the Association of American Publishers, a trade group that acts as a law and policy advocate for the book publishing industry. “The harm is already evident.”

The New York Times

The Booklist is a weekly newsletter for book lovers from Jason Steger. Get it delivered every Friday.

From our partners

Advertisement
Advertisement

Disclaimer : This story is auto aggregated by a computer programme and has not been created or edited by DOWNTHENEWS. Publisher: www.smh.com.au