Skip to main content
BREAKING6d ago

Meta sued by major book publishers over copyright infringement

Macmillan, McGraw-Hill, Cengage, and others claim Meta carried out ‘one of the most massive infringements of copyrighted materials in history.’ Macmi

2 min read96 views3 tags
Originally reported bytheverge
Macmillan, McGraw-Hill, Cengage, and others claim Meta carried out ‘one of the most massive infringements of copyrighted materials in history.’ Macmillan, McGraw-Hill, Cengage, and others claim Meta carried out ‘one of the most massive infringements of copyrighted materials in history.’ Meta is facing a class action lawsuit filed by five major book publishers and one author over claims the company “engaged in one of the most massive infringements of copyrighted materials in history” when training its Llama AI models, asreported earlier byThe New York Times. Intheir suit, Macmillan, McGraw-Hill, Elsevier, Hachette, Cengage, and author Scott Turow allege that Meta “repeatedly copied” their books and journal articles without permission. The lawsuit accuses Meta of knowingly ripping copyrighted work from “notorious pirate sites,” such as LibGen, Anna’s Archive, Sci-Hub, Sci-Mag, and others, and then feeding that material into its AI model. It also claims that Meta trained Llama with information inside the Common Crawl dataset, which is allegedly “full of unauthorized copies of copyrighted works.” As a result, Llama “outputs verbatim and near-verbatim substitutes” of copyrighted material: For example, when prompted with two brief sentences from Cengage’s best-selling textbook, Calculus: Early Transcendentals, 9th edition, by James Stewart, Llama begins reproducing word-for-word the continuation of the section. Several authorshavealready suedMeta for alleged copyright infringement, whichbrought to light the company’s internal discussionsabout how to handle “media coverage suggesting we have used a dataset we know to be pirated.” Last year,a federal judge ruled in favorof Meta in one of these lawsuits, though he pointed out that his ruling “does not stand for the proposition that Meta’s use of copyrighted materials to train its language models is lawful.” A group of authors also sued Anthropic over copyright infringement. While a federal judge ruled thattraining AI models on legally purchased bookswithout permission is considered fair use, he allowed the authors to move forward with a class action lawsuit over the “millions” of works Anthropic allegedly pirated.Anthropic agreed to pay writers$1.5 billion last year to settle the class action lawsuit. Turow and the group of publishers are suing Meta for damages, and ask that the court order the company to block its allegedly unlawful activities. They also ask the court to require the company to provide a list of books, journal articles, and other copyrighted works that it trained its Llama AI models on. “AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use,” Meta spokesperson Dave Arnold said in an emailed statement toThe Verge.“We will fight this lawsuit aggressively.” A free daily digest of the news that matters most. This is the title for the native ad
ES
Editorial StaffEditor

The Editorial Staff at AIChief is a team of professional content writers with extensive experience in AI and marketing. Founded in 2025, AIChief has quickly grown into the largest free AI resource hub in the industry.

View all posts
Reader feedback

What did you think of this story?

User Comments

Filter:
No comments yet. Be the first to comment!
Continue reading
View all news