US judge allows using pirated books to train AI

US judge allows using pirated books to train AI

A ruling by a US judge that Anthropic, led by CEO Dario Amodei, did not violate copyright law by training its artificial intelligence on pirated books could be cited by other firms defending themselves in similar lawsuits filed by authors
A ruling by a US judge that Anthropic, led by CEO Dario Amodei, did not violate copyright law by training its artificial intelligence on pirated books could be cited by other firms defending themselves in similar lawsuits filed by authors. Photo: FABRICE COFFRINI / AFP
Source: AFP

Ghana’s top stories, now easier to find. Discover our new search feature!

A federal judge has sided with the AI company Anthropic in its practice of training a chatbot on copyrighted books without permission from the authors.

In a decision with the potential to set legal precedent, District Court Judge William Asup ruled on Monday that Anthropic's training of its artificial intelligence creation Claude with millions of pirated books was allowed under a "fair use" doctrine in a law called the Copyright Act.

"Use of the books at issue to train Claude and its precursors was exceedingly transformative and was a fair use," Alsup wrote in his decision.

Tremendous amounts of data are needed to train large language models powering generative AI.

Musicians, book authors, visual artists and news publications have sued AI companies that used their data without permission or payment.

Alsup's decision in favor of Anthropic is a first in the United States and could be sited in other cases as a legal precedent by AI firms defending themselves in court.

AI companies generally defend their practices by claiming fair use, arguing that training AI on large data sets fundamentally changes the original content and is necessary for innovation.

Though most of these lawsuits are still in early stages, their outcomes could have a profound effect on the shape of the AI industry.

Along with downloading for free millions of books from websites offering pirated works, Anthropic bought copyrighted books, scanned the pages and stored them in digital format, according to court documents.

Anthropic's aim was to amass a library of "all the books in the world", training AI models on content as deemed fit, the judge said in his ruling.

"Anthropic had no entitlement to use pirated copies for its central library," Alsup ruled, ordering a trial on that portion of the copyright lawsuit filed by authors to determine damages.

Anthropic, valued at $61.5 billion and heavily backed by Amazon, was founded in 2021 by former executives from OpenAI, the creator of ChatGPT.

The company, known for its Claude chatbot and AI models, bills itself as focused on AI safety and responsible development.

Anthropic did not immediately reply to a request for comment.

New feature: Сheck out news that is picked for YOU ➡️ click on “Recommended for you” and enjoy!

Source: AFP

Authors:
AFP avatar

AFP AFP text, photo, graphic, audio or video material shall not be published, broadcast, rewritten for broadcast or publication or redistributed directly or indirectly in any medium. AFP news material may not be stored in whole or in part in a computer or otherwise except for personal and non-commercial use. AFP will not be held liable for any delays, inaccuracies, errors or omissions in any AFP news material or in transmission or delivery of all or any part thereof or for any damages whatsoever. As a newswire service, AFP does not obtain releases from subjects, individuals, groups or entities contained in its photographs, videos, graphics or quoted in its texts. Further, no clearance is obtained from the owners of any trademarks or copyrighted materials whose marks and materials are included in AFP material. Therefore you will be solely responsible for obtaining any and all necessary releases from whatever individuals and/or entities necessary for any uses of AFP material.

OSZAR »