San Francisco, July 10: Comedian and author Sarah Silverman, along with authors Christopher Golden and Richard Kadrey, have sued Sam Altman-run OpenAI and Mark Zuckerberg-owned Meta over dual claims of copyright infringement. The lawsuits alleged that OpenAI's ChatGPT and Meta's LLaMA (a set of large language models) were trained on illegally-acquired datasets containing their works. ChatGPT: Over 40% of UK Universities Probing Students for Using AI Chatbots To Cheat in Exam.
Their works were allegedly acquired from "shadow library" websites like Bibliotik, Library Genesis, Z-Library, and others, noting the books are "available in bulk via torrent systems," reports The Verge.
"When ChatGPT is prompted, ChatGPT generates summaries of plaintiffs’ copyrighted works -- something only possible if ChatGPT was trained on Plaintiffs' copyrighted works," the lawsuit claimed.
The lawsuit alleged that the chatbot never bothered to "reproduce any of the copyright management information Plaintiffs included with their published works."
In a separate lawsuit against Meta, it alleged the authors' books were accessible in datasets Meta used to train its LLaMA models.
"Many of Plaintiffs' copyrighted books appear in the dataset that Meta has admitted to using to train LLaMA," it read.
Silverman owns a registered copyright in one book, called The Bedwetter while Golden owns registered copyrights in several books, including Ararat. Kadrey owns registered copyrights in several books, including 'Sandman Slim'.
In both lawsuits, the authors said that they "did not consent to the use of their copyrighted books as training material" for the companies' AI models.
Each lawsuit contains six counts of copyright violations, negligence, unjust enrichment, and unfair competition.
The authors are looking for statutory damages, restitution of profits, and more. Meta or OpenAI did not comment on the lawsuits. ChatGPT ‘Browse With Bing’ Beta Feature Disabled by OpenAI.
The suits allege, among other things, that OpenAI’s ChatGPT and Meta’s LLaMA were trained on illegally-acquired datasets containing their works, which they say were acquired from "shadow library" websites like Bibliotik, Library Genesis, Z-Library, and others, noting the books are "available in bulk via torrent systems".
(The above story first appeared on LatestLY on Jul 10, 2023 11:25 AM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).