Why is Sarah Silverman Suing OpenAI and Meta?

WHAT ARE THE LAWSUITS ABOUT?

Comedian Sarah Silverman and other authors claim that ChatGPT and LLaMA (Meta’s AI generator) were “trained” on their copyrighted books without consent or compensation. Generative AI models like ChatGPT known as Large Language Models (LLMs) are designed to mimic sets of data that they are fed. LLMs produce full sentences and paragraphs that are similar to human language because they are trained to continuously adjust their outputs to resemble sequences of words copied from a training dataset. Silverman’s complaint alleges that LLMs are committing infringement by feeding copies of her works into the AI application for such “training.”

The LLMs are trained using books because they are a great source of long-form, high-quality written language. Silverman’s lawsuit alleges that OpenAI, which generally refuses to reveal its training datasets, has scraped databases of torrented books to train its LLMs. The authors allege that these companies have copied their content without permission or compensation. This, they maintain, is theft.

IS AI TRAINING FAIR USE?

The unauthorized ingestion of copyrighted material into the LLMs by the AI companies likely constitutes copyright infringement. However, these companies may argue that their conduct is fair use. Fair use is a defense to copyright infringement. Section 107 of the Copyright Act directs courts to consider at least four factors when evaluating a fair use defense:

the purpose and character of the use, including whether the use is of a commercial nature or is for nonprofit educational purposes;

the nature of the copyrighted work;

the amount and substantiality of the portion used in relation to the copyrighted work as a whole; and

the effect of the use upon the potential market for or value of the copyrighted work.

The AI companies have plausible, but rebuttable arguments on each of these factors. First, although the companies are using the work in a transformative way by ingesting it as a series of data points to inform unrelated output, the use is still commercial since AI products are sold for profit. Second, AI treats the input work as factual bits of data, but many of these works are creative in nature regardless of AI’s treatment of it. Third, it will be hard for AI companies to argue they have used only a small portion of each work, as AI can often generate book summaries or accurate writings in the style of a particular author, which would require the digestion of an entire body of work. Fourth, since AI rarely reproduces a work exactly, it is unlikely that AI outputs compete directly with a copyrighted work, but creators may argue that AI is a substitute for their creative efforts in markets in which their copyrighted work is sold.

WHAT SHOULD CREATORS DO NOW?

If your work has already been used to train AI, it cannot be undone. It is impossible to disentangle a single work from the neural network of an LLM. Furthermore, if your work is available online in any form, it is likely hard to protect it from being scraped and used for AI training. With this understanding, some artists are seeking ways to be compensated for their work being used by AI. From musician Grimes to the New York Times, some writers and creators have accepted that AI will inevitably make use of their work and are exploring ways to license or sell the use of their content and likeness.

Regardless of the results of the lawsuits: always register copyrights for your work. Registration is a relatively simple and inexpensive process. Although your work is automatically copyrighted when it’s finished in a tangible form, registration ensures that all remedies for infringement are available to you. It also provides notice to others that you own the work.

CONCLUSION

Creators, including comedians, should consult with an attorney experienced intellectual property law and comedy law to understand how emerging technologies like AI affect their proprietary rights. Contact us to speak with a member of our team.

Contributions to this blog by Gabriella Epley

Nov 25, 2024 | Business | Copyright | Entertainment | Technology From Retro to Right Now: The Legality Video Game Emulators in Modern Gaming

Video game emulators have revolutionized how we access and enjoy classic games. Essentially, an emulator is a piece of software that mimics the original hardware of a gaming console, allowing games designed for that console to be played on other devices such as computers or smartphones. This technology not only provides a cost-effective alternative to

drawing of Uma Thurman and John Travolta dancing from the movie Pulp Fiction

“If my answers frighten you, Vincent, then you should cease asking scary questions.” – Jules Winnfield in Pulp Fiction Scary questions are always in the script when the law collides with new technology. And Non-fungible tokens (NFTs) pose novel legal challenges. Our closeup is on a recent legal dispute between Quentin Tarantino and Miramax regarding

Dec 11, 2023 | Copyright | Entertainment | Technology | Trademark Digital Doppelgängers: Why AI-Generated Deepfakes Can Create Legal Trouble

Celebrities routinely sponsor products in advertisements. However, there has been a recent spate of unauthorized AI-generated versions of celebrities to hawk products. These unauthorized deepfake ads violate the celebrity’s right of publicity but also create other legal problems, too. Right of Publicity The “right of publicity” is the right for a person to control their

Why is Sarah Silverman Suing OpenAI and Meta?

WHAT ARE THE LAWSUITS ABOUT?

IS AI TRAINING FAIR USE?

WHAT SHOULD CREATORS DO NOW?

CONCLUSION

Photo by Michel Grolet on Unsplash

Schedule an appointment for a case evaluation

Call us Today

Social Links

Schedule an appointment for a case evaluation