Copyright fight besieges Openai

Chatbots ‘trained’ on authored works

2024-01-10 - By Matt O’brien

High-profile lawsuits in a New York federal court will test the future of CHATGPT and other artificial intelligence products that wouldn’t be so eloquent had they not ingested huge troves of copyrighted human works.

But are AI chatbots — in this case, commercialized products made by Openai and its business partner Microsoft — breaking copyright and fair competition laws? Professional writers and media outlets will face a difficult fight to win that argument in court.

“I would like to be optimistic on behalf of the authors, but I’m not. I just think they have an uphill battle here,” said copyright attorney Ashima Aggarwal, who used to work for academic publishing giant John Wiley & Sons.

One lawsuit comes from The New York Times. Another from a group of well-known novelists such as John Grisham, Jodi Picoult and George R.R. Martin. A third from bestselling nonfiction writers, including an author of the Pulitzer Prize-winning biography on which the hit movie “Oppenheimer” was based.

The lawsuits

Each of the lawsuits makes different allegations, but they all center on the San Francisco-based company Openai “building this product on the back of other peoples’ intellectual property,” said attorney Justin Nelson, who is representing the nonfiction writers and whose law firm is also representing The Times.

“What Openai is saying is that they have a free ride to take anybody else’s intellectual property really since the dawn of time, as long as it’s been on the internet,” Nelson said.

The Times sued in December, arguing that CHATGPT and Microsoft’s Copilot are competing with the same outlets they are trained on and diverting web traffic away from the newspaper and other copyright holders who depend on advertising revenue generated from their sites to keep producing their journalism. It also provided evidence of the chatbots spitting out Times articles wordfor-word. At other times the chatbots falsely attributed misinformation to the paper in a way it said damaged its reputation.

One senior federal judge is presiding over all three cases, as well as a fourth from two more nonfiction authors who filed another lawsuit last week. U.S. District Judge Sidney Stein has been at the Manhattan-based court since 1995 when he was nominated by then-president Bill Clinton.

The response

Openai and Microsoft haven’t yet filed formal counterarguments on the New York cases, but Openai made a public statement this week describing The Times lawsuit as “without merit” and saying that the chatbot’s ability to regurgitate some articles verbatim was a “rare bug.”

“Training AI models using publicly available internet materials is fair use, as supported by long-standing and widely accepted precedents,” a Monday blog post from the company said. It went on to suggest that the Times “either instructed the model to regurgitate or cherry-picked their examples from many attempts.”

Openai cited licensing agreements made last year with The Associated Press, the German media company Axel Springer and other organizations as offering a glimpse into how the company is trying to support a healthy news ecosystem. Openai is paying an undisclosed fee to license AP’S archive of news stories. The New York Times was engaged in similar talks before deciding to sue.

Michael Dwyer The Associated Press The Openai logo on a mobile phone in front of a computer screen displaying output from CHATGPT in Boston. Openai and business partner Microsoft face copyright suits.

Copyright fight besieges Openai

Chatbots ‘trained’ on authored works

The response

Newspapers in English

Newspapers from United States