OpenAI's deals with publishers could spell trouble for rivals

Kyle Wiggers

March 13, 2024 at 6:12 p.m.

OpenAI's legal battle with The New York Times over data to train its AI models might still be brewing. But OpenAI's forging ahead on deals with other publishers, including some of France's and Spain's largest news publishers.

OpenAI on Wednesday announced that it signed contracts with Le Monde and Prisa Media to bring French and Spanish news content to OpenAI's ChatGPT chatbot. In a blog post, OpenAI said that the partnerships will put the organizations' current events coverage -- from brands including El País, Cinco Días, As and El Huffpost -- in front of ChatGPT users where it makes sense, as well as contribute to OpenAI's ever-expanding volume of training data.

OpenAI writes:

Over the coming months, ChatGPT users will be able to interact with relevant news content from these publishers through select summaries with attribution and enhanced links to the original articles, giving users the ability to access additional information or related articles from their news sites ... We are continually making improvements to ChatGPT and are supporting the essential role of the news industry in delivering real-time, authoritative information to users.

So, OpenAI has revealed licensing deals with a handful of content providers at this point. Now feels like a good opportunity to take stock:

Stock media library Shutterstock (for images, videos and music training data)
The Associated Press
Axel Springer (owner of Politico and Business Insider, among others)
Le Monde
Prisa Media

How much is OpenAI paying each? Well, it's not saying -- at least not publicly. But we can estimate.

The Information reported in January that OpenAI was offering publishers between $1 million and $5 million a year to access archives to train its GenAI models. That doesn't tell us much about the Shutterstock partnership. But on the article licensing front -- assuming The Information's reporting is accurate, those figures haven't changed since then and the range applies to each of the four news publishers listed above (The Associated Press, Axel Springer, Le Monde and Prisa Media) -- OpenAI's shelling out between $4 million and $20 million a year for news.

That might be pennies to OpenAI, whose war chest sits at over $11 billion and whose annualized revenue recently topped $2 billion (per the Financial Times). But as Hunter Walk, a partner at Homebrew and the co-founder of Screendoor, recently mused, it's substantial enough to potentially edge out AI rivals also pursuing licensing agreements.

Walk writes on his blog:

[I]f experimentation is gated by nine figures worth of licensing deals, we are doing a disservice to innovation ... The checks being cut to 'owners' of training data are creating a huge barrier to entry for challengers. If Google, OpenAI, and other large tech companies can establish a high enough cost, they implicitly prevent future competition.

Now, whether there's a barrier to entry today is debatable. Many -- if not most -- AI vendors have chosen to risk the wrath of IP holders, opting not to license the data on which they're training AI models. There's evidence that art-generating platform Midjourney, for example, is training on Disney movie stills -- and Midjourney has no deal with Disney.

The tougher question to wrestle with is: Should licensing simply be the cost of doing business and experimentation in the AI space?

Walk would argue not. He advocates for a regulator-imposed "safe harbor" that'd protect any AI vendor -- as well as small-time startups and researchers -- from legal liability so long as they abide by certain transparency and ethical standards.

Interestingly, the U.K. recently tried to codify something along those lines, exempting the use of text and data mining for AI training from copyright considerations so long as it's for research purposes. But those efforts ended up falling through.

Me, I'm not sure I'd go as far as Walk in his "safe harbor" proposal considering the impact AI threatens to have on an already-destabilized news industry. A recent model from The Atlantic found that if a search engine like Google were to integrate AI into search, it’d answer a user’s query 75% of the time without requiring a click-through to the publisher's website.

But perhaps there is room for carve-outs.

Publishers should be paid -- and paid fairly. Is there not an outcome, though, in which they're paid and challengers to AI incumbents -- as well as academics -- get access to the same data as those incumbents? I should think so. Grants are one way. Larger VC checks are another.

I can't say I have the solution, particularly given that the courts have yet to decide whether -- and to what extent -- fair use shields AI vendors from copyright claims. But it's vital we tease these things out. Otherwise, the industry could well end up in a situation where academic "brain drain" continues unabated and only a few powerful companies have access to vast pools of valuable training data.
