OpenAI launches an API for ChatGPT, plus dedicated capacity for enterprise customers

Kyle Wiggers

Updated March 1, 2023 at 4:54 p.m.·7 min read

To call ChatGPT, the free text-generating AI developed by San Francisco-based startup OpenAI, a hit is a massive understatement.

As of December, ChatGPT had an estimated more than 100 million monthly active users. It's attracted major media attention and spawned countless memes on social media. It's been used to write hundreds of e-books in Amazon's Kindle store. And it's credited with co-authoring at least one scientific paper.

But OpenAI, being a business -- albeit a capped-profit one -- had to monetize ChatGPT somehow, lest investors get antsy. It took a step toward this with the launch of a premium service, ChatGPT Plus, in February. And it made a bigger move today, introducing an API that'll allow any business to build ChatGPT tech into their apps, websites, products and services.

An API was always the plan. That's according to Greg Brockman, the president and chairman of OpenAI (and also one of the co-founders). He spoke with me yesterday afternoon via a video call ahead of the launch of the ChatGPT API.

"It takes us a while to get these APIs to a certain quality level," Brockman said. "I think it's kind of this, like, just being able to meet the demand and the scale."

Brockman says the ChatGPT API is powered by the same AI model behind OpenAI's wildly popular ChatGPT, dubbed "gpt-3.5-turbo." GPT-3.5 is the most powerful text-generating model OpenAI offers today through its API suite; the "turbo" moniker refers to an optimized, more responsive version of GPT-3.5 that OpenAI's been quietly testing for ChatGPT.

Priced at $0.002 per 1,000 tokens, or about 750 words, Brockman claims that the API can drive a range of experiences, including "non-chat" applications. Snap, Quizlet, Instacart and Shopify are among the early adopters.

The initial motivation behind developing gpt-3.5-turbo might've been to cut down on ChatGPT's gargantuan compute costs. OpenAI CEO Sam Altman once called ChatGPT’s expenses “eye-watering,” estimating them at a few cents per chat in compute costs. (With over a million users, that presumably adds up quickly.)

But Brockman says that gpt-3.5-turbo is improved in other ways.

"If you're building an AI-powered tutor, you never want the tutor to just give an answer to the student. You want it to always explain it and help them learn -- that's an example of the kind of system you should be able to build [with the API]," Brockman said. "We think this is going to be something that will just, like, make the API much more usable and accessible."

The ChatGPT API underpins My AI, Snap's recently announced chatbot for Snapchat+ subscribers, and Quizlet's new Q-Chat virtual tutor feature. Shopify used the ChatGPT API to build a personalized assistant for shopping recommendations, while Instacart leveraged it to create Ask Instacart, an upcoming toll that'll allow Instacart customers to ask about food and get "shoppable" answers informed by product data from the company's retail partners.

“Grocery shopping can require a big mental load, with a lot of factors at play, like budget, health and nutrition, personal tastes, seasonality, culinary skills, prep time, and recipe inspiration," Instacart chief architect JJ Zhuang told me via email. "What if AI could take on that mental load, and we could help the household leaders who are commonly responsible for grocery shopping, meal planning, and putting food on the table -- and actually make grocery shopping truly fun? Instacart’s AI system, when integrated with OpenAI’s ChatGPT, will enable us to do exactly that, and we’re thrilled to start experimenting with what’s possible in the Instacart app.”

Image Credits: Instacart

Those who've been closely following the ChatGPT saga, though, might be wondering if it's ripe for release -- and rightly so.

Early on, users were able to prompt ChatGPT to answer questions in racist and sexist ways, a reflection of the biased data on which ChatGPT was initially trained. (ChatGPT's training data includes a broad swath of internet content, namely e-books, Reddit posts and Wikipedia articles.) ChatGPT also invents facts without disclosing that it's doing so, a phenomenon in AI known as hallucination.

ChatGPT -- and systems like it -- are susceptible to prompt-based attacks as well, or malicious adversarial prompts that get them to perform tasks that weren’t a part of their original objectives. Entire communities on Reddit have formed around finding ways to "jailbreak" ChatGPT and bypass any safeguards that OpenAI put in place. In one of the less offensive examples, a staffer at startup Scale AI was able to get ChatGPT to divulge information about its inner technical workings.

Brands, no doubt, wouldn't want to be caught in the crosshairs. Brockman is adamant they won't be. Why so? One reason, he says, is continued improvements on the back end -- in some cases at the expense of Kenyan contract workers. But Brockman emphasized a new (and decidedly less controversial) approach that OpenAI calls Chat Markup Language, or ChatML. ChatML feeds text to the ChatGPT API as a sequence of messages together with metadata. That's as opposed to the standard ChatGPT, which consumes raw text represented as a series of tokens. (The word "fantastic" would be split into the tokens "fan," “tas" and "tic," for example.)

For example, given the prompt "What are some interesting party ideas for my 30th birthday?" a developer can choose to append that prompt with an additional prompt like "You are a fun conversational chatbot designed to help users with the questions they ask. You should answer truthfully and in a fun way!" or "You are a bot" before having the ChatGPT API process it. These instructions help to better tailor -- and filter -- the ChatGPT model's responses, according to Brockman.

"We're moving to a higher-level API. If you have a more structured way of representing input to the system, where you say, 'this is from the developer' or 'this is from the user' ... I should expect that, as a developer, you actually can be more robust [using ChatML] against these kinds of prompt attacks," Brockman said.

Another change that'll (hopefully) prevent unintended ChatGPT behavior is more frequent model updates. With the release of gpt-3.5-turbo, developers will by default be automatically upgraded to OpenAI's latest stable model, Brockman says, starting with gpt-3.5-turbo-0301 (released today). Developers will have the option to remain with an older model if they so choose, though, which might somewhat negate the benefit.

Whether they opt to update to the newest model or not, Brockman notes that some customers -- mainly large enterprises with correspondingly large budgets -- will have deeper control over system performance with the introduction of dedicated capacity plans. First detailed in documentation leaked earlier this month, OpenAI's dedicated capacity plans, launched today, let customers pay for an allocation of compute infrastructure to run an OpenAI model -- for example, gpt-3.5-turbo. (It's Azure on the back end, by the way.)

In addition to "full control" over the instance's load -- normally, calls to the OpenAI API happen on shared compute resources -- dedicated capacity gives customers the ability to enable features such as longer context limits. Context limits refer to the text that the model considers before generating additional text; longer context limits allow the model to "remember" more text essentially. While higher context limits might not solve all the bias and toxicity issues, they could lead models like gpt-3.5-turbo to hallucinate less.

Brockman says that dedicated capacity customers can expect gpt-3.5-turbo models with up to a 16k context window, meaning they can take in four times as many tokens as the standard ChatGPT model. That might let someone paste in pages and pages of tax code and get reasonable answers from the model, say -- a feat that's not possible today.

Brockman alluded to a general release in the future, but not anytime soon.

"The context windows are starting to creep up, and part of the reason that we're dedicated-capacity-customers-only right now is because there's a lot of performance tradeoffs on our side," Brockman said. "We might eventually be able to offer an on-demand version of the same thing."

Given OpenAI's increasing pressure to turn a profit after a multibillion-dollar investment from Microsoft, that wouldn't be terribly surprising.

BBC
Three ways Trump is trying to end the Harris honeymoon
Kamala Harris is riding a wave of momentum, but Republicans sense vulnerabilities they can exploit.
Business Insider
North Korea's economy is booming thanks to its arms trade with Russia
North Korea's GDP grew 3.1% in real terms, snapping a three-year slump, the Bank of Korea reported.
The Canadian Press
Two former FBI officials settle lawsuits with Justice Department over leaked text messages
WASHINGTON (AP) — Two former FBI officials settled lawsuits with the Justice Department on Friday, resolving claims that their privacy rights were violated when the department leaked to the news media text messages that they had sent one another that disparaged former President Donald Trump.
People
Team USA Flagbearer Coco Gauff Beams with Pride During Olympics Opening Ceremony: ‘Truly No Words’
The tennis champ made history as the youngest American flagbearer on Paris' Seine River
The Canadian Press
FBI says Trump was indeed struck by bullet during assassination attempt
WASHINGTON (AP) — Nearly two weeks after Donald Trump’s near assassination, the FBI confirmed Friday that it was indeed a bullet that struck the former president’s ear, moving to clear up conflicting accounts about what caused the former president’s injuries after a gunman opened fire at a Pennsylvania rally.
Deadline
2024 Premiere Dates For New & Returning Series On Broadcast, Cable & Streaming
Midseason is the new fall. As Hollywood and the broader industry continue to recover from the debilitating dual actors and writers strikes, the 2024 television landscape is coming into focus. All of the broadcast networks have set return dates for most of their shows, but there’s no usual Premiere Week to speak of. But as …
The Weather Network
Tornado watch issued for northwest Ontario on Friday evening
A tornado watch is in effect for northwestern Ontario on Friday evening
The Guardian
‘I didn’t say I’m leaving’: Pep Guardiola could extend his Manchester City stay
Pep Guardiola has played down comments he made after winning the title in May, hinting that he could still extend his deal beyond next summer
The Independent
Trump says he accepts FBI ‘apology’ over assassin’s bullet claims and slams Harris at Florida speech
Agency confirmed Friday that Trump was wounded by bullet shot from rifle
WFTS-Tampa
Former ballerina who allegedly murdered her husband testifies in court
Ashley Benefield took the stand on Day 4 to share her side of the story. She claims she shot Doug Benefield because she feared for her life.
CBC
Councillor calls for charges after death of cyclist in Yorkville
A Toronto city councillor says she'd like to see criminal charges laid in the death of a 24-year-old female cyclist in Yorkville this week.Coun. Dianne Saxe, who represents Ward 11, University-Rosedale, said on Friday that a construction bin was placed illegally in the middle of a bike lane in front of 150 Bloor Avenue W., before the cyclist was killed Thursday. Saxe said the bin blocked the bike lane.Saxe says a general contractor is working at the address and she wants to see the contractor an
KNXV - Phoenix Scripps
Pinal County officials prepare for Arizona's Primary Election
Pinal County election officials are preparing for the upcoming primary election.
KGTV - San Diego Scripps
Preventing summer learning loss
ABC 10News is taking a look at preventing summer learning loss.
Miami Herald
South Florida benefited from Biden’s rich infrastructure legacy | Opinion
Is Florida’s infrastructure future secure without Biden?
The Daily Beast
J.D. Vance ‘Couch’ Story Finally Makes Appearance on Fox
The embarrassing gossip that Republican vice presidential candidate J.D. Vance had sex with a couch was alluded to in passing on Fox News Friday night for the first time, making for a bit of an amusing—if not awkward—moment on the right-wing channel.Vance’s rollout as Donald Trump’s running mate has been largely viewed as less than ideal, thanks in part to his controversial comments about women and voting. In addition, scores of memes have imagined Vance’s relationship with furniture, as a resul
People
No, Megan Fox Is Not Pregnant — Despite What It Looks Like in That MGK and Jelly Roll Music Video
Social media theorizing ramped up after Megan Fox's appearance in Machine Gun Kelly and Jelly Roll's music video for 'Lonely Road'
KGTV - San Diego Scripps
Keeping Downtown San Diego clean during Comic-Con
Day two of Comic-Con is in full swing, as organizers expect 130-thousand people to visit this weekend. That creates a huge job for the people trying to keep downtown clean.
The Canadian Press
Phillies deal outfielder Pache, reliever Domínguez to Baltimore for 2023 All-Star outfielder Hays
PHILADELPHIA (AP) — The Philadelphia Phillies acquired outfielder Austin Hays from the Baltimore Orioles on Friday in exchange for right-handed pitcher Seranthony Domínguez and outfielder Cristian Pache in a deal between the East Division leaders in both leagues.
The Canadian Press
A tanker plane crash has killed a firefighting pilot in Oregon as Western wildfires spread
Communities in the U.S. West and Canada were under siege from raging wildfires on Friday, as a fast-moving blaze sparked by lightning sent people fleeing on fire-ringed roads in rural Idaho and a human-caused inferno forced the evacuation of hundreds of homes in northern California.
BBC
She conquered Everest 10 times - and escaped an abusive marriage
Lhakpa Sherpa, who has climbed Everest more than any other woman, wants to inspire women and girls.

Latest Stories