OpenAI Nears Release of ‘Strawberry’ Model, With Reasoning Capabilities

The startup has said AI with the power to reason represents a significant step in the technology’s progress.

Advanced Generative AI Tools as Major Tech Companies Urge Lawmakers to Avoid Heavy-handed Regulation — *Photographer: Andrey Rudakov/Bloomberg*

Gift this article

The model’s release, which has been rumored for months, comes as OpenAI is looking to raise billions in funding and faces heightened competition in the race to develop ever more sophisticated artificial intelligence systems. OpenAI isn’t the only company working on such capabilities; competitors Anthropic and Google have also touted “reasoning” skills with their advanced AI models.

OpenAI declined to comment.

The experience of using OpenAI’s updated AI system will differ somewhat from what people have come to expect with ChatGPT, the company’s chatbot. Before responding to a user’s prompt, the new software will pause for a matter of seconds while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response, the person said. This technique is sometimes referred to as “chain of thought” prompting. The Information previously reported some details of how Strawberry would process prompts.

Get the Singapore Edition newsletter in your inbox.

Go beyond the headlines with insights into one of Asia’s most dynamic economies. Delivered weekly.

Bloomberg may send me offers and promotions.

By submitting my information, I agree to the Privacy Policy and Terms of Service.

This approach could enable the technology to respond more accurately to prompts that currently bedevil ChatGPT and other chatbots. For instance, when asked whether the number 9.11 is larger than 9.9 — a question that may be simple for a human but isn’t always answered correctly even by state-of-the-art AI systems — the updated model was able to correctly determine that 9.9 is bigger, the person said.

During an all-hands meeting in July, OpenAI executives showed off a demonstration of the company’s most advanced AI system enhanced with new reasoning capabilities, Bloomberg previously reported. The product was able to answer several word problems that have stumped its models in the past and also solve an advanced chemistry problem.

OpenAI has been working to get computers to carry out multi-step actions for some time. In May 2023, for instance, the company released a blog post and an accompanying research paper about its efforts to improve AI systems’ abilities to solve math problems. According to the paper, the company trained a model by rewarding it for each correct step in the process toward coming up with an answer to a problem, rather than by just rewarding it for generating an accurate answer.

The topic is also something the company is increasingly addressing publicly. Noam Brown, a research scientist at OpenAI, is scheduled to speak about generative AI and multi-step “reasoning agents” at a TED AI event in San Francisco next month, according to the event’s website.

In this Article