OpenAI's "code red" response to Google's Gemini 3 Pro has arrived. On the same day the company announced a Sora licensing pact with Disney, it took the wraps off GPT-5.2. OpenAI is touting the new model as its best yet for real-world, professional use. "It's better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects," said OpenAI.
In a series of 10 benchmarks highlighted by OpenAI, GPT-5.2 Thinking, the most advanced version of the model, outperformed its GPT-5.1 counterpart, sometimes by a significant margin. For example, in AIME 2025, a test that involves 30 challenging mathematics problems, the model earned a perfect 100 percent score, beating out GPT-5.1's already state-of-the-art score of 94 perfect. It also achieved that feat without turning to tools like web search. Meanwhile, in ARC-AGI-1, a benchmark that tests an AI system's ability to reason abstractly like a human being would, the new system beat GPT-5.1's score by more than 10 percentage points.
OpenAI says GPT-5.2 Thinking is better at answering questions factually, with the company finding it produces errors 30 percent less frequently. "For professionals,
|