News

OpenAI's mid April announcements include its most advanced reasoning models o3 and o4 mini, with a biorisk monitor, the ...
On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities ...
OpenAI’s o3 and o4-mini models are available now to ChatGPT Plus, Pro, and Team users. Enterprise and education users will ...
OpenAI has launched its advanced AI models, o3 and o4-mini, enhancing reasoning and problem-solving capabilities. The o3 ...
OpenAI has finally released the full o3 reasoning model along with o4-mini. New models can use multiple tools inside ChatGPT ...
Discover OpenAI’s O3 & O4 Mini, the groundbreaking AI models excelling in reasoning, tool usage, and cost efficiency. Learn ...
Benchmark performance results typically accompany ... or graduate-level reasoning (GPQA). To fill that gap, OpenAI launched the OpenAI Pioneers Program, intended to advance AI model development ...
According to data from Artificial Analysis, a third-party AI testing outfit, it costs $2,767.05 to evaluate OpenAI's o1 reasoning model across a suite of seven popular AI benchmarks: MMLU-Pro ...
OpenAI thinks AI benchmarks are broken. Now the company is launching a program to fix how AI models are scored. The new OpenAI Pioneers Program will focus on creating evaluations for AI models ...
Described as the company's “smartest models to date,” they can agentically use and combine every tool within ChatGPT, such as ...
OpenAI says o3 has set new records across several popular AI performance benchmarks. One of them is ... o3 makes 20 percent fewer major errors than OpenAI o1 on difficult, real-world tasks ...
OpenAI, like many AI labs, thinks AI benchmarks are broken. It says it wants to fix them through a new program. Called the OpenAI Pioneers Program, the program will focus on creating evaluations ...