Gen3 - Hub for Developers, Founders & Everyone. Based in Taiwan

OpenAI Releases New Generation Large Model o1, Featuring Reasoning Abilities and Enhanced Performance in Math and Coding


September 13, 2024

This article is translated by ChatGPT.

OpenAI Announces o1, a Reasoning-Powered AI Model

OpenAI has announced the release of o1, an AI model with reasoning capabilities, internally codenamed "Strawberry." OpenAI o1 can reason through complex tasks and solve harder problems in science, coding, and mathematics than previous models could.

In tests, OpenAI o1 performed on par with PhD students on challenging benchmark tasks in physics, chemistry, and biology. It also excelled in mathematics and coding: on a qualifying exam for the International Mathematics Olympiad (IMO), o1 scored 83%, while GPT-4o correctly solved only 13% of the problems. In Codeforces competitions, o1's coding ability reached the 89th percentile.

As an early model, OpenAI o1 lacks many of the useful features found in ChatGPT, such as browsing the web for information and uploading files and images. For many common use cases, GPT-4o is expected to remain the more capable option in the short term. For complex reasoning tasks, however, o1 marks a significant leap forward and represents a new level of AI capability. For this reason, OpenAI reset the counter to 1 and named the series OpenAI o1.

Healthcare researchers can use o1 to annotate cell sequencing data, physicists can use it to generate complex mathematical formulas needed for quantum optics, and developers across fields can use it to build and execute multi-step workflows.

OpenAI has also released OpenAI o1-mini, a cost-effective reasoning model. o1-mini excels in STEM fields, particularly math and coding, performing nearly on par with OpenAI o1 on evaluation benchmarks such as AIME and Codeforces. OpenAI expects o1-mini to be a faster, cheaper option for applications that require reasoning without extensive world knowledge; it is 80% cheaper than o1-preview. ChatGPT Plus, Team, Enterprise, and Edu users can use o1-mini as an alternative to o1-preview, with higher rate limits and lower latency.

Source

1. Disclaimer: The views expressed are solely those of the author and do not reflect the stance of Gen3. They are not intended as investment advice.