Episode 2784 - September 13 - Tiếng Anh - OpenAI công bố mô hình AI mới, OpenAI o1 - Vina Technology at AI time - Lê Quang Văn | Podcast - Nhac.vn

Episode 2784 - September 13 - Tiếng Anh - OpenAI công bố mô hình AI mới, OpenAI o1 - Vina Technology at AI time
13 Thg09, 24

OpenAI Announces a New AI Model, Code-Named Strawberry

1 - Introducing OpenAI o1-preview

A new series of reasoning models for solving hard problems. Available starting September12.

We've developed a new series of AI models designed to spend more time thinking before they respond. They can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

Today, we are releasing the first of this series in ChatGPT and our API. This is a preview and we expect regular updates and improvements. Alongside this release, we’re also including evaluations for the next update, currently in development.

How it works

We trained these models to spend more time thinking through problems before they respond, much like a person would. Through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.

In our tests, the next model update performs similarly to PhD students on challenging benchmark tasks in physics, chemistry, and biology. We also found that it excels in math and coding. In a qualifying exam for the International Mathematics Olympiad (IMO), GPT-4o correctly solved only 13% of problems, while the reasoning model scored 83%. Their coding abilities were evaluated in contests and reached the 89th percentile in Codeforces competitions. You can read more about this in our technical research post.

As an early model, it doesn't yet have many of the features that make ChatGPT useful, like browsing the web for information and uploading files and images. For many common cases GPT-4o will be more capable in the near term.

But for complex reasoning tasks this is a significant advancement and represents a new level of AI capability. Given this, we are resetting the counter back to 1 and naming this series OpenAI o1.

Safety

As part of developing these new models, we have come up with a new safety training approach that harnesses their reasoning capabilities to make them adhere to safety and alignment guidelines. By being able to reason about our safety rules in context, it can apply them more effectively.

One way we measure safety is by testing how well our model continues to follow its safety rules if a user tries to bypass them (known as "jailbreaking"). On one of our hardest jailbreaking tests, GPT-4o scored 22 (on a scale of 0-100) while our o1-preview model scored 84. You can read more about this in the system card and our research post.

To match the new capabilities of these models, we’ve bolstered our safety work, internal governance, and federal government collaboration. This includes rigorous testing and evaluations using our Preparedness Framework, best-in-class red teaming, and board-level review processes, including by our Safety & Security Committee.

To advance our commitment to AI safety, we recently formalized agreements with the U.S. and U.K. AI Safety Institutes. We've begun operationalizing these agreements, including granting the institutes early access to a research version of this model. This was an important first step in our partnership, helping to establish a process for research, evaluation, and testing of future models prior to and following their public release.

Whom it’s for

These enhanced reasoning capabilities may be particularly useful if you’re tackling complex problems in science, coding, math, and similar fields. For example, o1 can be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers in all fields to build and execute multi-step workflows.

OpenAI o1-mini

The o1 series excels at accurately generating and debugging complex code. To offer a more efficient solution for developers, we’re also releasing OpenAI o1-mini, a faster, cheaper reasoning model that is particularly effective at coding. As a smaller model, o1-mini is 80% cheaper than o1-preview, making it a powerful, cost-effective model for

Bình luận
Danh sách
Episode 2826 - September 18 - Tiếng Anh - Trung tâm dữ liệu - Vina Technology at AI time
Episode 2826 - September 18 - Tiếng Anh - Trung tâm dữ liệu - Vina Technology at AI time
18 Thg09, 24 • 12ph
Episode 2825 - September 18 - Trung tâm dữ liệu – Vina Technology at AI time
Episode 2825 - September 18 - Trung tâm dữ liệu – Vina Technology at AI time
18 Thg09, 24 • 11ph
Episode 2824 - September 18 - Tiếng Anh - Cuộc sống kỹ thuật số của chúng ta – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
Episode 2824 - September 18 - Tiếng Anh - Cuộc sống kỹ thuật số của chúng ta – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
18 Thg09, 24 • 10ph
Episode 2823 - September 18 - Cuộc sống kỹ thuật số của chúng ta – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
Episode 2823 - September 18 - Cuộc sống kỹ thuật số của chúng ta – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
18 Thg09, 24 • 12ph
Episode 2822 - September 18 - Tin kinh doanh - Ngày 17 tháng 9, 2024 – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
Episode 2822 - September 18 - Tin kinh doanh - Ngày 17 tháng 9, 2024 – Ngày 17 tháng 9, 2024 - Vina Technology at AI time
18 Thg09, 24 • 12ph
Episode 2821 - September 18 - Phần 3 của 6 - Xử lý Ngôn ngữ Tự nhiên - Vina Technology at AI time
Episode 2821 - September 18 - Phần 3 của 6 - Xử lý Ngôn ngữ Tự nhiên - Vina Technology at AI time
18 Thg09, 24 • 10ph
Episode 2820 - September 18 - Tiếng Anh - AI có thể làm cho cuộc sống tốt hơn - Vina Technology at AI time
Episode 2820 - September 18 - Tiếng Anh - AI có thể làm cho cuộc sống tốt hơn - Vina Technology at AI time
18 Thg09, 24 • 12ph
Episode 2819 - September 18 - AI có thể làm cho cuộc sống tốt hơn - Vina Technology at AI time
Episode 2819 - September 18 - AI có thể làm cho cuộc sống tốt hơn - Vina Technology at AI time
18 Thg09, 24 • 11ph
Episode 2818 - September 17 - Tiếng Anh - Phần 2 của 2 - Larry Page - Vina Technology at AI time
Episode 2818 - September 17 - Tiếng Anh - Phần 2 của 2 - Larry Page - Vina Technology at AI time
17 Thg09, 24 • 10ph
Episode 2817 - September 17 - Phần 2 của 2 - Larry Page - Vina Technology at AI time
Episode 2817 - September 17 - Phần 2 của 2 - Larry Page - Vina Technology at AI time
17 Thg09, 24 • 12ph
Episode 2816 - September 17 - Tiếng Anh - Phần 1 của 2 - Larry Page - Vina Technology at AI time
Episode 2816 - September 17 - Tiếng Anh - Phần 1 của 2 - Larry Page - Vina Technology at AI time
17 Thg09, 24 • 10ph
Episode 2815 - September 17 - Phần 1 của 2 - Larry Page - Vina Technology at AI time - Larry Page
Episode 2815 - September 17 - Phần 1 của 2 - Larry Page - Vina Technology at AI time - Larry Page
17 Thg09, 24 • 11ph
Episode 2814 - September 17 - Tiếng Anh - Trường hợp ở New York - Vina Technology at AI time
Episode 2814 - September 17 - Tiếng Anh - Trường hợp ở New York - Vina Technology at AI time
17 Thg09, 24 • 10ph
Episode 2813 - September 17 - Trường hợp ở New York - Vina Technology at AI time
Episode 2813 - September 17 - Trường hợp ở New York - Vina Technology at AI time
17 Thg09, 24 • 11ph
Episode 2812 - September 17 - Tiếng Anh - Tin Công nghệ Thông tin – Ngày 16 tháng 9, 2024 - Vina Technology at AI time
Episode 2812 - September 17 - Tiếng Anh - Tin Công nghệ Thông tin – Ngày 16 tháng 9, 2024 - Vina Technology at AI time
17 Thg09, 24 • 12ph
Nâng cấp tài khoản
Quý khách vui lòng nâng cấp tài khoản để nghe podcast này