Skip to content

Commit f12b820

Browse files
authored
Merge pull request d-run#410 from windsonsea/news
Add gpt-5 news
2 parents 0ae271c + bbf1314 commit f12b820

File tree

11 files changed

+244
-0
lines changed

11 files changed

+244
-0
lines changed

docs/zh/docs/blogs/2025/gpt5.md

Lines changed: 124 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,124 @@
1+
# GPT-5 正式发布:OpenAI 史上最大规模产品升级 四大版本全面解析
2+
3+
2025 年 8 月 7 日,OpenAI 正式发布 GPT-5 系列模型,这是该公司历史上最重要的产品升级。此次发布包含
4+
GPT-5、GPT-5Mini、GPT-5Nano 和 GPT-5Pro 四个版本,每个版本针对不同应用场景进行深度优化,标志着 AI 技术进入全新发展阶段。
5+
6+
## 统一智能系统:技术架构的革命性突破
7+
8+
GPT-5 被 OpenAI 定位为"统一智能系统",成功整合了此前分散在不同模型中的能力:GPT-4o 的多模态处理、
9+
o 系列的深度推理、高级数学计算以及代理任务执行。这一架构创新让用户无需在不同模型间手动切换,
10+
系统通过实时路由器根据任务复杂度自动选择最适合的处理方式。
11+
12+
在核心技术指标上,GPT-5 实现了全面突破:
13+
14+
- 数学推理:在 AIME2025 基准测试中达到 94.6%准确率,无需外部工具
15+
- 代码能力:SWE-bench Verified 测试得分 74.9%,Aider Polyglot 多语言编程测试达到 88%
16+
- 多模态理解:MMMU 基准测试得分 84.2%
17+
- 专业知识:在 GPQA 通用问题回答测试中得分 88.4%
18+
19+
## 四大版本详细解析
20+
21+
### GPT-5(旗舰版):最强推理与多模态能力
22+
23+
![gpt5-default](./images/gpt5-01.png)
24+
25+
作为系列中的旗舰产品,GPT-5 专为复杂任务设计,具备以下核心特性:
26+
27+
- 推理能力突破:内置链式推理(Chain-of-Thought)技术,能够分解复杂问题并逐步解决。在内部测试中,
28+
GPT-5 在 40 多个职业领域的复杂任务上表现优于前代所有模型。
29+
- 全面多模态支持:支持文本、图像、语音和视频处理,继承了 Sora 的视频生成技术。用户可以上传各种格式的内容,
30+
GPT-5 能够生成相应回应或执行复合任务,例如分析医学影像或实时翻译视频内容。
31+
- 代理式任务执行:支持自动浏览网页、生成完整软件应用、管理日程等复杂操作。在发布会演示中,
32+
GPT-5 根据简单描述在数秒内生成了包含闪卡、测验和进度跟踪功能的完整法语学习 Web 应用。
33+
- 大幅降低幻觉率:通过"安全补全"技术,GPT-5 的事实错误率比 GPT-4o 降低约 45%,
34+
在使用推理模式时错误率比 o3 模型降低约 80%。
35+
36+
### GPT-5Mini:高性价比的轻量选择
37+
38+
![gpt5-mini](./images/gpt5-02.png)
39+
40+
GPT-5Mini 针对成本敏感应用进行优化,在保留核心功能的同时显著降低了资源需求:
41+
42+
- 支持中等复杂度的链式推理任务
43+
- 具备文本、图像和语音处理能力,视频处理功能相对受限
44+
- 可在较低算力设备上运行,适合中小企业和个人开发者
45+
- 在核心推理测试中接近 o4-mini 性能水平
46+
47+
主要应用场景包括教育内容生成、客户服务自动化、简单多模态任务处理等。
48+
49+
### GPT-5Nano:超高效边缘计算模型
50+
51+
![gpt5-nano](./images/gpt5-03.png)
52+
53+
GPT-5Nano 专为速度和低资源占用优化,是系列中最轻量的版本:
54+
55+
- 极低延迟响应,专为实时应用设计
56+
- 可在内存仅 16GB 的设备上运行,包括 MacBook 或低端服务器
57+
- 推理能力相对简化,主要用于快速交互和简单任务
58+
- 在通用基准测试中与 o3-mini 性能相当
59+
60+
适用场景包括移动设备应用、嵌入式系统、实时翻译、语音助手等对响应速度要求极高的场景。
61+
62+
### GPT-5Pro:面向专业用户的增强版本
63+
64+
GPT-5Pro 是专为高端用户和企业设计的高性能版本:
65+
66+
- 增强推理模式:支持"GPT-5Thinking"功能,可对复杂问题进行更长时间的深度推理,确保极高准确性。
67+
- 无限制访问:Pro 用户享有无限制的 GPT-5 访问权限,以及 GPT-5Pro 的独家访问权。
68+
- 专业多模态能力:在视频处理、复杂图像分析等任务中表现优异,在 HealthBench Hard 医疗基准测试中得分 46.2%。
69+
- 深度工具整合:无缝集成搜索、Canvas、代码执行等专业工具,提供完整的工作流体验。
70+
71+
## 定价策略:史上最大规模免费开放
72+
73+
OpenAI 采用了前所未有的开放策略,向所有用户群体提供 GPT-5 访问权限:
74+
75+
- 免费用户:可使用 GPT-5 和 GPT-5Mini,有使用限额,超出后自动切换至 Mini 版本
76+
- Plus 用户($20/月):享有更高使用限额,适合个人用户和小型团队
77+
- Pro 用户($200/月):无限制访问 GPT-5 和 GPT-5Pro,并可使用"GPT-5Thinking"模式
78+
- 企业与教育用户:发布后一周内获得访问权限,并可使用 GPT-5Pro 版本
79+
- API 定价:输入$1.25/百万 token,输出$10/百万 token,面向专业开发者
80+
81+
## 用户体验的全面升级
82+
83+
GPT-5 系列带来了多项用户体验创新:
84+
85+
- 智能模型选择:系统根据任务复杂度和用户意图自动选择最适合的模型版本,用户无需手动切换
86+
87+
- 个性化交互:提供四种预设人格(Cynic、Robot、Listener、Nerd)和自定义聊天颜色选项
88+
89+
- 增强记忆能力:更大的上下文窗口能够记住更长的对话历史,提供更连贯的交互体验
90+
91+
- 用户友好设计:相比 GPT-4o,新模型减少了过度讨好的表达,使用更少不必要的表情符号,让交互更加自然
92+
93+
## 技术架构创新
94+
95+
GPT-5 系列可能采用了混合专家模型(MoE)架构,通过减少活跃参数数量大幅提升效率。训练数据以英语文本为主,聚焦
96+
STEM、编程和通用知识领域,知识截止时间为 2024 年 6 月。整个训练过程在 NVIDIA H100GPU 上完成,耗费约 210 万 GPU 小时。
97+
98+
## 竞争优势与市场影响
99+
100+
在当前 AI 竞争激烈的环境下,GPT-5 的发布具有重要战略意义。面对 Anthropic Claude3.5Sonnet、
101+
xAI Grok4、Google Gemini2.5Pro 等强劲竞争对手,OpenAI 通过免费开放策略和显著降低幻觉率来巩固市场地位。
102+
103+
据统计,目前已有 500 万付费用户使用 ChatGPT 商业产品,包括 BNY Mellon、加州州立大学、Figma、Intercom、
104+
摩根士丹利等知名机构。GPT-5 的发布预计将进一步加速企业 AI 采用,推动各行业的数字化转型。
105+
106+
## 行业展望与挑战
107+
108+
GPT-5 系列的发布代表了 AI 技术发展的新里程碑,但同时也面临一些挑战:
109+
110+
- 隐私与安全:多模态能力涉及处理医疗影像、个人对话等敏感数据,数据保护成为关键议题
111+
- 技术影响:自动化程度的提升可能对传统工作岗位产生冲击,需要社会层面的适应和调整
112+
- 性能验证:虽然 OpenAI 声称 GPT-5 具备"博士级智能",但其真实推理能力在实际应用中的表现仍需时间检验
113+
114+
## 总结
115+
116+
GPT-5 系列的发布标志着 OpenAI 在 AI 领域的又一次重大突破。通过四个版本的差异化布局,OpenAI
117+
成功覆盖了从个人用户到企业客户的全部需求谱系。这不仅是一次技术升级,更是 AI 产品策略的全面革新。
118+
119+
随着 GPT-5 成为 ChatGPT 的新默认模型,取代此前的 GPT-4o、o3 等版本,用户只需打开 ChatGPT
120+
输入问题,系统将自动处理并在需要时应用推理功能。这种无缝体验的实现,预示着 AI 技术正在从工具向助手、从辅助向协作的方向快速演进。
121+
122+
## 参考
123+
124+
- [OpenAI GPT-5 官方介绍页面](https://openai.com/index/introducing-gpt-5/)
246 KB
Loading
257 KB
Loading
260 KB
Loading

docs/zh/docs/blogs/index.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,11 @@ hide:
77

88
本频道将紧跟技术趋势,收集 AI 行业新闻。
99

10+
* [GPT-5 正式发布:OpenAI 史上最大规模产品升级 四大版本全面解析](./2025/gpt5.md)
11+
12+
2025 年 8 月 7 日,OpenAI 正式发布 GPT-5 系列模型,这是该公司历史上最重要的产品升级。此次发布包含
13+
GPT-5、GPT-5Mini、GPT-5Nano 和 GPT-5Pro 四个版本,每个版本针对不同应用场景进行深度优化,标志着 AI 技术进入全新发展阶段。
14+
1015
* [d.run 上新 DeepSeek-R1-0528,强化 CoT 推理链,代码实力再进化](./2025/0603-deepseek-0528.md)
1116

1217
端午节期间,d.run 大模型服务平台紧跟 DeepSeek 步伐,上线了全新的 **DeepSeek-R1-0528** 模型。

docs/zh/docs/en/blogs/2025/gpt5.md

Lines changed: 109 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,109 @@
1+
# GPT-5 Official Release: The Largest Product Upgrade in OpenAI’s History — Full Analysis of All Four Versions
2+
3+
On August 7, 2025, OpenAI officially released the GPT-5 series models, marking the most significant product upgrade in the company’s history. The release includes GPT-5, GPT-5Mini, GPT-5Nano, and GPT-5Pro, each deeply optimized for different application scenarios. This milestone signifies a new era of AI development.
4+
5+
## Unified Intelligence System: A Revolutionary Technological Breakthrough
6+
7+
OpenAI positions GPT-5 as a "Unified Intelligence System," successfully integrating capabilities that were previously scattered across different models: GPT-4o’s multimodal processing, the o-series’ deep reasoning, advanced mathematical computation, and agent task execution. This architectural innovation eliminates the need for manual switching between models — a real-time router automatically selects the most suitable processing method based on task complexity.
8+
9+
GPT-5 delivers significant breakthroughs in core metrics:
10+
11+
- **Mathematical Reasoning:** Achieves 94.6% accuracy in the AIME2025 benchmark without external tools.
12+
- **Coding Skills:** Scores 74.9% on the SWE-bench Verified test; reaches 88% on the Aider Polyglot multi-language programming benchmark.
13+
- **Multimodal Understanding:** Scores 84.2% on the MMMU benchmark.
14+
- **Professional Knowledge:** Achieves 88.4% on the GPQA general problem answering benchmark.
15+
16+
## Detailed Breakdown of the Four Versions
17+
18+
### GPT-5 (Flagship): The Most Powerful Reasoning and Multimodal Capabilities
19+
20+
![gpt5-default](./images/gpt5-01.png)
21+
22+
As the flagship product of the series, GPT-5 is designed for complex tasks and offers the following key features:
23+
24+
- **Breakthrough Reasoning:** Built-in Chain-of-Thought technology to decompose and solve complex problems step-by-step. In internal testing, GPT-5 outperformed all previous models in complex tasks across 40+ professional fields.
25+
- **Comprehensive Multimodal Support:** Supports text, image, speech, and video processing, inheriting Sora’s video generation technology. Users can upload various formats, and GPT-5 can respond accordingly or execute compound tasks — e.g., analyzing medical images or translating video content in real time.
26+
- **Agent-Like Task Execution:** Can automatically browse the web, generate complete software applications, and manage schedules. In the launch demo, GPT-5 created a full French learning web app with flashcards, quizzes, and progress tracking in seconds based on a simple description.
27+
- **Significantly Lower Hallucination Rate:** With "Safe Completion" technology, GPT-5’s factual error rate is about 45% lower than GPT-4o, and up to 80% lower than the o3 model in reasoning mode.
28+
29+
### GPT-5Mini: High Value, Lightweight Option
30+
31+
![gpt5-mini](./images/gpt5-02.png)
32+
33+
Optimized for cost-sensitive applications, GPT-5Mini retains core capabilities while significantly reducing resource requirements:
34+
35+
- Supports medium-complexity Chain-of-Thought reasoning.
36+
- Handles text, image, and speech processing; limited video capabilities.
37+
- Runs on lower-spec devices — ideal for SMEs and individual developers.
38+
- Core reasoning performance close to o4-mini.
39+
40+
Main use cases include educational content generation, customer service automation, and simple multimodal tasks.
41+
42+
### GPT-5Nano: Ultra-Efficient Edge Computing Model
43+
44+
![gpt5-nano](./images/gpt5-03.png)
45+
46+
GPT-5Nano is optimized for speed and low resource usage, making it the most lightweight version:
47+
48+
- Extremely low latency, designed for real-time applications.
49+
- Operates on devices with as little as 16GB RAM, including MacBooks and low-end servers.
50+
- Simplified reasoning, focused on quick interactions and simple tasks.
51+
- Comparable performance to o3-mini in general benchmarks.
52+
53+
Best suited for mobile apps, embedded systems, real-time translation, and voice assistants where speed is critical.
54+
55+
### GPT-5Pro: Enhanced Version for Professional Users
56+
57+
GPT-5Pro targets high-end users and enterprises:
58+
59+
- **Enhanced Reasoning Mode:** Includes "GPT-5Thinking" for extended deep reasoning on complex problems with extremely high accuracy.
60+
- **Unlimited Access:** Pro users get unrestricted access to GPT-5 and exclusive GPT-5Pro capabilities.
61+
- **Professional-Grade Multimodal:** Excels in video processing and complex image analysis, scoring 46.2% on the HealthBench Hard medical benchmark.
62+
- **Deep Tool Integration:** Seamless access to search, Canvas, code execution, and other professional tools for a complete workflow.
63+
64+
## Pricing Strategy: The Largest Free Access Rollout Ever
65+
66+
OpenAI adopts an unprecedentedly open strategy, granting GPT-5 access to all user groups:
67+
68+
- **Free Users:** Access to GPT-5 and GPT-5Mini with usage limits; excess usage switches to Mini automatically.
69+
- **Plus Users ($20/month):** Higher usage limits, ideal for individuals and small teams.
70+
- **Pro Users ($200/month):** Unlimited access to GPT-5 and GPT-5Pro, plus GPT-5Thinking mode.
71+
- **Enterprise & Education:** Access to GPT-5Pro within a week of release.
72+
- **API Pricing:** $1.25 per million input tokens; $10 per million output tokens for professional developers.
73+
74+
## Comprehensive User Experience Upgrades
75+
76+
The GPT-5 series introduces multiple UX innovations:
77+
78+
- **Smart Model Selection:** Automatically picks the best model version based on task complexity and user intent — no manual switching needed.
79+
- **Personalized Interaction:** Four preset personas (Cynic, Robot, Listener, Nerd) plus custom chat color options.
80+
- **Enhanced Memory:** Larger context window to recall longer conversations for more coherent interactions.
81+
- **User-Friendly Design:** Reduced over-politeness and fewer unnecessary emojis compared to GPT-4o for a more natural feel.
82+
83+
## Architectural Innovations
84+
85+
The GPT-5 series likely adopts a Mixture-of-Experts (MoE) architecture, improving efficiency by reducing the number of active parameters. Training data is primarily English text, focused on STEM, programming, and general knowledge. The knowledge cutoff is June 2024. Training was completed on NVIDIA H100 GPUs, consuming about 2.1 million GPU hours.
86+
87+
## Competitive Edge and Market Impact
88+
89+
In today’s highly competitive AI landscape, GPT-5’s release carries major strategic significance. Against strong rivals like Anthropic Claude 3.5 Sonnet, xAI Grok 4, and Google Gemini 2.5 Pro, OpenAI strengthens its market position with free access and drastically reduced hallucination rates.
90+
91+
Reportedly, 5 million paying users already use ChatGPT’s commercial products, including organizations like BNY Mellon, California State University, Figma, Intercom, and Morgan Stanley. GPT-5’s release is expected to further accelerate enterprise AI adoption and drive digital transformation across industries.
92+
93+
## Industry Outlook and Challenges
94+
95+
The GPT-5 series marks a new milestone in AI development but also faces challenges:
96+
97+
- **Privacy & Security:** Multimodal capabilities involve sensitive data like medical imaging and private conversations, making data protection a priority.
98+
- **Technological Impact:** Increased automation may disrupt traditional jobs, requiring societal adaptation.
99+
- **Performance Verification:** While OpenAI claims GPT-5 has “PhD-level intelligence,” its real-world reasoning performance still needs time to be fully validated.
100+
101+
## Conclusion
102+
103+
The GPT-5 series represents another major breakthrough for OpenAI. With four differentiated versions, it effectively covers the entire spectrum from individual users to enterprise clients. This is not just a technical upgrade, but a complete overhaul of AI product strategy.
104+
105+
With GPT-5 now the default ChatGPT model — replacing GPT-4o, o3, and others — users simply open ChatGPT, input a question, and the system automatically handles it, applying reasoning when needed. This seamless experience signals AI’s rapid evolution from a tool to an assistant, and from assistance to true collaboration.
106+
107+
## References
108+
109+
- [OpenAI GPT-5 Official Page](https://openai.com/index/introducing-gpt-5/)
246 KB
Loading
257 KB
Loading
260 KB
Loading

docs/zh/docs/en/blogs/index.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,10 @@ hide:
77

88
This channel will closely follow technology trends and collect news from the AI industry.
99

10+
- [GPT-5 Official Release: The Largest Product Upgrade in OpenAI’s History — Full Analysis of All Four Versions](./2025/gpt5.md)
11+
12+
On August 7, 2025, OpenAI officially released the GPT-5 series models, marking the most significant product upgrade in the company’s history. The release includes GPT-5, GPT-5Mini, GPT-5Nano, and GPT-5Pro, each deeply optimized for different application scenarios. This milestone signifies a new era of AI development.
13+
1014
- [Announcing the llm-d community!](./2025/llmd.md)
1115

1216
llm-d is a Kubernetes-native high-performance distributed LLM inference framework,

0 commit comments

Comments
 (0)