GPT-4 Technical Report

OpenAI

open-ai(2023)

引用 0|浏览9310
摘要
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformerbased model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4’s performance based on models trained with no more than 1/1,000th the compute of GPT-4
更多
查看译文
PDF
PPT

代码

数据

原文链接
引用

0
您的评分 :

暂无评分

标签
评论
avatar
作者解读

点赞

0%
0/20人

想看人数超过20人时,我们会邀请作者来解读:

  • 解决的问题
  • 实验设计的思路
  • 重要创新
  • 后续可能的深入研究
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn