当前位置:首页 >探索 >【】

【】

2026-04-17 19:16:44 [時尚] 来源:有聲有色網

Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.

OpenAI says GPT-4 is much more advanced than GPT-3, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 with a variety of standardized tests from high school to graduate to professional level and spanning across mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams were comprised of multiple choice and free-response question and GPT-4 was scored using the standard methodology for each exam.

SEE ALSO:How to get access to GPT-4 right now

Put your pencil down, GPT-4, it's time to see check your scores.

What, like law school is hard?

GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSATs (Law School Admission Test) and did even better on the Bar (Uniform Bar Exam) by scoring in the 90th percentile. By comparison, GPT-3 was in the bottom 40 percent of the LSATs and 10 percent on the Bar.

Mashable Games

College admissions tests were a piece of cake

GPT-4 took both the math and reading/writing sections of the SATs and all three sections of the GREs which are broken down into quantitative, verbal, and writing skills. It scored in the 80th or 90th percentile of all sections except for the writing section of the GREs... which it kind of bombed in the 54th percentile.

Mashable Light SpeedWant more out-of-this world tech, space and science stories?Sign up for Mashable's weekly Light Speed newsletter.By signing up you agree to our Terms of Use and Privacy Policy.Thanks for signing up!

The quintessential overachiever, GPT-4 also took allthe AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th, except for a few outliers.

GPT-4 scored 44th in AP English Language and a measly 22nd in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC scoring between 43rd and 59th, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.

GPT-4 has some coding work to do

GPT-4 still has some work to do with its coding skills, which is curious since one of its marketed uses is for helping developers. Its rating for Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category of anything below 1199.


Related Stories
  • OpenAI announces GPT-4
  • Getting a ChatGPT at capacity error? Tips on how to get past it
  • Grammarly introduces a ChatGPT-style AI tool for writing and editing
  • OpenAI is making ChatGPT and Whisper available to third-parties
  • How ChatGPT and AI are affecting the literary world

It did pretty well on the easy level of the Leetcode (31 out of 41 problems solved) but struggled when it came to medium or hard level of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but required some manual tweaking to set the right parameters, which might explain some these test scores. Or maybe it didn't eat breakfast that morning.

Ok, but can GPT-4 become a sommelier?

GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.

OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.

TopicsArtificial IntelligenceChatGPT

(责任编辑:焦點)

    推荐文章
    • 17 questions you can answer if you're a good communicator

      17 questions you can answer if you're a good communicatorWhether you regularly speak in public and write online, or you mostly express yourself over email, b ...[详细]
    • How to sext

      How to sextSexting, as Bo Burnham taught us, isn't sex — it's the next best thing.I personally wouldn't g ...[详细]
    • 36周早產兒成活率多少

      36周早產兒成活率多少在懷孕晚期孕婦更需要全麵注重身體所出現的各種變化 ,因為在這個時期稍有不慎就有可能導致胎兒出現早產跡象 。但是也有不少懷孕36周的孕婦,由於沒有注重生活中的方方麵麵 ,導致出現了早產問題。36周的早產兒會比 ...[详细]
    • 寶寶摔跤牙齒咬破嘴唇

      寶寶摔跤牙齒咬破嘴唇寶寶是比較愛動的階段 ,寶寶比較淘氣好動,可能會不小心摔跤了,有的寶寶摔跤後牙齒不小心咬破了嘴唇 ,嘴唇會出現流血,破皮等表現,因為是在嘴唇部位  ,傷口也不容易愈合 。很多寶媽遇到這樣的情況措手不及 ,不知道怎 ...[详细]
    • PlayStation Now game streaming is coming to PC

      PlayStation Now game streaming is coming to PCSony's PlayStation Now service is launching for Windows PC, meaning subscribers will soon be able to ...[详细]
    • 早產兒和足月兒的區別

      早產兒和足月兒的區別即使是現代社會的生活水平正在逐漸提高 ,但是仍舊有許多孕婦出現了早產問題 。如果大家仔細觀察的話就會發現,其實身邊有很多孕婦都是早產 。對於早產寶寶來說 ,身體各方麵和足月的小寶寶當然也有很多的不同,家長們隻 ...[详细]
    • 三歲寶寶發燒39度怎麽辦

      三歲寶寶發燒39度怎麽辦感冒發燒對於小孩來說是很常見的 ,家長們可密切觀察發燒的情況 ,如果不高就不必急於去醫院就診 ,但是三歲寶寶發燒39度怎麽辦呢 ?如果寶寶的體溫達到39度應口服降溫藥 ,但也要注意這個發燒是一下子發起來還是慢慢 ...[详细]
    • 男士瘦肚子的健身方法

      男士瘦肚子的健身方法不少男性在人到中年之後,會出現啤酒肚的問題 。這種問題 ,會在很大程度上影響男性的形象 ,使男性看起來不是那麽的好看。正因此 ,不少人會選擇通過健身的方法來達到瘦肚子的目的 。其實,除了健身之外,可以瘦肚子的方 ...[详细]
    • MashReads Podcast: What makes a good summer read?

      MashReads Podcast: What makes a good summer read?Summer is coming to a close and that means one thing -- last-minute vacations!。SEE ALSO:'Ice Cream B ...[详细]
    • 1歲寶寶能吃火龍果麽

      1歲寶寶能吃火龍果麽1歲的寶寶是可以吃火龍果的 ,火龍果是一種含有很多的維生素以及纖維素的水果,是可以促進寶寶的腸胃消化,所以給1歲的寶寶食用火龍果是很好的 ,也不會影響到寶寶的腸胃功能 ,但是要避免喂養量 ,避免出現腹脹的情況 ...[详细]
    热点阅读