Updated ChatGPT version performs spectacularly on most exams

Republished from Jpost.com MARCH 18, 2023

GPT 4 is more able to handle nuanced instructions than its predecessor and therefore beat it on every exam.

ChatGPT’s newest update, GPT 4, is advanced enough to pass a bar exam with a score in the top 10% of test takers, compared to the previous version which made it to the bottom 10%, Open AI announced this week.

In comparison to the previous version, GPT 3.5, the conversational abilities of GPT 4 are almost the same with only subtle changes. The main differences between the two are expressed in tasks that come with nuanced instructions.

GPT 4 was proven to be more reliable and able to use its creativity to excel in such tasks better than GPT 3.5.

The new version was given real versions of a variety of tests including a bar exam, the GRE, medical knowledge and high school AP class tests. In most of these exams, GPT 4 placed in the 80th percentile or higher. The exams in which it placed low were the writing section of the GRE,USNCO Local Section Exam, codeforces rating and AP Calculus BC.

ChatGPT can also succeed on benchmarks for machine learning models

GPT 4 was also tested on a number of benchmarks designed for machine learning models and outperformed GPT 3.5 on every one of the tests. As these tests are usually written in English, some of the tests were translated into other languages. GPT 4 did better than 3.5 in 24 of 26 languages.

The emergence of ChatGPT, which can mimic normal human language based on prompts, gave rise to concerns of cheating in the education system. Teachers are worried that students will now use ChatGPT to write their papers for them.

Perhaps the fact that GPT 4 performed poorly on the writing section of the GRE will assure teachers that despite GPT 4’s more advanced abilities, it is still not good enough to fool them.

Another new feature that GPT 4 now has is visual input. This means that photos can now be used as part of the prompt for the AI’s output.

Review of ChatGPT

When the translation functionality of ChatGPT was tested against simple sentences that only contained the 220 Dolch Words (eg. girl, puppy, etc.), the end translations were related, but not the expected results.

 

Avatar photo

By Melissa Newman

Site Administrator, Homeschooling for 15+ years, grades Prek - Grade 8.