GPT-4 can ace the bar, but it only has a decent chance of passing the CFA exams. Here's a list of difficult exams the ChatGPT and GPT-4 have passed.

Lakshmi Varanasi

November 5, 2023 at 5:47 p.m.·10 min read

OpenAI's buzzy chatbot, ChatGPT, has already passed medical, law, and business school exams.
And its newest model, GPT-4 can ace the bar and has a reasonable chance passing the CFA exam.
Insider rounded up a list of the assignments, quizzes, and tests both models have passed.

Since OpenAI launched ChatGPT last November, people have been putting the chatbot to the test literally by using it to write exams and generate essays. While the bot has performed reasonably well at the high school level, and even the graduate level on occasion, it certainly makes its share of mistakes, too.

But then, in March, OpenAI released GPT-4, its most advanced model to date. The deep learning model can comprehend and discuss pictures and generate eight times the text of its predecessor, ChatGPT, making it a significantly sharper exam-taker.

If you're wondering exactly how smart these generative AI tools are, check out some of the difficult exams they've attempted, aced, and failed.

GPT-4 has a shot at passing the CFA exam — but ChatGPT? Not a chance.

Young student girl preparing for college test, exam, writing notes. — A Gen Z TikToker who made 6 figures from teaching people how to write essays was found to have plagiarized one of her own essaysfizkes/Getty Images

GPT-4 has a "decent chance" of passing the CFA level I and level II exams with appropriate prompting, while ChatGPT would not pass under all settings that were tested in a study from a team of researchers from Queens University, Virginia Tech, and J.P. Morgan's AI research division. The model struggled more with level II than level I, the researchers said, noting that there's "no consensus" on which level is more difficult for exam takers.

GPT-4 performed better than ChatGPT in almost every topic, the researchers found.

The series of three exams it takes to obtains your CFA is notoriously difficult for humans, too. Pass rates for Level I, II, and III fell between 37% to 47% in August 2023, according to the CFA Institute.

GPT-4 scored in the 90th percentile of the bar exam with a score of 298 out of 400.

While GPT-3.5, which powers the free version of ChatGPT, only scored in the 10th percentile of the bar exam, according to OpenAI.

The threshold for passing the bar varies from state to state. In New York though, exam takers need a score of 266, around the 50th percentile, to pass, according to The New York State Board of Law Examiners.

GPT-4 aced the SAT Reading & Writing section with a score of 710 out of 800, which puts it in the 93rd percentile of test-takers.

5e6fc018235c180e877a2a04 - Students taking an exam — Reuters

Meanwhile, GPT-3.5, scored in the 87th percentile with a score of 670 out of 800, according to OpenAI.

For the math section, GPT-4 earned a 700 out of 800, ranking among the 89th percentile of test-takers, according to OpenAI. While GPT-3.5 scored in the 70th percentile, OpenAI noted.

In total, GPT-4 scored 1410 out of 1600 points. The average score on the SAT in 2021 was 1060, according to a report from the College Board.

GPT-4's scores on the Graduate Record Examinations, or GRE, varied widely according to the sections.

Hand completing a multiple choice exam. The answer form was created by me and is not copyrighted. — Pencil held over a multiple choice exambluestocking / Getty Images

While it scored in the 99th percentile on the verbal section of the exam and in the 80th percentile of the quantitative section of the exam, GPT-4 only scored in the 54th percentile of the writing test, according to OpenAI.

GPT-3.5 also scored in the 54th percentile of the writing test, and earned marks within the 25th percentile and 63rd percentiles for the quantitative and verbal sections respectively, according to OpenAI.

GPT-4 scored in the 99th to 100th percentile on the 2020 USA Biology Olympiad Semifinal Exam, according to OpenAI.

The USA Biology Olympiad is a prestigious national science competition that regularly draws some of the brightest biology students in the country The first round features a 50-minute open online exam that draws thousands of students across the country, according to USABO's site.

The second round — the Semifinal Exam — is a 120-minute exam with three parts featuring multiple choice, true/false, and short answer questions, USABO notes on its site. Students with the top 20 scores on the Semifinal Exam will advance to the National Finals, according to USABO.

GPT-4 has passed a host of Advanced Placement examinations, exams for college-level courses taken by high school students that are administered by the College Board.

Female teacher is marking exam papers in classroom — Leren Lu / Getty Images

Scores range from 1 to 5, with scores of 3 and above generally considered passing grades, according to the College Board.

GPT-4 received a 5 on AP Art History, AP Biology, AP Environmental Science, AP Macroeconomics, AP Microeconomics, AP Psychology, AP Statistics, AP US Government and AP US History, according to OpenAI.

On AP Physics 2, AP Calculus BC, AP Chemistry, and AP World History, GPT-4 received a 4, OpenAI said.

GPT-4 still struggles with high school math exams.

The AMC 10 and 12 are 25-question, 75-minute exams administered to high school students that cover mathematical topics including algebra, geometry, trigonometry, according to the Mathematical Association of America's site.

In the fall of 2022, the average score out of 150 total points on the AMC 10 was 58.33 and 59.9 on the AMC 12, according to the MAA's site. GPT-4 scored a 30 and 60, respectively, putting it between the 6th to 12th percentile of the AMC 10 and the 45th to 66th percentile of the AMC 12, according to OpenAI.

While it's notoriously difficult to earn your credentials as a wine steward, GPT-4 does pass examinations to become a sommelier.

sommelier pouring wine botttle — Shutterstock.com

GPT-4 has passed the Introductory Sommelier, Certified Sommelier, and Advanced Sommelier exams at respective rates of 92%, 86%, and 77%, according to OpenAI.

GPT-3.5 came in at 80%, 58%, and 46% for those same exams, OpenAI said.

ChatGPT fares reasonably well on some sections of a Wharton MBA exam but struggles with others.

The Wharton School.David Tran Photo/Shutterstock

Wharton professor Christian Terwiesch recently tested the technology with questions from his final exam in operations management— which was once a required class for all MBA students — and published his findings.

Terwiesch concluded that the bot did an "amazing job" answering basic operations questions based on case studies, which are focused examinations of a person, group, or company, and a common way business schools teach students.

In other instances though, ChatGPT made simple mistakes in calculations that Terwiesch thought only required 6th-grade-level math. Terwiesch also noted that the bot had issues with more complex questions that required an understanding of how multiple inputs and outputs worked together.

Ultimately, Terwiesch said the bot would receive an B or B- on the exam.

ChatGPT passed all three parts of the United States medical licensing examination within a comfortable range.

Doctor uses computer an smartphone simultaneously.Getty Images

Researchers put ChatGPT through the United States Medical Licensing Exam — a three part exam that aspiring doctors take between medical school and residency — and reported their findings in a paper published in December 2022.

The paper's abstract noted that ChatGPT "performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations."

Ultimately, the results show that large language models — which ChatGPT has been trained on— may have "the potential" to assist with medical education and even clinical decision making, the abstract noted.

The research is still under peer review, Insider noted based on a report from Axios.

While ChatGPT has generated convincing essays on occasion, it's also raised eyebrows for spewing out well-written misinformation.

It didn't take long after ChatGPT was released for students to start using it for essays and educators to start worrying about plagiarism.

In December, Bloomberg podcaster Matthew S. Schwartz tweeted that the "take home essay is dead." He noted that he had fed a law school essay prompt into ChatGPT and it had "responded *instantly* with a solid response."

—〽️atthew S. Schwartz (@SchwartzReports) December 7, 2022

In another instance, a philosophy professor at Furman University caught a student turning in an AI-generated essay upon noticing it had "well-written misinformation," Insider reported.

"Word by word it was a well-written essay," the professor told Insider. As he took a more careful look however, he noticed that the student made a claim about the philosopher David Hume that "made no sense" and was "just flatly wrong" Insider reported.

In an interview in January, Sam Altman— CEO of OpenAI which makes ChatGPT — said that while the company will devise ways to help schools detect plagiarism, he can't guarantee full detection.

ChatGPT aced a college-level microbiology quiz.

Science journalist and executive editor of Big Think, Alex Berezow, tested ChatGPT with a 10-question microbiology quiz that he devised.

Berezow, who also holds a Ph.D in microbiology, noted that the questions would be appropriate for a final exam for college level students. ChatGPT "blew it away," Berezow wrote.

In one example, Berezow asked:

"An emergency room patient presents with a terrible headache and stiff neck. The doctor orders a spinal tap to collect cerebrospinal fluid. A Gram stain of the CSF reveals the presence of Gram negative diplococci. What is the diagnosis?"

To which ChatGPT correctly responded:

Based on the information you provided, the Gram stain of the cerebrospinal fluid (CSF) shows the presence of Gram-negative diplococci, which are bacteria that are typically oval-shaped and occur in pairs. This finding is consistent with the diagnosis of meningitis.

In another instance he asked:

"In five words or less, what is the point of Koch's postulates?"

To which ChatGPT said:

Establish causality between microbe and disease.

Taking out the word "and" Berezow said ChatGPT "Nailed it."

ChatGPT barely passed Law School Exams, earning something close to a C+.

Law professor — Jacobs Stock Photography Ltd/ Getty Images

ChatGPT recently passed exams in four law school courses at the University of Minnesota, based on a recently published paper written by four law school professors at the school.

In total, the bot answered over 95 multiple choice questions and 12 essay questions that were blindly graded by the professors. Ultimately, the professors gave ChatGPT a "low but passing grade in all four courses" approximately equivalent to a C+.

Still the authors pointed out several implications for what this might mean for lawyers and law education. In one section they wrote:

"Although ChatGPT would have been a mediocre law student, its performance was sufficient to successfully earn a JD degree from a highly selective law school, assuming its work remained constant throughout law school (and ignoring other graduation requirements that involve different skills). In an era where remote exam administration has become the norm, this could hypothetically result in a struggling law student using ChatGPT to earn a JD that does not reflect her abilities or readiness to practice law."

But the bot did pass a Stanford Medical School clinical reasoning final.

ChatGPT passed a Stanford Medical School final in clinical reasoning. According to a YouTube video uploaded by Eric Strong — a clinical associate professor at Stanford — ChatGPT passed a clinical reasoning exam with an overall score of 72%.

In the video, Strong described clinical reasoning in five parts. It includes analyzing a patient's symptoms and physical findings, hypothesizing possible diagnoses, selecting appropriate tests, interpreting test results, and recommending treatment options.

He said, "it's a complex, multi-faceted science of its own, one that is very patient-focused, and something that everything every practicing doctor does on a routine basis."

Strong noted in the video that the clinical reasoning exam is normally given to first-year medical students who need a score of 70% to pass.

Read the original article on Business Insider

BANG Showbiz
Megan Thee Stallion being sued for ‘forcing cameraman watch her having lesbian sex!’
In a suit being brought by her ex-cameraman, Megan Thee Stallion is being sued for allegedly creating a hostile work environment and forcing her former videographer to watch her having lesbian sex.
a day ago
Yahoo Canada Style
Sophie Grégoire Trudeau is leaning into the unknown for her next chapter: 'I'm OK with the uncertainty'
No question was off limits in Yahoo Canada's candid and emotional conversation with the "Closer Together" author.
2 days ago
People
“Call Her Daddy'”s Alex Cooper Models Her Wedding Night Lingerie in Instagram Reveal: See the Racy Look
Cooper wore a sexy lacy bodysuit from SKIMS' Wedding Shop collection after marrying Matt Kaplan in Mexico
8 hours ago
Sacramento Bee
Ex-teacher had sex with her student on his 8th-grade graduation, California prosecutors say
The former Butte County teacher pleaded no contest Monday to the charges.
16 hours ago
HuffPost
How Toxic Is Trump? Republican Group's Hidden Camera Reveals Uncomfortable Truth.
The former president's behavior just doesn't fly out in the real world.
2 days ago
People
Ex-Aide Says Melania Trump Will Be Watching 'Every Ounce' of Hush Money Trial — and Looking for 1 Thing
Stephanie Grisham, who served as chief of staff and press secretary to Melania, offered a window into her former boss's thinking as Donald's alleged affairs take center stage in the Manhattan trial
2 days ago
ABC News
'So appalled': What witnesses told special counsel about Trump's handling of classified info while still president
In the summer of 2019, only hours after an Iranian rocket accidentally exploded at one of Iran's own launch sites, senior U.S. officials met with then-president Donald Trump and shared a sharply detailed, highly classified image of the blast's catastrophic aftermath. Worried that the image becoming public could hurt national security efforts, intelligence officials urged Trump to hold off until more knowledgeable experts were able to weigh in, the sources said.
11 hours ago
The Daily Beast
Trump Picks Another Fight With Judge as He Waits for Contempt Ruling
PoolDonald Trump just can’t help himself.Moments after a contentious hearing about whether Trump should be held in contempt for violating his narrowly worded gag order, the former president took to his favorite social media platform to trash the judge who holds his fate. “HIGHLY CONFLICTED, TO PUT IT MILDLY, JUDGE JUAN MERCHAN, HAS TAKEN AWAY MY CONSTITUTIONAL RIGHT TO FREE SPEECH. EVERYBODY IS ALLOWED TO TALK AND LIE ABOUT ME, BUT I AM NOT ALLOWED TO DEFEND MYSELF,” Trump wrote on Truth Social
2 days ago
INSIDER
Where Reena Virk's killers are now almost 30 years on from her murder that inspired 'Under the Bridge'
Hulu's "Under the Bridge" tells the story of the 1997 murder of Canadian teen Reena Virk. Here's where her killers, Warren Glowatski and Kelly Ellard, are now.
21 hours ago
HuffPost
Donald Trump Will Hate What Mitt Romney Just Said About The Hush Money Trial
"So far as I know, you don't pay someone $130,000 not to have sex with you," the Utah senator remarked about the ex-president's payments to Stormy Daniels.
a day ago
INSIDER
I tried Gordon Ramsay's favorite 10-minute pasta and now I know why he makes it every week
Gordon Ramsay swears by this easy 10-minute pasta dish, which he said has become a "regular midweek family meal" in his house.
9 hours ago
People
Kourtney Kardashian's Sexy Bikini Photo from Her 45th Birthday Leaves Husband Travis Barker Melting
Kardashian enjoyed a vacation in paradise with her husband and four kids in honor of "45 trips around the sun"
2 days ago
People
'My Drink Tasted Funny': Pregnant Woman's Last Words Revealed After Alleged Fatal Poisoning by Boyfriend
Jade Benning died on her 25th birthday on March 6 after she was rushed to the hospital the week before
2 days ago
People
Sydney Sweeney Poses Upside Down During Vacation: 'Hanging in Hawaii'
The 'Anyone But You' actress has been sharing her daily dose of travel content
a day ago
HuffPost UK
This Is Why Coachella Has Been Hit With A Hefty Fine Due To Lana Del Rey's Set
Organisers must fork over tens of thousands of dollars following Lana's performance on Friday night.
22 hours ago
The Daily Beast
How Putin’s Whirlwind Bromance Could End in a Kremlin Tragedy
Sputnik/Alexei Nikolsky/Kremlin via ReutersThe Kremlin is reportedly scrambling to find a successor to Ramzan Kadyrov following reports that the Chechen leader has been diagnosed with necrotizing pancreatitis, a terminal illness, according to Russian media reports.Kadyrov, also known as “Putin's attack dog” or “Putin’s soldier” for his loyalty to Russian President Vladimir Putin, has visited Moscow Central Clinical Hospital regularly through the years to undergo procedures. He was allegedly diag
a day ago
HuffPost
'I Shouldn't Have Said That': Joe Biden Mocks 1 Of Trump's Most Cherished Traits
The president took aim at one of his predecessor's personal trademarks -- and the audience loved it.
3 hours ago
Hello!
Prince Louis' birthday photos all have this in common – did you notice?
Prince William and Princess Kate's son, Louis, celebrated his sixth birthday with a new photo - and Kate Middleton has a habit of including the same detail in the annual portrait
22 hours ago
People
Florida Man Runs Over 11-Foot Alligator with Truck to Save Neighbor from Attack
"We pulled over and I got out of the car and saw that an alligator had him by the leg," Walter Rudder recalled to a local news outlet about the scary incident
a day ago
People
Kim Kardashian Reveals the Viral SKIMS Nipple Bra Was Modeled After Her Own Breasts
The bra was first released in October 2023
a day ago

Christie Brinkley uses this anti-aging eye treatment 'morning and night' — and it's on sale

GPT-4 can ace the bar, but it only has a decent chance of passing the CFA exams. Here's a list of difficult exams the ChatGPT and GPT-4 have passed.

Latest Stories

Megan Thee Stallion being sued for ‘forcing cameraman watch her having lesbian sex!’

Sophie Grégoire Trudeau is leaning into the unknown for her next chapter: 'I'm OK with the uncertainty'

“Call Her Daddy'”s Alex Cooper Models Her Wedding Night Lingerie in Instagram Reveal: See the Racy Look

Ex-teacher had sex with her student on his 8th-grade graduation, California prosecutors say

How Toxic Is Trump? Republican Group's Hidden Camera Reveals Uncomfortable Truth.

Ex-Aide Says Melania Trump Will Be Watching 'Every Ounce' of Hush Money Trial — and Looking for 1 Thing

'So appalled': What witnesses told special counsel about Trump's handling of classified info while still president

Trump Picks Another Fight With Judge as He Waits for Contempt Ruling

Where Reena Virk's killers are now almost 30 years on from her murder that inspired 'Under the Bridge'

Donald Trump Will Hate What Mitt Romney Just Said About The Hush Money Trial

I tried Gordon Ramsay's favorite 10-minute pasta and now I know why he makes it every week

Kourtney Kardashian's Sexy Bikini Photo from Her 45th Birthday Leaves Husband Travis Barker Melting

'My Drink Tasted Funny': Pregnant Woman's Last Words Revealed After Alleged Fatal Poisoning by Boyfriend

Sydney Sweeney Poses Upside Down During Vacation: 'Hanging in Hawaii'

This Is Why Coachella Has Been Hit With A Hefty Fine Due To Lana Del Rey's Set

How Putin’s Whirlwind Bromance Could End in a Kremlin Tragedy

'I Shouldn't Have Said That': Joe Biden Mocks 1 Of Trump's Most Cherished Traits

Prince Louis' birthday photos all have this in common – did you notice?

Florida Man Runs Over 11-Foot Alligator with Truck to Save Neighbor from Attack

Kim Kardashian Reveals the Viral SKIMS Nipple Bra Was Modeled After Her Own Breasts