From "Re-evaluating GPT-4’s bar exam performance" (linked in the article):
First, although GPT-4’s UBE score nears the 90th percentile when examining approximate conversions from February administrations of the Illinois Bar Exam, these estimates are heavily skewed towards repeat test-takers who failed the July administration and score significantly lower than the general test-taking population.
Ohhh, that is sneaky!
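For anyone wondering why the comparison pool matters so much, here is a rough sketch of the effect. The scores and both pools below are made up purely for illustration (they are not bar exam data, and the 0-100 scale is hypothetical); the point is only that the same score gets a very different percentile depending on who it is ranked against.

```python
from bisect import bisect_left


def percentile_rank(score, population):
    """Return the percent of `population` scoring strictly below `score`."""
    ranked = sorted(population)
    return 100.0 * bisect_left(ranked, score) / len(ranked)


# Made-up scores on a hypothetical 0-100 scale, NOT real exam data.
# A pool resembling the general test-taking population:
general_pool = [50, 55, 60, 65, 70, 75, 80, 85, 90, 95]
# A pool skewed toward lower scorers (e.g. mostly repeat test-takers):
repeat_heavy_pool = [40, 45, 50, 55, 60, 62, 65, 68, 70, 80]

score = 72  # hypothetical score being ranked

pct_general = percentile_rank(score, general_pool)
pct_repeat = percentile_rank(score, repeat_heavy_pool)

print(f"vs general pool:      {pct_general:.0f}th percentile")
print(f"vs repeat-heavy pool: {pct_repeat:.0f}th percentile")
```

With these invented numbers the same score sits at the median of one pool and near the top of the other, which is the kind of gap the quoted passage is describing.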