Blog posts

2025

How Confident Are You, ChatGPT?

8 minute read

Published:

TL;DR: The key to OpenAI’s IMO gold likely wasn’t a universal verifier. Calibration and uncertainty estimation are central, and will shape future products.