Posts by Tags

OpenAI

How Confident Are You, ChatGPT?

8 minute read

Published:

TL;DR: The key to OpenAI’s IMO gold likely wasn’t a universal verifier. Calibration and uncertainty estimation are central, and will shape future products.

RL

How Confident Are You, ChatGPT?

8 minute read

Published:

TL;DR: The key to OpenAI’s IMO gold likely wasn’t a universal verifier. Calibration and uncertainty estimation are central, and will shape future products.

Reasoning Models

How Confident Are You, ChatGPT?

8 minute read

Published:

TL;DR: The key to OpenAI’s IMO gold likely wasn’t a universal verifier. Calibration and uncertainty estimation are central, and will shape future products.