Brilliant.org, but for LLM evals

Learn LLM evaluation
by actually doing it

Tap-through lessons that teach RAG evals, agent evaluation, and production monitoring the way Brilliant teaches physics — through manipulation, not memorization.

What a lesson feels like

Not a tutorial.
An experience.

  • One idea per screen
  • Zero walls of text
  • Working code you can run today
  • Real prod incidents to diagnose

What’s the faithfulness score?

0%

Faithfulness

Partially Faithful

✓ In the doc
✓ In the doc
✗ Made up

Six tracks · 24 lessons

The complete map of LLM eval

Every lesson teaches architecture, implementation, production monitoring, and how to know when you're ready to ship.

Meet the token characters

Faithful
Retriever
Judge
Hallucinator

Start with one lesson.
7 minutes. No setup.

Did it actually use the document? ↓