The corpus as a map.

Domains are lines, environments are stations. A rollout is a path through the map.

One process, capability to graded environment.

02 / Method
01
Perceive
Map a capability and its failure modes until the reward is well defined.
Capability
02
Represent
Formalize it into a task distribution with a verifiable rubric.
Rubric
03
Build
Stand up environments that separate cleanly from evaluation.
No contamination
04
Scale
Mass-produce variants across the distribution.
Distribution
05
Choose
Target the next capability, guided by where models fail.
Pass rate
06
Grade
Score every rollout against ground truth, step by step.
Dense reward