Research
Environments are the new data.
Models in 2026 don’t get better from more static prompts. They get better from running through realistic environments and being judged on the whole workflow. Our research artifacts ship benchmarks, environments, and the runtime that runs both.


