- Configure simple and modular evals and metrics for agent runs
- Run inference on and finetune Open models via Synth endpoints to iterate on hypotheses
- And more
- Evals Demo: compare models and export datasets
- Rejection Training: generate → filter → train → run