GSPO Cookbook

End-to-end examples of reinforcement fine-tuning with Group Sequence Policy Optimization (GSPO).