Branch: main
open-r1
Name
huggingface's repositories
.github
assets
logs
recipes
scripts
slurm
src
tests
.gitignore
LICENSE
Makefile
README.md
setup.cfg
setup.py