Toggle navigation
JUPYTER
FAQ
View on GitHub
Execute on Binder
main
branch
AMD
FIM
FIM-clean
P3
Pythia
StellaAthena-patch-1
StellaAthena-patch-2
StellaAthena-patch-3
T5-patch
adafactor
adapters
alibi_memory_experiment
appseval
autotune
aws
benchmarking
bf16_update
bfloat16
bigbio
bigscience-harness-adapter
bs_scheduling
bug/767_checkpoint_reverse_compatibility
checkpoint__bug
checkpoint-improvement
chess-dt
ckpt_reshape
curriculum_learning
curt/extra-deepspeed-args
curt/prepare-data-errors
data_wader
tag
legacy_gptj_residual.1.0.0
gpt-neox
Name
lmarti's repositories
.github
configs
eval_tasks
megatron
requirements
tests
tools
.clang-format
.dockerignore
.gitignore
.pre-commit-config.yaml
CITATION.cff
CODEOWNERS
Dockerfile
LICENSE
MANIFEST.in
README.md
deepy.py
evaluate.py
generate.py
prepare_data.py
train.py