The Ultra-Scale Playbook
The ultimate guide to training LLM on large GPU Clusters
In-depth technical articles and research pieces published by Hugging Face
The ultimate guide to training LLM on large GPU Clusters
The secrets to building world-class LLMs
Explore LLM benchmark scores over time
A new open-source dataset for training VLMs
Explore the complex relationships between 400+ machine learning models
Explore and download the FineWeb webβscale text dataset
Explore the Eiffel Tower Llama experiment with open-source models
Learn about ML and Transformers through nanochat
Explore LLM science benchmark scores
Who needs 1T parameters? Olympiad proofs with a 4B model
TRL distillation for 100B+ teachers, 40x faster
Explore PR categories with an interactive particle visualization
Building and scaling RL environments for LLM training
Explore an autonomous AI workflow for physics research
Explore synthetic data experiments on an interactive bookshelf
Interact with a live reactionβdiffusion visual simulation
Download research PDF (Pro access required)