📝 Research & Long-Form Blog Posts

sergiopaniego 's Collections

Bringing Autonomous Driving RL to OpenEnv and TRL resources

Amazing design resources

Vision reasoning datasets

GUI Grounding datasets

My vision Spaces

👁 Vision comparison ftw

😎 Awesome vision Spaces

Vision Language Models: 2025 Update

updated 5 days ago

In-depth technical articles and research pieces published by Hugging Face

Upvote

Running

3.87k

The Ultra-Scale Playbook

🌌

3.87k

The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade

Featured

3.2k

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs
Running

325

Evaluation Guidebook

📝

325

Explore LLM benchmark scores over time
Running

225

FineVision: Open Data is All You Need

📝

225

A new open-source dataset for training VLMs
Running

81

Maintain the unmaintainable

📚

81

Explore the complex relationships between 400+ machine learning models
Running

Featured

1.35k

FineWeb: decanting the web for the finest text data at scale

🍷

1.35k

Explore and download the FineWeb web‑scale text dataset
Running

117

The Eiffel Tower Llama

📝

117

Explore the Eiffel Tower Llama experiment with open-source models
Running

Featured

49

Porting nanochat to Transformers: an AI modeling history lesson

📝

49

Learn about ML and Transformers through nanochat
Running

Featured

74

FinePDFs: Liberating 3T of the finest tokens from PDFs

📄

74
Running

28

Can LLMs Play the Game of Science?

📝

28

Explore LLM science benchmark scores
Running

Featured

74

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

74

Who needs 1T parameters? Olympiad proofs with a 4B model
Running

Featured

85

Distilling 100B+ Models 40x Faster with TRL

📝

85

TRL distillation for 100B+ teachers, 40x faster
Running

12

Extracting Signal from the Noise

📝

12

Explore PR categories with an interactive particle visualization
Running

177

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

177

Building and scaling RL environments for LLM training
Running

49

physics-intern: an Autonomous Agent for Physics Research

📝

49

Explore an autonomous AI workflow for physics research
Running on CPU Upgrade

244

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

244

Explore synthetic data experiments on an interactive bookshelf
Running

4

Token-In, Token-Out Done Right

🧩

4

Interact with a live reaction‑diffusion visual simulation
Running

18

Defeating the trainer-generator precision mismatch in TRL

🎯

18

Download research PDF (Pro access required)

Upvote

📝 Research & Long-Form Blog Posts

The Ultra-Scale Playbook

The Smol Training Playbook

Evaluation Guidebook

FineVision: Open Data is All You Need

Maintain the unmaintainable

FineWeb: decanting the web for the finest text data at scale

The Eiffel Tower Llama

Porting nanochat to Transformers: an AI modeling history lesson

FinePDFs: Liberating 3T of the finest tokens from PDFs

Can LLMs Play the Game of Science?

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Distilling 100B+ Models 40x Faster with TRL

Extracting Signal from the Noise

The ultimate guide to RL environments: building and scaling them in the LLM era

physics-intern: an Autonomous Agent for Physics Research

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Token-In, Token-Out Done Right

Defeating the trainer-generator precision mismatch in TRL