UncGPT 2026 — NeurIPS Competition
Public asset family for the UncGPT 2026 NeurIPS Competition proposal.
This repository contains the two step-100 smoke checkpoints from the strict-clean, 1,646-conversation comparison run on 2026-05-15.
These checkpoints are early pipeline-verification artifacts, not production models. They were trained from scratch for 100 steps to validate architecture, tokenizer, ordered-skill data loading, checkpointing, and inference bring-up.
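Because these are verification artifacts, it can be useful to confirm that downloaded files match the `SHA256SUMS` manifest shipped in this repository. The following is a minimal sketch, assuming the manifest uses the common `sha256sum` layout (`<hex digest>  <relative path>` per line); that layout and the helper names `sha256_of` and `verify_manifest` are assumptions, not something this repository documents.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(manifest_path):
    """Check every entry of a sha256sum-style manifest (assumed format).

    Paths are resolved relative to the manifest's directory.
    Returns {relative_path: True/False}; a missing file counts as False.
    """
    manifest_path = Path(manifest_path)
    root = manifest_path.parent
    results = {}
    for line in manifest_path.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # sha256sum marks binary mode with a leading '*'
        target = root / name
        results[name] = target.is_file() and sha256_of(target) == expected.lower()
    return results
```

Calling `verify_manifest("SHA256SUMS")` from the repository root returns a dictionary keyed by path; any `False` value points to a missing or altered file.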
- `v1_original_moe/uncgpt69_step100.pt`
- `lfm2small/uncgpt69_step100.pt`
- `configs/`
- `tokenizer/`
- `source/`
- `SHA256SUMS`

Both arms completed 100 steps successfully on the same 1,646-conversation corpus.

| Arm | Step 100 loss | Training throughput |
|---|---|---|
| v1 original MoE | 5.7438 | ~28.6k tok/s |
| LFM2-small | 5.5756 | ~349k tok/s |
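
To put the two rows side by side, the sketch below computes the throughput ratio and the loss gap from the table values. The throughput figures are the approximate ones reported above, and the helper names are hypothetical. A 100-step loss gap on a smoke run is a pipeline sanity check, not a quality ranking of the two architectures.

```python
# Values copied from the table above; throughputs are approximate (~).
RESULTS = {
    "v1 original MoE": {"loss": 5.7438, "tok_per_s": 28_600},
    "LFM2-small": {"loss": 5.5756, "tok_per_s": 349_000},
}


def throughput_ratio(results, fast, slow):
    """How many times faster `fast` trains than `slow`, in tokens per second."""
    return results[fast]["tok_per_s"] / results[slow]["tok_per_s"]


def loss_gap(results, a, b):
    """Step-100 loss of arm `a` minus that of arm `b`."""
    return results[a]["loss"] - results[b]["loss"]


if __name__ == "__main__":
    ratio = throughput_ratio(RESULTS, "LFM2-small", "v1 original MoE")
    gap = loss_gap(RESULTS, "v1 original MoE", "LFM2-small")
    print(f"LFM2-small trains about {ratio:.1f}x faster")
    print(f"v1 original MoE loss is higher by {gap:.4f}")
```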
The v1 cached decode path currently benchmarks at roughly 13.5 tok/s at batch size 1 on the A100 cluster for a prompt length of 128, versus roughly 2.0 tok/s for naive full-prefix decode, a speedup of about 6.75x. This shows that the cache path works, but it does not yet represent the intended optimized inference path.
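
The gap between the two decode modes comes from how many token positions each one feeds through the model. The sketch below counts those positions for a generic autoregressive decoder. It is an illustration under stated assumptions, not this repository's inference code: the function names are hypothetical, and the number of generated tokens (64 in the example) is not specified by the benchmark above.

```python
def naive_positions(prompt_len, new_tokens):
    """Positions processed when every step re-runs the full prefix.

    Step i (0-indexed) feeds the prompt plus the i tokens generated so far.
    """
    if new_tokens < 1:
        raise ValueError("new_tokens must be at least 1")
    return sum(prompt_len + i for i in range(new_tokens))


def cached_positions(prompt_len, new_tokens):
    """Positions processed with a key/value cache.

    The prompt is processed once (prefill) and yields the first token;
    each later token feeds only the single most recent position.
    """
    if new_tokens < 1:
        raise ValueError("new_tokens must be at least 1")
    return prompt_len + (new_tokens - 1)


if __name__ == "__main__":
    prompt_len, new_tokens = 128, 64  # new_tokens is an assumed value
    naive = naive_positions(prompt_len, new_tokens)
    cached = cached_positions(prompt_len, new_tokens)
    print(f"naive: {naive} positions, cached: {cached} positions")
```

These counts describe work per generation, not wall-clock speed: per-step overheads such as kernel launches and memory traffic are ignored, so the position ratio should not be read as a prediction of the measured tok/s figures.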