Article Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds nvidia • Mar 11 • 6
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 196
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 10 items • Updated 24 days ago • 81
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published Jan 9 • 86
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix codelion • Nov 3, 2025 • 65
Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
The Smol Training Playbook 📚 Space • The secrets to building world-class LLMs • Running on CPU Upgrade • 3.16k
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 30