Article SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization RikkaBotan • 9 days ago • 2
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 18 days ago • 333
Article Magpie Speech: Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset Aratako • Aug 14, 2025 • 12
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 160
Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization Paper • 2602.22675 • Published Feb 26 • 23
Article Mixture of Experts (MoEs) in Transformers ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 160
Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach oopere • Nov 24, 2024 • 20
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 60
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published Jan 20 • 23
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Paper • 2601.19798 • Published Jan 27 • 44
Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics • Jan 20 • 43
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published Jan 21 • 74
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published Jan 11 • 214
EnvScaler Collection The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 10 items • Updated Mar 7 • 3
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 290