When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
view article Article Open Responses: What you need to know +2 evalstate, burtenshaw, merve, pcuenq • Jan 15 • 111
view article Article Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models nvidia • Jan 6 • 28
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq • Dec 18, 2025 • 124
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face mattt • Dec 5, 2025 • 43
view article Article Building Deep Research: How we Achieved State of the Art Tavily • Nov 24, 2025 • 36
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 626
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24, 2025 • 25
view article Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms mattt • Nov 20, 2025 • 42