KW's picture

KW

kevineen

·

AI & ML interests

None yet

Recent Activity

liked a model about 16 hours ago

nvidia/Nemotron-Labs-Diffusion-14B

liked a model 2 days ago

NemoStation/Marlin-2B

upvoted a collection 3 days ago

Ettin Rerankers

View all activity

Organizations

upvoted a collection 3 days ago

Ettin Rerankers

8 items • Updated 3 days ago • 7

upvoted an article 3 days ago

Article

SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization

RikkaBotan

•

9 days ago

• 2

upvoted a paper 14 days ago

RLDX-1 Technical Report

Paper • 2605.03269 • Published 17 days ago • 122

upvoted a paper 17 days ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published 18 days ago • 333

upvoted an article 23 days ago

Article

Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset

Aratako

•

Aug 14, 2025

• 12

upvoted a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 160

upvoted a paper about 2 months ago

daVinci-LLM:Towards the Science of Pretraining

Paper • 2603.27164 • Published Mar 28 • 32

upvoted a paper 3 months ago

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published Feb 26 • 23

upvoted 2 articles 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

+5

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 160

Article

Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach

oopere

•

Nov 24, 2024

• 20

upvoted a collection 3 months ago

GPT-OSS-Swallow-v0.1

4 items • Updated Feb 20 • 13

upvoted 3 papers 4 months ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published Feb 2 • 60

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 23

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Paper • 2601.19798 • Published Jan 27 • 44

upvoted an article 4 months ago

Article

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

+3

lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics

•

Jan 20

• 43

upvoted a paper 4 months ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 74

upvoted a collection 4 months ago

TranslateGemma

3 items • Updated Mar 12 • 239

upvoted a paper 4 months ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 214

upvoted a collection 4 months ago

EnvScaler

The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 10 items • Updated Mar 7 • 3

upvoted a paper 4 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 290