
ZHANG Yutong

wrodriguez509

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

upvoted a paper 4 days ago

SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

upvoted a paper 4 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining


Organizations

None yet

upvoted a paper 2 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 6 days ago • 199

upvoted 2 papers 4 days ago

SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

Paper • 2605.18401 • Published 8 days ago • 125

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 12 days ago • 143

liked a dataset 5 days ago

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 723k • 52

liked a dataset 7 days ago

radioKale/so100_test_01_20260518_152244

Viewer • Updated 7 days ago • 29.9k • 117 • 1

liked a model 11 days ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 4.45M • 13.3k

upvoted a paper 14 days ago

RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation

Paper • 2605.04523 • Published 20 days ago • 46

liked a dataset 19 days ago

haarriia/gt

Updated 11 minutes ago • 8.6k • 1

upvoted a paper 25 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 26 days ago • 218

liked a dataset 25 days ago

HennyPr/ps2_hf2

Viewer • Updated Apr 5 • 1 • 797k • 11

liked a model about 1 month ago

stabilityai/sdxl-turbo

Text-to-Image • Updated Jul 10, 2024 • 1.11M • 2.58k

upvoted 3 papers about 1 month ago

NTIRE 2026 Challenge on Video Saliency Prediction: Methods and Results

Paper • 2604.14816 • Published Apr 16 • 3

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

liked a model about 1 month ago

michelinolinolino/gemma4-4b-sci

Text Generation • 8B • Updated Apr 16 • 162 • 3

upvoted 2 papers about 1 month ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

upvoted 2 papers about 2 months ago

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 123

liked a Space about 2 months ago

VoxCPM Demo

🎙

492

VoxCPM2 Nano-vLLM Demo
