Minki Kang

Nardien

AI & ML interests

None yet

Recent Activity

submitted a paper 9 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

upvoted a paper 6 days ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published 9 days ago • 30

upvoted a paper 9 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published 12 days ago • 33

upvoted a paper 12 days ago

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published 16 days ago • 28

upvoted 2 papers about 1 month ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 81

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

upvoted a paper 2 months ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published Mar 21 • 37

upvoted 2 papers 3 months ago

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Paper • 2603.09827 • Published Mar 10 • 30

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published Feb 19 • 56

upvoted 3 papers 4 months ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

Paper • 2601.16746 • Published Jan 23 • 91

Self-Refining Video Sampling

Paper • 2601.18577 • Published Jan 26 • 25

upvoted 2 papers 5 months ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published Jan 2 • 57

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

upvoted 2 papers 6 months ago

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

Paper • 2512.02425 • Published Dec 2, 2025 • 25

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Paper • 2511.22173 • Published Nov 27, 2025 • 15

upvoted 5 papers 7 months ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 42

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3, 2025 • 2

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 104

AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published Oct 28, 2025 • 73

ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs

Paper • 2510.04767 • Published Oct 6, 2025 • 28