Open to Work

Taki WU

taki555

https://wutaiqiang.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 3 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

upvoted a paper 3 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

authored a paper about 1 month ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Organizations

commented on a paper 3 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 5 days ago • 39

upvoted a paper 3 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 5 days ago • 39

authored a paper about 1 month ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 81

upvoted 2 papers about 1 month ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 81

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Paper • 2604.07296 • Published Apr 8 • 40

upvoted a collection about 2 months ago

The Art of Efficient Reasoning

Project: https://wutaiqiang.github.io/project/Art • 8 items • Updated Mar 18 • 2

New activity in taki555/Qwen3-30B-A3B-Thinking-2507-Art 2 months ago

Improve model card metadata and content

#1 opened 2 months ago by

New activity in taki555/Qwen3-30B-A3B-Instruct-2507-Art 2 months ago

Add pipeline tag, library name, and paper link to model card

#1 opened 2 months ago by

New activity in taki555/Qwen3-4B-Instruct-2507-Art 2 months ago

Add metadata and improve model card

#1 opened 2 months ago by

New activity in taki555/Qwen3-1.7B-Art 2 months ago

Improve model card: add metadata and links

#1 opened 2 months ago by

New activity in taki555/Qwen3-0.6B-Art 2 months ago

Add pipeline tag, library name, and link to paper

#1 opened 2 months ago by

New activity in taki555/Qwen3-4B-Thinking-2507-Art 2 months ago

Update model card with metadata and paper link

#1 opened 2 months ago by

updated a collection 2 months ago

The Art of Efficient Reasoning

Project: https://wutaiqiang.github.io/project/Art • 8 items • Updated Mar 18 • 2

updated a model 2 months ago

taki555/Qwen3-30B-A3B-Thinking-2507-Art

Text Generation • 31B • Updated Mar 24 • 7

published a model 2 months ago

taki555/Qwen3-30B-A3B-Thinking-2507-Art

Text Generation • 31B • Updated Mar 24 • 7

updated a collection 3 months ago

The Art of Efficient Reasoning

Project: https://wutaiqiang.github.io/project/Art • 8 items • Updated Mar 18 • 2

updated a model 3 months ago

taki555/Qwen3-4B-Thinking-2507-Art

Text Generation • 4B • Updated Mar 24 • 7

published a model 3 months ago

taki555/Qwen3-4B-Thinking-2507-Art

Text Generation • 4B • Updated Mar 24 • 7

upvoted a paper 3 months ago

CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

Paper • 2603.00889 • Published Mar 1 • 55

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.65k