Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Loser Cheems's picture
24 20 31

Loser Cheems

JingzeShi
zipperzac's profile picture nhlayisekobvuma's profile picture Bullet500's profile picture
·
https://github.com/LoserCheems
  • LoserCheems

AI & ML interests

I like training small languge models.

Recent Activity

liked a model 18 days ago
BAAI/OpenSeek-Mid-v1
updated a model 28 days ago
JingzeShi/flash-sparse-attention
published a model 28 days ago
JingzeShi/flash-sparse-attention
View all activity

Organizations

Hugging Face Discord Community's profile picture Hugging Face Party @ PyTorch Conference's profile picture Doge Face's profile picture DIAL-TFM's profile picture

authored 2 papers 4 months ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published Jan 22 • 19

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

Paper • 2602.05711 • Published Feb 5 • 12
submitted a paper to Daily Papers 4 months ago

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

Paper • 2602.05711 • Published Feb 5 • 12
authored a paper 10 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4, 2025 • 19
authored a paper 12 months ago

Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting

Paper • 2505.19716 • Published May 26, 2025 • 4
authored a paper over 1 year ago

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Paper • 2407.16958 • Published Jul 24, 2024 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs