1 38 197

Wassim Trabelsi PRO

wath5

https://www.wassimt.com

wassim-trabelsi

AI & ML interests

AI Engineer NLP & Computer Vision

Recent Activity

liked a Space 17 days ago

AdithyaSK/rl-environments-guide

liked a Space 17 days ago

systms/ACTION-HF

liked a Space 17 days ago

smolagents/ml-intern

View all activity

Organizations

upvoted 2 articles 2 months ago

Article

History of State Space Models (SSM) in 2022

lbourdois

•

Apr 11, 2024

• 28

Article

Introduction to State Space Models (SSM)

lbourdois

•

Jul 19, 2024

• 226

upvoted a paper 2 months ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

upvoted 2 articles 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 164

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

MaziyarPanahi

•

Feb 7

• 22

upvoted 7 articles 4 months ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c

•

Feb 4

• 90

Article

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

thebajajra

•

Jan 24

• 44

Article

One Year Since the “DeepSeek Moment”

huggingface

•

Jan 20

• 62

Article

How We Built a Semantic Highlight Model To Save Token Cost for RAG

zilliz

•

Jan 15

• 67

Article

Differential Transformer V2

microsoft

•

Jan 20

• 51

Article

Open Responses: What you need to know

evalstate, burtenshaw, merve, pcuenq

•

Jan 15

• 112

Article

The Optimal Architecture for Small Language Models

codelion

•

Dec 26, 2025

• 121

upvoted 3 articles 6 months ago

Article

Curating datasets directly on the Hub

dvilasuero

•

Nov 27, 2025

• 22

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 396

Article

Diffusers welcomes FLUX-2

YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart

•

Nov 25, 2025

• 191

upvoted an article 8 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll

•

Oct 1, 2025

• 144

upvoted 2 papers 8 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 130

Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

Paper • 2509.22921 • Published Sep 26, 2025 • 12

upvoted 2 articles 8 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter

•

Sep 22, 2025

• 134

Article

RexBERT: Encoders for a brave new world of E-Commerce

thebajajra

•

Sep 20, 2025

• 50

Wassim Trabelsi PRO

AI & ML interests

Recent Activity

Organizations

wath5's activity

History of State Space Models (SSM) in 2022

Introduction to State Space Models (SSM)

Mixture of Experts (MoEs) in Transformers

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Community Evals: Because we're done trusting black-box leaderboards over the community

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

One Year Since the “DeepSeek Moment”

How We Built a Semantic Highlight Model To Save Token Cost for RAG

Differential Transformer V2

Open Responses: What you need to know

The Optimal Architecture for Small Language Models

Curating datasets directly on the Hub

Continuous batching from first principles

Diffusers welcomes FLUX-2

Introducing RTEB: A New Standard for Retrieval Evaluation

Gaia2 and ARE: Empowering the community to study agents

RexBERT: Encoders for a brave new world of E-Commerce