Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CKeibel
's Collections
SLMs
PII
Code-Embeddings
Speech2Text (ASR)
Seq2Seq
Reward Models
diffusion models
Text-Classification
Data
PEFT (Papers)
LLMs (Papers)
Causal LMs, seq2seq models
Embedding models
Vision stuff
datasets
NER
BERT based tasks (models)
Multimodal
Data
updated
Feb 13
Upvote
-
Sort: Collection
HuggingFaceFW/fineweb-2
Viewer
•
Updated
Oct 27, 2025
•
4.48B
•
94.6k
•
826
allenai/c4
Viewer
•
Updated
Jan 9, 2024
•
10.4B
•
830k
•
601
ServiceNow-AI/R1-Distill-SFT
Viewer
•
Updated
Feb 8, 2025
•
1.85M
•
2.87k
•
322
PrimeIntellect/INTELLECT-2-RL-Dataset
Viewer
•
Updated
May 13, 2025
•
285k
•
126
•
66
togethercomputer/RedPajama-Data-V2
Updated
Nov 21, 2024
•
7.08k
•
403
wikimedia/wikipedia
Viewer
•
Updated
Jan 9, 2024
•
61.6M
•
155k
•
1.25k
avemio/German-RAG-EMBEDDING-TRIPLES-HESSIAN-AI
Viewer
•
Updated
Oct 16, 2024
•
294k
•
7
•
1
urchade/synthetic-pii-ner-mistral-v1
Updated
Apr 20, 2024
•
234
•
16
yahma/alpaca-cleaned
Viewer
•
Updated
Apr 10, 2023
•
51.8k
•
21.9k
•
843
Upvote
-
Sort: Collection
Share collection
View history
Collection guide
Browse collections