TurboQuant 4-bit mlx-lm models. TriAttention compatible. PR #1 merged MIT+NVIDIA.
- deadbydawn101/gemma-4-E4B-mlx-4bit
  Image-Text-to-Text • 2B • Updated • 1.18k • 6
- deadbydawn101/gemma-4-E4B-Agentic-Opus-Reasoning-GeminiCLI-mlx-4bit
  Text Generation • Updated • 11.8k • 19
- deadbydawn101/gemma-4-E2B-Heretic-Uncensored-mlx-4bit
  Image-Text-to-Text • 1B • Updated • 8.17k • 14
- deadbydawn101/gemma-4-21b-REAP-Tool-Calling-mlx-4bit
  Image-Text-to-Text • 4B • Updated • 1.41k • 4