New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
AI & ML interests
Open Source AI 🦥
Recent Activity
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-Next.
-
unsloth/Qwen3-Coder-Next-GGUF
Text Generation • 80B • Updated • 281k • 302 -
unsloth/Qwen3-Coder-Next-FP8-Dynamic
Text Generation • 80B • Updated • 29.1k • 30 -
unsloth/Qwen3-Coder-Next
Text Generation • 80B • Updated • 20.3k • 13 -
unsloth/Qwen3-Coder-Next-FP8
Text Generation • 80B • Updated • 2.88k • 5
Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.
-
unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF
Image-Text-to-Text • 31B • Updated • 167k • 78 -
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF
Image-Text-to-Text • 31B • Updated • 34.9k • 35 -
unsloth/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 65.6k • 39 -
unsloth/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 10.3k • 21
DeepSeek's new 3.1 update to their V3 models!
Run or fine-tune embedding models with Unsloth.
-
unsloth/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 10.2k • • 9 -
unsloth/embeddinggemma-300m-GGUF
Sentence Similarity • 0.3B • Updated • 6.12k • 48 -
unsloth/Qwen3-Embedding-0.6B
Feature Extraction • 0.6B • Updated • 2.83k • • 4 -
unsloth/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 1.81k • 1
Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
-
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text • 7B • Updated • 21k • 189 -
unsloth/gemma-3n-E2B-it-GGUF
Image-Text-to-Text • 4B • Updated • 24.8k • 58 -
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 14.9k • 9 -
unsloth/gemma-3n-E4B-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 1.65k • 4
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
-
unsloth/Phi-4-reasoning-plus-GGUF
Text Generation • 15B • Updated • 4.6k • 77 -
unsloth/Phi-4-mini-reasoning-GGUF
Text Generation • 4B • Updated • 5.78k • 59 -
unsloth/Phi-4-reasoning-GGUF
Text Generation • 15B • Updated • 1.11k • 19 -
unsloth/phi-4-GGUF
Text Generation • 15B • Updated • 2k • 181
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
-
unsloth/DeepSeek-V3-0324-GGUF-UD
Text Generation • 671B • Updated • 1.02k • 21 -
unsloth/DeepSeek-V3-0324-GGUF
Text Generation • 671B • Updated • 3.91k • 197 -
unsloth/DeepSeek-V3-0324
Text Generation • 684B • Updated • 9 • 7 -
unsloth/DeepSeek-V3-0324-BF16
Text Generation • 684B • Updated • 24.4k • 4
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
-
unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF
Image-Text-to-Text • 24B • Updated • 39.8k • 158 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506
Image-Text-to-Text • 24B • Updated • 3.37k • • 12 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
Image-Text-to-Text • Updated • 606 • 6 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit
Image-Text-to-Text • 25B • Updated • 2.46k • 12
Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
-
unsloth/QwQ-32B-GGUF
Text Generation • 33B • Updated • 1.59k • 86 -
unsloth/QwQ-32B-unsloth-bnb-4bit
Text Generation • 34B • Updated • 540 • 47 -
unsloth/QwQ-32B
Text Generation • 33B • Updated • 29 • • 17 -
unsloth/QwQ-32B-bnb-4bit
Text Generation • 34B • Updated • 354 • 4
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 30.5k • 87 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Llama-3.2-11B-Vision
Image-Text-to-Text • 11B • Updated • 521 • 34 -
unsloth/Llama-3.2-11B-Vision-bnb-4bit
Image-Text-to-Text • 11B • Updated • 153 • 16
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
-
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 254k • 95 -
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 35k • 4 -
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 154k • 4 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 27.5k • 109
Native bitsandbytes 4bit pre quantized models
-
unsloth/Llama-3.2-3B-bnb-4bit
Text Generation • 3B • Updated • 11.5k • 21 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 27.5k • 109 -
unsloth/llama-3-8b-Instruct-bnb-4bit
Text Generation • 8B • Updated • 86.6k • 133 -
unsloth/gemma-2-9b-bnb-4bit
Text Generation • 10B • Updated • 9.06k • 31
Find GGUFs and other variants of diffusion based Qwen-Image and FLUX models.
-
unsloth/Qwen-Image-2512-GGUF
Text-to-Image • 20B • Updated • 57.7k • • 294 -
unsloth/LTX-2-GGUF
Image-to-Video • 19B • Updated • 41.8k • 108 -
unsloth/Z-Image-GGUF
Text-to-Image • 6B • Updated • 22.4k • 106 -
unsloth/FLUX.2-klein-9B-GGUF
Image-to-Image • 9B • Updated • 69.5k • 91
OpenAI's gpt-oss-20b and gpt-oss-120b is here! The powerful open models are available in GGUF, original & 4-bit formats.
-
unsloth/gpt-oss-20b-GGUF
Text Generation • 21B • Updated • 143k • 585 -
unsloth/gpt-oss-120b-GGUF
Text Generation • 117B • Updated • 85k • 209 -
unsloth/gpt-oss-20b-unsloth-bnb-4bit
Text Generation • Updated • 173k • 35 -
unsloth/gpt-oss-120b-unsloth-bnb-4bit
Text Generation • 117B • Updated • 24.2k • 13
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
-
unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF
31B • Updated • 39k • 288 -
unsloth/Qwen3-4B-Instruct-2507-GGUF
4B • Updated • 64.9k • 145 -
unsloth/Qwen3-4B-Thinking-2507-GGUF
4B • Updated • 12.3k • 89 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF
Text Generation • 480B • Updated • 4.52k • 169
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 77.6k • 376 -
unsloth/DeepSeek-R1-0528-GGUF
Text Generation • 671B • Updated • 15k • 194 -
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit
Text Generation • 8B • Updated • 6.48k • 13 -
unsloth/DeepSeek-R1-0528
Text Generation • 685B • Updated • 24 • 15
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
-
unsloth/gemma-3-270m-it-GGUF
Text Generation • 0.3B • Updated • 40.6k • 147 -
unsloth/gemma-3-270m-it-qat-GGUF
Text Generation • 0.3B • Updated • 5.65k • 11 -
unsloth/gemma-3-270m-it
Text Generation • 0.3B • Updated • 23.4k • 22 -
unsloth/gemma-3-270m-it-unsloth-bnb-4bit
Text Generation • 0.3B • Updated • 16.2k • 5
IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
-
unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
Image-Text-to-Text • 108B • Updated • 105k • 135 -
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF
Image-Text-to-Text • 401B • Updated • 17.2k • 43 -
unsloth/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text • 109B • Updated • 446 • 56 -
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit
Image-Text-to-Text • Updated • 748 • 80
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
-
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 17.3k • 36 -
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit
Text Generation • 15B • Updated • 5.05k • 30 -
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
Text Generation • Updated • 4.49k • 24 -
unsloth/gemma-3-12b-it-unsloth-bnb-4bit
Image-Text-to-Text • 12B • Updated • 39.7k • 24
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
-
unsloth/orpheus-3b-0.1-ft-GGUF
Text-to-Speech • 3B • Updated • 833 • 11 -
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit
Text-to-Speech • 3B • Updated • 10.1k • 16 -
unsloth/csm-1b
Text-to-Speech • 2B • Updated • 14.4k • 19 -
unsloth/whisper-large-v3
Automatic Speech Recognition • Updated • 7.23k • 15
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
-
unsloth/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 70.3k • 53 -
unsloth/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 135k • 90 -
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit
Text Generation • 0.8B • Updated • 74.2k • 4 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 37.6k • 22
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
-
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text • 3B • Updated • 14.3k • 19 -
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 62.1k • 142 -
unsloth/Qwen2.5-VL-32B-Instruct-GGUF
Image-Text-to-Text • 33B • Updated • 912 • 7 -
unsloth/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text • 73B • Updated • 1.43k • 7
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 30.5k • 87 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit
Image-Text-to-Text • Updated • 4.53k • 28 -
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text • 9B • Updated • 2.18k • 6
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
-
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.48k • 74 -
unsloth/Qwen2.5-Coder-14B-Instruct-128K-GGUF
15B • Updated • 1.7k • 34 -
unsloth/Qwen2.5-Coder-7B-Instruct-128K-GGUF
8B • Updated • 3.58k • 20 -
unsloth/Qwen2.5-Coder-3B-Instruct-128K-GGUF
3B • Updated • 1.31k • 16
-
unsloth/Qwen2.5-7B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 69.4k • 20 -
unsloth/Qwen2.5-7B-Instruct
Text Generation • 8B • Updated • 60k • • 22 -
unsloth/Qwen2.5-14B-bnb-4bit
Text Generation • 15B • Updated • 3.07k • 5 -
unsloth/Qwen2.5-7B-bnb-4bit
Text Generation • 8B • Updated • 7.96k • 6
-
unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Text Generation • 3B • Updated • 51.2k • 33 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 37.6k • 22 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 254k • 95
New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
Find GGUFs and other variants of diffusion based Qwen-Image and FLUX models.
-
unsloth/Qwen-Image-2512-GGUF
Text-to-Image • 20B • Updated • 57.7k • • 294 -
unsloth/LTX-2-GGUF
Image-to-Video • 19B • Updated • 41.8k • 108 -
unsloth/Z-Image-GGUF
Text-to-Image • 6B • Updated • 22.4k • 106 -
unsloth/FLUX.2-klein-9B-GGUF
Image-to-Image • 9B • Updated • 69.5k • 91
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-Next.
-
unsloth/Qwen3-Coder-Next-GGUF
Text Generation • 80B • Updated • 281k • 302 -
unsloth/Qwen3-Coder-Next-FP8-Dynamic
Text Generation • 80B • Updated • 29.1k • 30 -
unsloth/Qwen3-Coder-Next
Text Generation • 80B • Updated • 20.3k • 13 -
unsloth/Qwen3-Coder-Next-FP8
Text Generation • 80B • Updated • 2.88k • 5
OpenAI's gpt-oss-20b and gpt-oss-120b is here! The powerful open models are available in GGUF, original & 4-bit formats.
-
unsloth/gpt-oss-20b-GGUF
Text Generation • 21B • Updated • 143k • 585 -
unsloth/gpt-oss-120b-GGUF
Text Generation • 117B • Updated • 85k • 209 -
unsloth/gpt-oss-20b-unsloth-bnb-4bit
Text Generation • Updated • 173k • 35 -
unsloth/gpt-oss-120b-unsloth-bnb-4bit
Text Generation • 117B • Updated • 24.2k • 13
Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats.
-
unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF
Image-Text-to-Text • 31B • Updated • 167k • 78 -
unsloth/Qwen3-VL-30B-A3B-Thinking-GGUF
Image-Text-to-Text • 31B • Updated • 34.9k • 35 -
unsloth/Qwen3-VL-4B-Instruct-GGUF
Image-Text-to-Text • 4B • Updated • 65.6k • 39 -
unsloth/Qwen3-VL-4B-Thinking-GGUF
Image-Text-to-Text • 4B • Updated • 10.3k • 21
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
-
unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF
31B • Updated • 39k • 288 -
unsloth/Qwen3-4B-Instruct-2507-GGUF
4B • Updated • 64.9k • 145 -
unsloth/Qwen3-4B-Thinking-2507-GGUF
4B • Updated • 12.3k • 89 -
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF
Text Generation • 480B • Updated • 4.52k • 169
DeepSeek's new 3.1 update to their V3 models!
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 77.6k • 376 -
unsloth/DeepSeek-R1-0528-GGUF
Text Generation • 671B • Updated • 15k • 194 -
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit
Text Generation • 8B • Updated • 6.48k • 13 -
unsloth/DeepSeek-R1-0528
Text Generation • 685B • Updated • 24 • 15
Run or fine-tune embedding models with Unsloth.
-
unsloth/embeddinggemma-300m
Sentence Similarity • 0.3B • Updated • 10.2k • • 9 -
unsloth/embeddinggemma-300m-GGUF
Sentence Similarity • 0.3B • Updated • 6.12k • 48 -
unsloth/Qwen3-Embedding-0.6B
Feature Extraction • 0.6B • Updated • 2.83k • • 4 -
unsloth/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 1.81k • 1
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
-
unsloth/gemma-3-270m-it-GGUF
Text Generation • 0.3B • Updated • 40.6k • 147 -
unsloth/gemma-3-270m-it-qat-GGUF
Text Generation • 0.3B • Updated • 5.65k • 11 -
unsloth/gemma-3-270m-it
Text Generation • 0.3B • Updated • 23.4k • 22 -
unsloth/gemma-3-270m-it-unsloth-bnb-4bit
Text Generation • 0.3B • Updated • 16.2k • 5
Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes.
IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth.
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
-
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text • 7B • Updated • 21k • 189 -
unsloth/gemma-3n-E2B-it-GGUF
Image-Text-to-Text • 4B • Updated • 24.8k • 58 -
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 14.9k • 9 -
unsloth/gemma-3n-E4B-unsloth-bnb-4bit
Image-Text-to-Text • 8B • Updated • 1.65k • 4
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
-
unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
Image-Text-to-Text • 108B • Updated • 105k • 135 -
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF
Image-Text-to-Text • 401B • Updated • 17.2k • 43 -
unsloth/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text • 109B • Updated • 446 • 56 -
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit
Image-Text-to-Text • Updated • 748 • 80
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
-
unsloth/Phi-4-reasoning-plus-GGUF
Text Generation • 15B • Updated • 4.6k • 77 -
unsloth/Phi-4-mini-reasoning-GGUF
Text Generation • 4B • Updated • 5.78k • 59 -
unsloth/Phi-4-reasoning-GGUF
Text Generation • 15B • Updated • 1.11k • 19 -
unsloth/phi-4-GGUF
Text Generation • 15B • Updated • 2k • 181
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
-
unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
Text Generation • 5B • Updated • 17.3k • 36 -
unsloth/DeepSeek-R1-Distill-Qwen-14B-unsloth-bnb-4bit
Text Generation • 15B • Updated • 5.05k • 30 -
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
Text Generation • Updated • 4.49k • 24 -
unsloth/gemma-3-12b-it-unsloth-bnb-4bit
Image-Text-to-Text • 12B • Updated • 39.7k • 24
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
-
unsloth/DeepSeek-V3-0324-GGUF-UD
Text Generation • 671B • Updated • 1.02k • 21 -
unsloth/DeepSeek-V3-0324-GGUF
Text Generation • 671B • Updated • 3.91k • 197 -
unsloth/DeepSeek-V3-0324
Text Generation • 684B • Updated • 9 • 7 -
unsloth/DeepSeek-V3-0324-BF16
Text Generation • 684B • Updated • 24.4k • 4
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
-
unsloth/orpheus-3b-0.1-ft-GGUF
Text-to-Speech • 3B • Updated • 833 • 11 -
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit
Text-to-Speech • 3B • Updated • 10.1k • 16 -
unsloth/csm-1b
Text-to-Speech • 2B • Updated • 14.4k • 19 -
unsloth/whisper-large-v3
Automatic Speech Recognition • Updated • 7.23k • 15
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
-
unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF
Image-Text-to-Text • 24B • Updated • 39.8k • 158 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506
Image-Text-to-Text • 24B • Updated • 3.37k • • 12 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
Image-Text-to-Text • Updated • 606 • 6 -
unsloth/Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit
Image-Text-to-Text • 25B • Updated • 2.46k • 12
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
-
unsloth/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 70.3k • 53 -
unsloth/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 135k • 90 -
unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit
Text Generation • 0.8B • Updated • 74.2k • 4 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 37.6k • 22
Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions.
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
-
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text • 3B • Updated • 14.3k • 19 -
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text • 8B • Updated • 62.1k • 142 -
unsloth/Qwen2.5-VL-32B-Instruct-GGUF
Image-Text-to-Text • 33B • Updated • 912 • 7 -
unsloth/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text • 73B • Updated • 1.43k • 7
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
-
unsloth/QwQ-32B-GGUF
Text Generation • 33B • Updated • 1.59k • 86 -
unsloth/QwQ-32B-unsloth-bnb-4bit
Text Generation • 34B • Updated • 540 • 47 -
unsloth/QwQ-32B
Text Generation • 33B • Updated • 29 • • 17 -
unsloth/QwQ-32B-bnb-4bit
Text Generation • 34B • Updated • 354 • 4
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 30.5k • 87 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit
Image-Text-to-Text • Updated • 4.53k • 28 -
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text • 9B • Updated • 2.18k • 6
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 30.5k • 87 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Llama-3.2-11B-Vision
Image-Text-to-Text • 11B • Updated • 521 • 34 -
unsloth/Llama-3.2-11B-Vision-bnb-4bit
Image-Text-to-Text • 11B • Updated • 153 • 16
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
-
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.48k • 74 -
unsloth/Qwen2.5-Coder-14B-Instruct-128K-GGUF
15B • Updated • 1.7k • 34 -
unsloth/Qwen2.5-Coder-7B-Instruct-128K-GGUF
8B • Updated • 3.58k • 20 -
unsloth/Qwen2.5-Coder-3B-Instruct-128K-GGUF
3B • Updated • 1.31k • 16
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
-
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 254k • 95 -
unsloth/Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 35k • 4 -
unsloth/Meta-Llama-3.1-8B-Instruct-unsloth-bnb-4bit
Text Generation • 8B • Updated • 154k • 4 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 27.5k • 109
-
unsloth/Qwen2.5-7B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 69.4k • 20 -
unsloth/Qwen2.5-7B-Instruct
Text Generation • 8B • Updated • 60k • • 22 -
unsloth/Qwen2.5-14B-bnb-4bit
Text Generation • 15B • Updated • 3.07k • 5 -
unsloth/Qwen2.5-7B-bnb-4bit
Text Generation • 8B • Updated • 7.96k • 6
Native bitsandbytes 4bit pre quantized models
-
unsloth/Llama-3.2-3B-bnb-4bit
Text Generation • 3B • Updated • 11.5k • 21 -
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation • 8B • Updated • 27.5k • 109 -
unsloth/llama-3-8b-Instruct-bnb-4bit
Text Generation • 8B • Updated • 86.6k • 133 -
unsloth/gemma-2-9b-bnb-4bit
Text Generation • 10B • Updated • 9.06k • 31
-
unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Text Generation • 3B • Updated • 51.2k • 33 -
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 1B • Updated • 37.6k • 22 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 8.57k • 79 -
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 8B • Updated • 254k • 95