An unmerged llama.cpp implementation: https://github.com/TrevorS/llama.cpp/tree/feature/qwen3-omni

Downloads last month
6,252
GGUF
Model size
4B params
Architecture
qwen3omni-talker
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for TrevorJS/Qwen3-Omni-30B-A3B-GGUF

Quantized
(9)
this model