You must log in or register to comment.
deleted by creator
GGUF quants are already out: https://huggingface.co/bartowski/Qwen_QwQ-32B-GGUF
Yay! let’s try
ollama run hf.co/bartowski/Qwen_QwQ-32B-GGUF:Q4_K_M
/set parameter num_ctx 32768
insane, absolutely insane
Why insane? For quality, speed, size? I find the coder 1.5b and 3b light and good
It matches R1 in the given benchmarks. R1 has 671B params (36 activated) while this only has 32