
Qwen3.5-122B-A10B BF16 GGUF = 224GB. The "80GB VRAM" mentioned here will barely fit Q4_K_S (70GB), which will NOT perform as shown on the benchmarks.

Quite misleading, really.
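The sizes above follow from a simple bits-per-weight calculation. A back-of-the-envelope sketch (the bits-per-weight figures are approximations, not official numbers; real GGUF files differ somewhat because some tensors stay at higher precision and metadata adds overhead):

```python
# Rough GGUF file-size estimate from parameter count and bits per weight.
# Bits-per-weight values are approximate effective rates (assumption).
BPW = {
    "BF16": 16.0,
    "Q8_0": 8.5,
    "Q4_K_S": 4.58,
}

def gguf_size_gb(n_params: float, quant: str) -> float:
    """Approximate file size in GB (1 GB = 1e9 bytes)."""
    return n_params * BPW[quant] / 8 / 1e9

# 122B parameters at Q4_K_S lands right around the 70GB figure quoted above.
for q in BPW:
    print(f"{q}: ~{gguf_size_gb(122e9, q):.0f} GB")
```

Note this counts weights only; KV cache and activations need additional VRAM on top of the file size, which is why even a quant that "fits" can be tight in practice.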



The larger Qwen 3.5 quants actually come pretty close to the full-precision 397B model's performance, at least going by the published numbers. Qwen 3.5 seems more tolerant of quantization than most models.
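The comparison above can be made concrete as a "score retention" ratio. A minimal illustrative helper (the scores in the example are made up, not real benchmark results):

```python
# Hypothetical helper: express a quantized model's benchmark score as a
# percentage of the full-precision flagship's score, as the comment above
# does informally when calling the quants "pretty close".
def retention(quant_score: float, full_score: float) -> float:
    """Percent of the full model's benchmark score retained by the quant."""
    return 100.0 * quant_score / full_score

# Illustrative numbers only (assumption, not published benchmarks):
print(f"{retention(82.1, 84.0):.1f}% of the full model's score")
```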



