- throughput:
- 35.0 t/s gen
- quant:
- Q4_K_M (gguf)
text-generation
Gemma 4 26B MoE on Instinct MI250X at Q4_K_M. Source: gemma4-ai.com AMD GPU guide
AMD · 128GB · 2 reports
Gemma 4 26B MoE on Instinct MI250X at Q4_K_M. Source: gemma4-ai.com AMD GPU guide
Gemma 4 31B on Instinct MI250X via vLLM + ROCm. Full FP16. Previous gen datacenter but capable. Source: gemma4-ai.com AMD GPU guide