- throughput:
- 22.0 t/s gen
- quant:
- Q4_K_M (gguf)
text-generation
~20-25 tok/s on RX 7900 XT 20GB. Gemma 4 26B MoE Q4_K_M. Comfortable fit. Source: gemma4-ai.com AMD GPU guide
AMD · 20GB · 2 reports
~20-25 tok/s on RX 7900 XT 20GB. Gemma 4 26B MoE Q4_K_M. Comfortable fit. Source: gemma4-ai.com AMD GPU guide
RX 7900 XT · Ollama
~35-40 tok/s on RX 7900 XT 20GB. Gemma 4 E4B Q4_K_M. Headroom for longer context. Source: gemma4-ai.com AMD GPU guide