- throughput:
- 18.0 t/s gen
- quant:
- Q4_K_M (gguf)
text-generation
~15-20 tok/s on RX 7800 XT 16GB. Gemma 4 26B MoE Q4_K_M via Ollama. Fits with short context. Source: gemma4-ai.com AMD GPU guide
AMD · 16GB · 2 reports
~15-20 tok/s on RX 7800 XT 16GB. Gemma 4 26B MoE Q4_K_M via Ollama. Fits with short context. Source: gemma4-ai.com AMD GPU guide
~25-30 tok/s on RX 7800 XT 16GB. Gemma 4 E4B Q4_K_M via Ollama. Good mid-range AMD option. Source: gemma4-ai.com AMD GPU guide