llamaperf

RX 7900 XT

AMD · 20GB · 2 reports

This page is thin (2 of 3 reports needed for indexing). Help fill it in.
Tone: positive
throughput:
38.0 t/s gen
quant:
Q4_K_M (gguf)
text-generation

~35-40 tok/s on RX 7900 XT 20GB. Gemma 4 E4B Q4_K_M. Headroom for longer context. Source: gemma4-ai.com AMD GPU guide