llamaperf

Instinct MI250X 128GB

AMD · 128GB · 2 reports

This page is thin (2 of 3 reports needed for indexing). Help fill it in.
Tone: positive
throughput:
20.0 t/s gen
quant:
FP16 (safetensors)
text-generation

Gemma 4 31B on Instinct MI250X via vLLM + ROCm. Full FP16. Previous gen datacenter but capable. Source: gemma4-ai.com AMD GPU guide