llamaperf

M5 Max 128GB

APPLE · 128GB unified memory · 1 report

This page is thin (1 of 3 reports needed for indexing). Help fill it in.
Tone: negative
throughput:
7.5 t/s gen
flash-attn:
on

User reports poor performance with Gemma4-31B (7.5 tok/s) and Qwen3.6-27B (locking up) on M5 Max 128GB, while Qwen3.6-35B-A3 is fast. Mentions using DFLASH.