llamaperf

M2 Ultra 192GB

APPLE · 192GB unified memory · 1 report

Mistral-Medium-3.5 128B

M2 Ultra 192GB · oMLX

Tone: negative
quant: mlx-8bit (mlx)

Bug report: severe prefill throughput regression compared with mlx-vlm 0.4.4 on a Mac Studio (M2 Ultra, 192GB) running Mistral-Medium-3.5-128B-mlx-8bit at long context.
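
For readers who want to quantify a regression like this one, here is a minimal sketch of how prefill throughput and the relative change between two runs might be computed. The function names are hypothetical, and the numbers in the usage lines are placeholders for illustration only; they are not measurements from this report or from any of the tools named above.

```python
def prefill_tokens_per_second(prompt_tokens: int, prefill_seconds: float) -> float:
    """Prefill throughput: prompt tokens processed per second before the first output token."""
    if prefill_seconds <= 0:
        raise ValueError("prefill_seconds must be positive")
    return prompt_tokens / prefill_seconds


def relative_change(baseline: float, candidate: float) -> float:
    """Fractional change of candidate vs baseline; negative values mean a slowdown."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (candidate - baseline) / baseline


if __name__ == "__main__":
    # Placeholder inputs (NOT from the report): same prompt length, two prefill timings.
    baseline_tps = prefill_tokens_per_second(32_000, 100.0)
    candidate_tps = prefill_tokens_per_second(32_000, 400.0)
    change = relative_change(baseline_tps, candidate_tps)
    print(f"baseline:  {baseline_tps:.1f} tok/s")
    print(f"candidate: {candidate_tps:.1f} tok/s")
    print(f"change:    {change:+.0%}")
```

Comparing both runs on the same prompt length matters here, since the report concerns long-context prompts and prefill time does not necessarily scale linearly with prompt length.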