Mistral-Medium-3.5 128B
M2 Ultra 192GB · oMLX
- quant:
- mlx-8bit (mlx)
Bug report: severe prefill throughput regression vs mlx-vlm 0.4.4 on Mac Studio (M2 Ultra 192GB) with Mistral-Medium-3.5-128B-mlx-8bit, long-context.
APPLE · 192GB unified memory · 1 report
M2 Ultra 192GB · oMLX
Bug report: severe prefill throughput regression vs mlx-vlm 0.4.4 on Mac Studio (M2 Ultra 192GB) with Mistral-Medium-3.5-128B-mlx-8bit, long-context.