M2 Max 96GB vs RTX 5080
For running local LLMs · 3 reports across 1 model
Tokens per second by model
| Model | M2 Max 96GB | RTX 5080 |
|---|---|---|
| Qwen3.6up to 35B | 28.0n=2 | 56.0n=1 |
For running local LLMs · 3 reports across 1 model
| Model | M2 Max 96GB | RTX 5080 |
|---|---|---|
| Qwen3.6up to 35B | 28.0n=2 | 56.0n=1 |