M3 Max 128GB vs RTX 5080
For running local LLMs · 2 reports across 1 model
Tokens per second by model
| Model | M3 Max 128GB | RTX 5080 |
|---|---|---|
| Qwen3.6 (up to 35B) | 5.5 (n=1) | 56.0 (n=1) |