VRAM Calculator
Pick your GPU — see which open-weight models fit in VRAM, at which quantization, and roughly how fast they run. Tokens/sec comes from community-reported benchmarks when we have them.
3 fits fully·1 with offload·0 won't run
All 4 reports for this GPU →