- throughput: 18.0 t/s gen
- quant: Q4_K_M (gguf)
text-generation
INTEL · 12GB · 2 reports
~18 tok/s on Intel Arc B580 12GB. The 26B MoE is a tight fit in 12GB, so short context only. Source: compute-market.com
~30 tok/s on Intel Arc B580 12GB. Handles E4B comfortably. Source: compute-market.com