llamaperf

DGX Spark

NVIDIA · 128GB unified memory · 1 report

Tone: positive
Quant: NVFP4

16x DGX Spark cluster with unified memory, serving GLM-5.1-NVFP4 (434GB) at tensor parallelism 8 (TP=8). The reporter plans to test DeepSeek and Kimi next, and later to try a prefill/decode split with M5 Ultra Mac Studios.
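As a rough sanity check on the figures in this report, the sketch below estimates how much of each node's memory the weights would occupy at TP=8. It assumes the 434GB of weights are split evenly across the 8 tensor-parallel ranks, with one rank per 128GB node, and it ignores KV cache, activations, and OS overhead. None of those details are stated in the report, and the function and variable names are illustrative only.

```python
def weight_shard_gb(model_gb: float, tp: int) -> float:
    """Per-rank weight footprint, assuming an even split across TP ranks."""
    return model_gb / tp


# Figures taken from the report.
MODEL_GB = 434.0        # GLM-5.1-NVFP4 checkpoint size
NODE_MEMORY_GB = 128.0  # unified memory per DGX Spark
TP = 8                  # tensor-parallel degree

shard = weight_shard_gb(MODEL_GB, TP)
# Assumption: whatever the weights do not use is shared by KV cache,
# activations, and the OS, since the memory is unified.
headroom = NODE_MEMORY_GB - shard

print(f"{shard:.2f} GB of weights per node, {headroom:.2f} GB left over")
```

Under these assumptions each node holds about 54GB of weights, leaving roughly 74GB for everything else. The report does not say how the 16 nodes map onto the 8 tensor-parallel ranks, so this covers a single TP=8 group only.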