Posted October 13, 2025
STAC Report

STAC-ML™ Markets (Inference) new world record set by NVIDIA and Supermicro

STAC-ML™ Markets (Inference) on a NVIDIA GH200 Grace Hopper Superchip in a Supermicro server

Reports

Detailed configuration information are available to eligible subscribers at the links above. To learn more about subscription options, please contact us.

STAC recently audited a STAC-ML™ Markets (Inference) benchmark on a stack featuring an NVIDIA GH200 Grace Hopper Superchip in a Supermicro ARS-111GL-NHR server.

In this audit, the GH200 system was compared to a previous submission using FPGAs. The results show:

  • Smallest model: Up to 20% lower latency
  • Medium model: Up to 8% lower latency
  • Largest model: 49% lower latency

Full benchmark reports are available at the link opposite to all members.

With additional performance, efficiency and quality results, detailed configurations, and code access available to premium subscribers.

Sign up to
our newsletter