- SUT ID: STAC250314a
- STAC-ML
Extending STAC-ML with Gradient Boosted Tree Models on an X86-Architecture Cloud Instance
Type: STAC Report
Specs: STAC-ML*
Stack under test:
- STAC-ML Markets (Inference) Reference Implementation for GBT Models
- Python 3.12.3; ONNX runtime 1.21.0; NumPy 2.2.3
- Ubuntu Linux 24.04.1 LTS
- Numerous realtime tuning options and configurations
- AWS EC2 c7i.metal-24xl instance:
- 1 x Intel® Xeon® Platinum 8488C Processor
- 192 GiB memory: 8 x 24 GiB DDR5 DIMM @ 4800 MT/s
- 600 GiB AWS EBS volume
This STAC Report presents the findings of a Proof of Concept benchmark focused on Gradient-Boosted Tree (GBT) inference for real-time market data analysis. This study evaluates latency performance across three GBT models with varying complexities, comparing X86 and ARM architectures on AWS bare-metal instances using the ONNX runtime.
Please log in to see file attachments. If you are not registered, you may register for no charge.