STAC Report: IBM BigInsights vs Apache Hadoop, using SWIM
BigInsights completed jobs about 4x faster than stock Apache Hadoop and was about 11x faster in a test of raw scheduling speed.
23 October 2013
STAC has released a STAC Report comparing the performance of IBM InfoSphere® BigInsights® Enterprise Edition with Adaptive MapReduce (ver 18.104.22.168) against Apache Hadoop (ver 1.1.2). We tested both on the same hardware cluster using an adaptation of the Statistical Workload Injector for MapReduce (SWIM) methodology and a workload modeled on Facebook MapReduce jobs. Click here to access this report.
BigInsights completed the jobs about 4x faster than Apache Hadoop running in the same environment. It was also about 11x faster in the corner-case “sleep” test, which measures scheduling speed.
Qualified members may access the detailed system configuration information in the STAC Configuration Disclosure in the STAC Vault.*
Subscribers from firms with a premium membership in the STAC Benchmark Council should automatically have access to the STAC Configuration Disclosure (make sure you are logged in). If you get an access-denied message and believe you are entitled to these documents, or if you'd like to ask about premium subscription options for your firm, please contact us. An Observer Member firm that has not already received a complimentary report may request access to this STAC Configuration Disclosure.