Scaling Python and PySpark using Vectorized UDFs and Apache Arrow

Li Jin, Distributed System Engineer at Two Sigma Investments, gave this presentation at the STAC Summit in New York on 13 June 2018.

