In this RCE podcast, Brock Palen and Jeff Squyres speak with Matei Zaharia about Apache Spark, a fast engine for large-scale data processing.
Matei Zaharia is an assistant professor of computer science at MIT and CTO of Databricks, the company commercializing Apache Spark. He begun the Spark project at UC Berkeley and continues to do research in big data processing and computer systems. Apart from Spark, he has contributed to other open source projects including Apache Mesos, the SNAP sequence aligner, and Apache Hadoop.
Spark is used at a wide range of organizations to process large datasets. You can find example use cases at the Spark Summit conference, or on the Powered By Spark page.