RCE Podcast Looks at Apache Spark

May 11, 2015 by Doug Black

0:00

In this RCE podcast, Brock Palen and Jeff Squyres speak with Matei Zaharia about Apache Spark, a fast engine for large-scale data processing.

Matei Zaharia is an assistant professor of computer science at MIT and CTO of Databricks, the company commercializing Apache Spark. He begun the Spark project at UC Berkeley and continues to do research in big data processing and computer systems. Apart from Spark, he has contributed to other open source projects including Apache Mesos, the SNAP sequence aligner, and Apache Hadoop.

Spark is used at a wide range of organizations to process large datasets. You can find example use cases at the Spark Summit conference, or on the Powered By Spark page.

Download the MP3 * Subscribe on iTunes * RSS Feed

Filed Under: High Performance Analytics, HPC Software, Industry Perspectives, Podcast, Resources Tagged With: Apache Spark, big data, RCE Podcast

Energy efficiency drives HPC to the cloud

The high-performance computing (HPC) market is witnessing a notable shift towards the cloud, partially driven by the benefits of enhanced energy efficiency. According to Hyperion Research nearly every organization running HPC workloads is either already using or investigating the cloud to accelerate application performance, with the cloud market for HPC workloads forecast to reach $11.5 […]

Download

RCE Podcast Looks at Apache Spark

Sponsored Guest Articles

Dell: Omnia Copes with Configuring HPC-AI Environments

White Papers

Energy efficiency drives HPC to the cloud

Featured RSS Feed

More News from insideBIGDATA

RCE Podcast Looks at Apache Spark

Sponsored Guest Articles

Dell: Omnia Copes with Configuring HPC-AI Environments

White Papers

Energy efficiency drives HPC to the cloud

Join Us On Social Media

Related Posts

Featured RSS Feed

More News from insideBIGDATA