Video: Accelerating Big Data Processing with Hadoop, Spark and Memcached

March 30, 2015 by Doug Black

DK Panda, Ohio State University

In this video from the 2015 HPC Advisory Council Switzerland Conference, DK Panda from Ohio State University presents: Accelerating Big Data Processing with Hadoop, Spark and Memcached.

Apache Hadoop and Spark are gaining prominence in handling Big Data and analytics. Similarly, Memcached in Web 2.0 environment is becoming important for large-scale query processing. These middleware are traditionally written with sockets and do not deliver best performance on datacenters with modern high performance networks. In this tutorial, we will provide an in-depth overview of the architecture of Hadoop components (HDFS, MapReduce, RPC, HBase, etc.), Spark and Memcached. We will examine the challenges in re-designing the networking and I/O components of these middleware with modern interconnects, protocols (such as InfiniBand, iWARP, RoCE, and RSocket) with RDMA and storage architecture. Using the publicly available software packages in the High-Performance Big Data (HiBD) project, we will provide case studies of the new designs for several Hadoop/Spark/Memcached components and their associated benefits. Through these case studies, we will also examine the interplay between high performance interconnects, storage systems (HDD and SSD), and multi-core platforms to achieve the best solutions for these components.

Download the Slides (PDF) * See more talks in the HPCAC Switzerland Conference Video Gallery.

Sign up for our insideHPC Newsletter.

Video: Accelerating Big Data Processing with Hadoop, Spark and Memcached

Sponsored Guest Articles

The Future-Proofed Datacenter: DDC Delivers 85kW Air-Cooled Density for AI and HPC Workloads

White Papers

Energy efficiency drives HPC to the cloud

Featured RSS Feed

More News from insideBIGDATA

Video: Accelerating Big Data Processing with Hadoop, Spark and Memcached

Sponsored Guest Articles

The Future-Proofed Datacenter: DDC Delivers 85kW Air-Cooled Density for AI and HPC Workloads

White Papers

Energy efficiency drives HPC to the cloud

Join Us On Social Media

Related Posts

Featured RSS Feed

More News from insideBIGDATA