Bridging the MapReduce Skills Gap

Data is exploring at large organization. So is the adoption of Hadoop. Hadoop’s potential cost effectiveness and facility for accepting unstructured data is making it central to modern, “Big Data” architectures. Yet, a significant obstacle to Hadoop adoption has been a shortage of skilled MapReduce coders.

YARN Opens New Doors to Easier Programming tools for Hadoop 2.0

Find out how YARN for Hadoop 2.0 has opened the door to new tools and applications that promise to allow more companies to reap the benefits of big data!

Progress Report on Efficient Integration of Lustre and Hadoop/YARN

Using Hadoop with Lustre provides several benefits, including: Lustre is a real parallel file system, which enables temporary or intermediate data to be stored in parallel on multiple nodes reducing the load on single nodes. In addition, Lustre has its own network protocol, which is more efficient for bulk data transfer than the HTTP protocol. Additionally, because Lustre is a shared file system, each client sees the same file system image, so hardlinks can be used to avoid data transfer between nodes.