A Future for R: Parallel and Distributed Processing in R for Everyone

In this video from the European R Users Meeting, Henrik Bengtsson from the University of California, San Francisco, presents: A Future for R: Parallel and Distributed Processing in R for Everyone. “The future package is a powerful and elegant cross-platform framework for orchestrating asynchronous computations in R. It’s ideal for working with computations that take a long time to complete; that would benefit from using distributed, parallel frameworks to make them complete faster; and that you’d rather not have locking up your interactive R session.”

Video: ddR – Distributed Data Structures in R

“A few weeks ago, we revealed ddR (Distributed Data-structures in R), an exciting new project started by R-Core, Hewlett Packard Enterprise, and others that provides a fresh new set of computational primitives for distributed and parallel computing in R. The package sets the seed for what may become a standardized and easy way to write parallel algorithms in R, regardless of the computational engine of choice.”

Podcast: PortHadoop Speeds Data Movement for Science

In this TACC Podcast, host Jorge Salazar interviews Xian-He Sun, Distinguished Professor of Computer Science at the Illinois Institute of Technology. Computer scientists in his group are bridging the file system gap with a cross-platform Hadoop reader called PortHadoop, short for portable Hadoop. “We tested our PortHadoop-R strategy on Chameleon. In fact, the speedup is 15 times faster,” said Xian-He Sun. “It’s quite amazing.”

Intel DAAL Accelerates Data Analytics and Machine Learning

The Intel Data Analytics Acceleration Library (Intel DAAL) is a high-performance library specifically optimized for big data analysis on the latest Intel platforms, including Intel Xeon® and Intel Xeon Phi™. It provides the algorithmic building blocks for all stages of data analysis in offline, batch, streaming, and distributed processing environments. It was designed for efficient use across the popular data platforms and APIs in use today, including MPI, Hadoop, Spark, R, MATLAB, Python, C++, and Java.

OCF Deploys Fujitsu HPC Cluster at University of East Anglia

OCF in the U.K. recently deployed a new Fujitsu HPC cluster at the University of East Anglia. As the University’s second new HPC system in four years, the cluster can be easily scaled and expanded in the coming months through a framework agreement to match rapidly increasing demand for compute power.

FlyElephant Startup Announces Support for R, Python, and Public API

Today the good folks at FlyElephant announced support for R and Python, along with a public API, for participants in its beta testing program.