Mark Seamans from SGI presented this talk at the HPC User Forum in Tucson. “As the trusted leader in high performance computing, SGI helps companies find answers to the world’s biggest challenges. Our commitment to innovation is unwavering and focused on delivering market leading solutions in Technical Computing, Big Data Analytics, and Petascale Storage. Our solutions provide unmatched performance, scalability and efficiency for a broad range of customers.”
In this RCE Podcast, Marcel Kornacker from Cloudera describes the Impala project. Impala brings scalable parallel database technology to Hadoop, enabling users to issue low-latency SQL queries to data stored in HDFS and Apache HBase without requiring data movement or transformation. Impala is integrated with Hadoop to use the same file and data formats, metadata, security and resource management frameworks used by MapReduce, Apache Hive, Apache Pig and other Hadoop software.
The New York Scientific Data Summit (NYSDS) has issued its Call for Papers. The event takes place August 14-17 in New York City.
Today Intel announced the inception of the Intel Data Analytics Acceleration Library (Intel DAAL) open source project. “Intel DAAL helps to speed up big data analysis by providing highly optimized algorithmic building blocks for all stages of data analytics (preprocessing, transformation, analysis, modeling, validation, and decision making) in batch, online, and distributed processing modes of computation. The open source project is licensed under Apache License 2.0.”
Rangan Sukumar from ORNL presented this talk at the HPC User Forum in Tucson. “ORiGAMI is a tool for discovering and evaluating potentially interesting associations and creating novel hypothesis in medicine. ORiGAMI will help you “connect the dots” across 70 million knowledge nuggets published in 23 million papers in the medical literature. The tool works on a ‘Knowledge Graph’ derived from SEMANTIC MEDLINE published by the National Library of Medicine integrated with scalable software that enables term-based, path-based, meta-pattern and analogy-based reasoning principles.”
“This talk will present RDMA-based designs using OpenFabrics Verbs and heterogeneous storage architectures to accelerate multiple components of Hadoop (HDFS, MapReduce, RPC, and HBase), Spark and Memcached. An overview of the associated RDMAenabled software libraries being designed and publicly distributed as a part of the HiBD project.”
Hadoop and Spark clusters have a reputation for being extremely difficult to configure, install, and tune, but help is on the way. The good folks at Cluster Monkey are hosting a crash course entitled Apache Hadoop with Spark in One Day. “After completing the workshop attendees will be able to use and navigate a production Hadoop cluster and develop their own projects by building on the workshop examples.”
If you are in the Northwest and you happen to like surf and turf, have I got a deal for you! Dell is hosting a series of Big Data lunch events in Seattle and Portland at the end of April. On April 26, Dell brings the event to Blueacre Seafood in Seattle. In Portland, lunch is on April 27 at the mighty Fogo de Chao, a Brazilian steak house for the Where’s the Beef? crowd. They’re also coming to Flemings in Salt Lake City on April 28.
The 2016 ALCF Data Science Program (ADSP) at Argonne has issued its Call for Proposals. The new initiative is targeted at “big data” science problems that require the scale and performance of leadership resources. “Our goal is to help explore and improve a variety of computational methods that will help enable data-driven discoveries across all scientific disciplines,” said ALCF Director of Science Katherine Riley.
The STFC Hartree Centre in the UK will host a Hackathon for coders, developers, designers, entrepreneurs and start-ups in May. The event will take place May 18-20 at the Hartree Centre in Cheshire. In partnership with IBM Watson, the Hartree Hack will put the latest cognitive technologies directly into the hands of attendees. Participants will learn from the experts about what IBM Watson APIs (application programming interfaces) can offer them and how to use them, create their first cognitive app and compete to win £25k of support from STFC to propel their idea forward to a market reality over just three days.