Spark Archives - High-Performance Computing News Analysis

GigaOm Radar for Evaluating Data Warehouse Platforms

July 1, 2020 by DO NOT USE

This new GigaOm Radar Report, provided by our friends over at Vertica, examines the leading platforms in the data warehouse marketplace, describes the fundamentals of the technology, identifies key criteria and evaluation metrics by which organizations can evaluate competing platforms, describes some potential technology developments to look out for in the future, and classifies platforms across those criteria and metrics.

Tagged With: Apache Spark, Data Lake, data warehouse, Hadoop, Spark, Vertica

Univa Grid Engine Powers University of Oxford Human Genetics Centre

November 24, 2017 by ralphwells

Last week at SC17, Univa announced its Univa Grid Engine distributed resource management system is powering the Wellcome Centre for Human Genetics’ (WHG) high performance computing environment. WHG is a research institute within the Nuffield Department of Medicine at the University of Oxford. The Centre is an international leader in genetics, genomics, statistics and structural biology with more than 400 researchers and 70 administrative and support personnel. WHG’s mission is to advance the understanding of genetically-related conditions through a broad range of multi-disciplinary research.

Filed Under: High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Network, News, Research / Education, Systems Management Tagged With: InfiniBand, SC17, Spark, Univa Grid Engine, Wellcome Centre for Human Genetics, WHG

Designing HPC, Big Data, & Deep Learning Middleware for Exascale

October 31, 2017 by Doug Black

DK Panda from Ohio State University presented this talk at the HPC Advisory Council Spain Conference. “This talk will focus on challenges in designing HPC, Big Data, and Deep Learning middleware for Exascale systems with millions of processors and accelerators. For the HPC domain, we will discuss the challenges in designing runtime environments for MPI+X (PGAS OpenSHMEM/UPC/CAF/UPC++, OpenMP, and CUDA) programming models. Features and sample performance numbers from MVAPICH2 libraries will be presented.”

Filed Under: Compute, Events, Exascale, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Machine Learning, News, Research / Education, Resources, Videos Tagged With: AI, big data, Deep Learning, Hadoop, HPC Advisory Council Spain Conference, RoCE, Spark, Weekly Newsletter Articles

NEC Vector Computers Accelerate Machine Learning

July 3, 2017 by Doug Black

Today NEC Corporation announced that it has developed new Aurora Vector Engine data processing technology that accelerates the execution of machine learning on vector computers by more than 50 times in comparison to Spark technologies. “This technology enables users to quickly benefit from the results of machine learning, including the optimized placement of web advertisements, recommendations, and document analysis,” said Yuichi Nakamura, General Manager, System Platform Research Laboratories, NEC Corporation. “Furthermore, low-cost analysis using a small number of servers enables a wide range of users to take advantage of large-scale data analysis that was formerly only available to large companies.”

Filed Under: Compute, Datacenter, Enterprise HPC, Government, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, News, Research / Education Tagged With: NEC, Spark, Vector computing

Bringing HPC Algorithms to Big Data Platforms

March 1, 2017 by Doug Black

Nikolay Malitsky from Brookhaven National Laboratory presented this talk at the Spark Summit East conference. “This talk will present an MPI-based extension of the Spark platform developed in the context of light source facilities. The background and rationale of this extension are described in the paper ‘Bringing the HPC reconstruction algorithms to Big Data platforms,’ which highlighted a gap between two modern driving forces of the scientific discovery process: HPC and Big Data technologies. As a result, it proposed to extend the Spark platform with inter-worker communication for supporting scientific-oriented parallel applications.”

Filed Under: Compute, Datacenter, Enterprise HPC, Events, Government, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Main Feature, Research / Education, Resources, Videos Tagged With: big data, Brookhaven National Laboratory, Spark, SPARK Summit East, Weekly Newsletter Articles

Intel DAAL Accelerates Data Analytics and Machine Learning

February 23, 2017 by Richard Friedman

Intel DAAL is a high-performance library specifically optimized for big data analysis on the latest Intel platforms, including Intel Xeon® and Intel Xeon Phi™. It provides the algorithmic building blocks for all stages of data analysis in offline, batch, streaming, and distributed processing environments. It was designed for efficient use across the popular data platforms and APIs in use today, including MPI, Hadoop, Spark, R, MATLAB, Python, C++, and Java.

Filed Under: High Performance Analytics, HPC Software, Machine Learning, Main Feature, News, Parallel Programming, Sponsored Post, Tools Tagged With: big data, Hadoop, Intel DAAL, intel mkl, Intel Parallel Studio XE, Intel TEC, MATLAB, MPI, R, Spark, Weekly Newsletter Articles

New Bright for Deep Learning Solution Designed for Business

September 1, 2016 by Doug Black

“We have enhanced Bright Cluster Manager 7.3 so our customers can quickly and easily deploy new deep learning techniques to create predictive applications for fraud detection, demand forecasting, click prediction, and other data-intensive analyses,” said Martijn de Vries, Chief Technology Officer of Bright Computing. “Going forward, customers using Bright to deploy and manage clusters for deep learning will not have to worry about finding, configuring, and deploying all of the dependent software components needed to run deep learning libraries and frameworks.”

Filed Under: Compute, CPUs, GPUs, FPGAs, Datacenter, Enterprise HPC, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, News Tagged With: AI, Bright Computing, Caffe, Deep Learning, IDC, Machine Learning, nvidia, Spark, TensorFlow, Theano, Torch, Weekly Newsletter Articles

NERSC to Host Data Day on August 22

July 26, 2016 by Doug Black

Today NERSC announced plans to host a new, data-centric event called Data Day. The main event will take place on August 22, followed by a half-day hackathon on August 23. The goal: to bring together researchers who use, or are interested in using, NERSC systems for data-intensive work.

Filed Under: Compute, Events, Government, HPC Hardware, Industry Segments, News, Research / Education, Resources, Storage Tagged With: Burst Buffer, Cori supercomputer, Data Day, Hackathon, NERSC, Python, Spark

Video: Exploiting HPC Technologies to Accelerate Big Data Processing

April 15, 2016 by Doug Black

“This talk will present RDMA-based designs using OpenFabrics Verbs and heterogeneous storage architectures to accelerate multiple components of Hadoop (HDFS, MapReduce, RPC, and HBase), Spark and Memcached. An overview of the associated RDMA-enabled software libraries being designed and publicly distributed as a part of the HiBD project.”

Filed Under: Compute, Datacenter, Enterprise HPC, Events, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Network, Research / Education, Resources, Videos Tagged With: big data, Hadoop, Memcached, OpenFabrics Workshop, Spark

Interview: How Univa Short Jobs Brings Low Latency to Financial Services

June 22, 2015 by Doug Black

With the launch of the Univa Short Jobs add-on for Univa Grid Engine, the company offers “the world’s most efficient processing and lowest latency available for important tasks like real-time trading, transactions, and other critical applications.” To learn more, we caught up with Univa President & CEO Gary Tyreman.

Filed Under: Compute, HPC Hardware, HPC Software, Industry Perspectives, Resources, Systems Management, Tools Tagged With: Amazon EC2, Financial Services, Hadoop, Spark, Univa Grid Engine
