Hadoop Archives - High-Performance Computing News Analysis

Quobyte Releases Hadoop Native Driver for Analytics, ML, Streaming and Real-time Applications

July 21, 2021 by staff

SANTA CLARA, CA, July 21, 2021 — Quobyte Inc., a developer of scale-out software-defined storage (SDS), today announced availability of its Hadoop Driver. Quobyte’s new native driver for Hadoop addresses the limitations of the Hadoop Distributed File System’s (HDFS) high-capacity design within the enterprise. The new native driver brings significant benefits in optimizing Hadoop clusters for a much wider […]

Filed Under: High Performance Analytics, HPC Software, News Tagged With: Hadoop, Quobyte, software defined storage, Weekly Newsletter Articles

Unify Your Analytics, and Keep Your Data Where It Suits You

August 6, 2020 by staff

In this sponsored post, Joy King, VP, Vertica Product Management & Product Marketing, argues that we need to end our fixation with “data in one place.” The days of the single data repository are behind us. If you try to build one, you’ll incur so much data management time and cost that you’ll squander the savings before you even get to the analysis stage. To put it simply, the goal is to unify, analyze, and act, because predictive analytics and proactive action are the definition of business success.

Filed Under: Business of HPC, Datacenter, Enterprise HPC, Google News Feed, HPC Hardware, News, Sponsored Post, Storage, Uncategorized Tagged With: AI, Analytics, EDW, enterprise data warehouse, Hadoop, Vertica, Weekly featured Newsletter Articles, Weekly Featured Newsletter Post

GigaOm Radar for Evaluating Data Warehouse Platforms

July 6, 2020 by staff

This new GigaOm Radar Report, “GigaOm Radar for Evaluating Data Warehouse Platforms,” provided by our friends over at Vertica, examines the leading platforms in the data warehouse marketplace, describes the fundamentals of the technology, identifies key criteria and evaluation metrics by which organizations can evaluate competing platforms, describes some potential technology developments to look out for in the future, and classifies platforms across those criteria and metrics.

Filed Under: Business of HPC, Datacenter, Enterprise HPC, Featured, Google News Feed, HPC Hardware, HPC Software, Industry Segments, News, Sponsored Post, Storage, Uncategorized, White Papers Tagged With: AI, Apache Spark, data warehouse, Hadoop, Vertica, Weekly featured Newsletter Articles, Weekly Featured Newsletter Post

GigaOm Radar for Evaluating Data Warehouse Platforms

July 1, 2020 by DO NOT USE

This new GigaOm Radar Report, provided by our friends over at Vertica, examines the leading platforms in the data warehouse marketplace, describes the fundamentals of the technology, identifies key criteria and evaluation metrics by which organizations can evaluate competing platforms, describes some potential technology developments to look out for in the future, and classifies platforms across those criteria and metrics.

Tagged With: Apache Spark, Data Lake, data warehouse, Hadoop, Spark, Vertica

NetApp Deploys Iguazio’s Data Science Platform for Optimized Storage Management

June 10, 2020 by staff

Previously built on Hadoop, NetApp said it was also looking to modernize the service infrastructure “to reduce the complexities of deploying new AI services and the costs of running large-scale analytics. In addition, the shift was needed to enable real-time predictive AI, and to abstract deployment, allowing the technology to run on multi-cloud or on premises seamlessly.”

Filed Under: Cloud HPC, Datacenter, High Performance Analytics, Machine Learning, News, Systems Management, Tools Tagged With: Data Management, Data Science, Hadoop, Iguazio, Machine Learning, NetApp

Designing HPC, Big Data, & Deep Learning Middleware for Exascale

October 31, 2017 by Doug Black

DK Panda from Ohio State University presented this talk at the HPC Advisory Council Spain Conference. “This talk will focus on challenges in designing HPC, Big Data, and Deep Learning middleware for Exascale systems with millions of processors and accelerators. For the HPC domain, we will discuss about the challenges in designing runtime environments for MPI+X (PGAS OpenSHMEM/UPC/CAF/UPC++, OpenMP, and CUDA) programming models. Features and sample performance numbers from MVAPICH2 libraries will be presented.”

Filed Under: Compute, Events, Exascale, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Machine Learning, News, Research / Education, Resources, Videos Tagged With: AI, big data, Deep Learning, Hadoop, HPC Advisory Council Spain Conference, RoCE, Spark, Weekly Newsletter Articles

SC17 Invited Talk Preview: High Performance Machine Learning

August 22, 2017 by staff

Over at the SC17 Blog, Brian Ban begins his series of SC17 Session Previews with a look at a talk on High Performance Big Data. “Deep learning, using GPU clusters, is a clear example but many Machine Learning algorithms also need iteration, and HPC communication and optimizations.”

Filed Under: Cloud HPC, CPUs, GPUs, FPGAs, Events, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, Machine Learning, News, Research / Education, Resources Tagged With: big data, DAAL, Hadoop, HPC-ABDS, Indiana University, SC17

Podcast: PortHadoop Speeds Data Movement for Science

May 5, 2017 by staff

In this TACC Podcast, host Jorge Salazar interviews Xian-He Sun, Distinguished Professor of Computer Science at the Illinois Institute of Technology. Computer scientists in his group are bridging the file system gap with a cross-platform Hadoop reader called PortHadoop, short for portable Hadoop. “We tested our PortHadoop-R strategy on Chameleon. In fact, the speedup is 15 times faster,” said Xian-He Sun. “It’s quite amazing.”

Filed Under: Cloud HPC, High Performance Analytics, HPC Hardware, HPC Software, Industry Segments, News, Podcast, Research / Education, Resources Tagged With: chameleon testbed, Hadoop, NASA, PortHadoop, R, TACC

Accelerating Hadoop, Spark, and Memcached with HPC Technologies

March 31, 2017 by Doug Black

“This talk will present RDMA-based designs using OpenFabrics Verbs and heterogeneous storage architectures to accelerate multiple components of Hadoop (HDFS, MapReduce, RPC, and HBase), Spark and Memcached. An overview of the associated RDMA-enabled software libraries (being designed and publicly distributed as a part of the HiBD project for Apache Hadoop.”

Filed Under: Compute, Datacenter, Enterprise HPC, Events, HPC Hardware, HPC Software, Industry Segments, Main Feature, Network, Research / Education, Resources, Videos Tagged With: Apache Spark, big data, Hadoop, InfiniBand, Memcached, OFA, OpenFabrics Workshop, Weekly Newsletter Articles

Intel DAAL Accelerates Data Analytics and Machine Learning

February 23, 2017 by Richard Friedman

Intel DAAL is a high-performance library specifically optimized for big data analysis on the latest Intel platforms, including Intel Xeon® and Intel Xeon Phi™. It provides the algorithmic building blocks for all stages of data analysis in offline, batch, streaming, and distributed processing environments. It was designed for efficient use across the popular data platforms and APIs in use today, including MPI, Hadoop, Spark, R, MATLAB, Python, C++, and Java.

Filed Under: High Performance Analytics, HPC Software, Machine Learning, Main Feature, News, Parallel Programming, Sponsored Post, Tools Tagged With: big data, Hadoop, Intel DAAL, intel mkl, Intel Parallel Studio XE, Intel TEC, MATLAB, MPI, R, Spark, Weekly Newsletter Articles
