Data Engineering Survey: 2021 Impact Report

This Data Engineering Survey: 2021 Impact Report summarizes key findings from the inaugural survey and provides a glimpse into the current and future state of data engineering and DataOps. The report highlights some of the major trends uncovered in this year’s survey including the adoption of cloud data platforms, what platforms are winning (and emerging), what data engineers find to be their biggest challenges, and how organizations are handling sensitive data.

XAI: Are We Looking Before We Leap?

This four part report from our friends over at SOSA highlights the challenges and advances when it comes to regulations, relevant use-cases, and emerging technologies taking the mystery out of AI.

The Graphcore Second Generation IPU

Our friends over at Graphcore, the U.K.-based startup that launched the Intelligence Processing Unit (IPU) for AI acceleration in 2018, has released a new whitepaper introducing the IPU-Machine. This second-generation platform has greater processing power, more memory and built-in scalability for handling extremely large parallel processing workloads. This paper will explores the new platform and assess its strengths and weaknesses compared to the growing cadre of potential competitors.

10 Questions to Ask When Starting With AI

In this insideHPC Guide, our friends over at WEKA offer 10 important questions to ask when starting with AI, specifically planning for success beyond the initial stages of a project. Reasons given for these failures include not having a plan ahead of time, not getting executive or business leadership buy-in, or failing to find the proper team to execute the project. Chasing the hot technology trend without having a proper strategy often leads companies down the path of failure.

Things to Know When Assessing, Piloting, and Deploying GPUs

In this insideHPC Guide, our friends over at WEKA suggest that when organizations decide to move existing applications or new applications to a GPU-influenced system there are many items to consider, such as assessing the new  environment’s required components, implementing a pilot program to learn about the system’s future  performance, and considering eventual scaling to production levels.

Modern HPC and Big Data Design Strategies for Data Centers

This insideHPC Special Research Report provides an overview of what to consider when selecting an infrastructure capable of meeting the new workload processing needs. Tyan has a wide range of bare bones server and storage hardware solutions  available for organizations and enterprise customers.

Unleash the Future of Innovation with HPC & AI

This whitepaper reviews how cutting-edge solutions from Supermicro and NVIDIA are enabling customers to transform and capitalize on HPC and AI innovation. Data is the driving force for success in the global marketplace. Data volumes are erupting in size and complexity as organizations work to collect, analyze, and derive intelligence from a growing number of sources and devices. These workloads are critical to powering applications that translate insight into business value.

Deep Learning GPU Cluster

In this whitepaper, our friends over at Lambda walk you through the Lambda Echelon multi-node cluster reference design: a node design, a rack design, and an entire cluster level architecture. This document is for technical decision-makers and engineers. You’ll learn about the Echelon’s compute, storage, networking,  power distribution, and thermal design. This is not a cluster administration handbook, this is a high level technical overview of one possible system architecture.

Driving ROI Through AI

This new report from ESI ThoughtLab was conducted alongside our friends over at DataRobot as well as a coalition of other AI leaders. The report shows that despite high adoption rates of AI in enterprises, ROI in AI still remains a work in progress and will take skill, scale, and time.

Massive Scalable Cloud Storage for Cloud Native Applications

In this comprehensive technology white paper, written by Evaluator Group, Inc. on behalf of Lenovo, we delve into OpenShift, a key component of Red Hat’s portfolio of products designed for cloud native applications. It is built on top of Kubernetes, along with numerous other open source components, to deliver a consistent developer and operator platform that can run across a hybrid environment and scale to meet the demands of enterprises. Ceph open source storage technology is utliized by Red Hat to provide a data plane for Red Hat’s OpenShift environment.