Nvidia Launches Flagship ‘Blackwell’ GPU at GTC

Nvidia CEO Jensen Huang took the stage for a two-hour GTC keynote event and victory lap, which was held before a sold-out crowd at the 17,500-seat SAP Center, home of the San Jose Sharks of the National Hockey League. Today’s news begins with Blackwell, which Nvidia said delivers six “revolutionary” technologies that together enable AI training and real-time LLM inference for models scaling up to 10 trillion parameters. Blackwell chips are comprised of 208 billion transistors ….

Survey on AI Infrastructure Spotlights GPU Challenges

 SAN FRANCISCO – March 13, 2024 – ClearML today announced new research findings from a global AI survey conducted with FuriosaAI and the AI Infrastructure Alliance (AIIA) called “The State of AI Infrastructure at Scale 2024“: Findings: 96 percent of respondents plan to expand their AI compute infrastructure with availability, cost, and infrastructure challenges weighing on their minds, […]

Vultr Expands Cloud Nvidia H100 GPU Capacity

March 5, 2024 — Cloud hosting and cloud servers company Vultr announced the expansion of our Seattle cloud data center region at Sabey Data Centers’ SDC Columbia location in East Wenatchee, Washington. This expansion includes an increase in NVIDIA HGX H100 clusters, which are now available on demand and via reserved contracts. Vultr’s decision to […]

NVIDIA Reveals Eos Supercomputer: 4,600 H100 GPUs for 18 AI Exaflops

NVIDIA Thursday released a video that offers the first public look at Eos (pictured here), a monster 18.4 exaflops FP8 AI supercomputer powered by 576 DGX H100 systems… NVIDIA said Eos would be ranked no. 9 on the TOP500 list of the world’s fastest supercomputers, according….

Trillions for Chips: A Roiled Semiconductor Industry Strains to Meet AI Demand

We’re seeing the chip industry’s version of the scientific aphorism: “nature hates a vacuum.” The vacuum is the short supply of – and vast demand for – AI chips, and it’s roiling the semiconductor industry. Chip foundry companies TSMC, Intel and Samsung are straining to expand GPU fab capacity….

Paderborn Center for Parallel Computing to Install Lenovo HPC Cluster

Lenovo announced it has won a contract for the joint construction of an AMD/NVIDIA-powered HPC cluster with the University of Paderborn in Germany. The system will include the Lenovo ThinkSystem SD665 V3 server with AMD EPYC processor technology….

WEKA Partners with NexGen Cloud on AI Supercloud

CAMPBELL, Calif., January 31, 2024 – AI-native data platform provider WekaIO (WEKA) announced today it is partnering with NexGen Cloud, a sustainable infrastructure-as-a-service provider based in the UK, to provide the high-performance infrastructure foundation underpinning its forthcoming AI Supercloud, as well as the on-demand services offered by Hyperstack, NexGen Cloud’s GPUaaS platform.  With a commitment […]

Eviden to Deliver Modular Data Center for Europe’s 1st Exascale System

This morning, Eviden made it official: it has been awarded a contract by Jülich Supercomputing Centre in Germany to build the modular data center to host the EuroHPC JUPITER supercomputer, Europe’s first exascale system. This is not surprising because….

Chip War: Banned NVIDIA GPUs Trickle into China, TSMC Shares Jump on AI

The advanced chips sector and its geopolitical significance is in the news this week as a Reuters story reports that “Chinese military bodies, state-run artificial intelligence research institutes and universities have over the past year purchased small batches of NVIDIA semiconductors,” including….

GigaIO’s SuperNODE to Power TensorWave Deployment with AMD MI300X

San Jose, California, December 6, 2023 – GigaIO, provider of open workload-defined infrastructure for AI and accelerated computing, has announced what the company said is the largest order yet for its SuperNODE utilizing tens of thousands of the AMD Instinct MI300X accelerators. GigaIO’s infrastructure will form the backbone of a bare-metal specialized AI cloud code-named […]