Cerebras Claims Fastest AI Inference

August 27, 2024 by staff

AI compute company Cerebras Systems today announced what it claims is the fastest AI inference solution. Cerebras Inference delivers 1,800 tokens per second for Llama 3.1 8B and 450 tokens per second for Llama 3.1 70B, according to the company, making it 20 times faster than GPU-based solutions in hyperscale clouds.