SUNNYVALE, Calif. & RIYADH, Saudi Arabia – Cerebras Systems today announced the signing of a memorandum of understanding with Aramco under which they aim to bring high performance AI inference to industries, universities, and enterprises in Saudi Arabia. Aramco plans to build, train and deploy large language models using Cerebras’ CS-3 systems. Aramco’s new high-performance […]
Aramco and Cerebras Sign AI MoU
Filed Under: Business of HPC, Google News Feed, HPC Hardware, HPC Software, Machine Learning, News Tagged With: AI, Aramco, Cerebras, CS-3, HPC', Wafer Scale Engine
Cerebras Claims Fastest AI Inference
AI compute company Cerebras Systems today announced what it said is the fastest AI inference solution. Cerebras Inference delivers 1,800 tokens per second for Llama3.1 8B and 450 tokens per second for Llama3.1 70B, according to the company, making it 20 times faster than GPU-based solutions in hyperscale clouds.
Filed Under: Compute, HPC Hardware, Machine Learning, News Tagged With: AI inference, Artificial Analysis, Cerebras, CS-3, DeepLerning.AI, LLMs, Meta, Meta Llama, Wafer Scale Engine