PALO ALTO, CA — Sept. 10th, 2024 — AI chips and models company SambaNova Systems announced SambaNova Cloud AI inference service powered by its SN40L AI chip. The company said developers can log on for free via an API today — no waiting list — and create their own generative AI applications using both the […]
Google in $1.67B AI Chip Patent Infringement Trial
Designing an AI specialty chip that gains traction is a sure way to riches. That’s why so much money – $1.67 billion – is at stake in a patent infringement lawsuit brought against Google. Reuters reported this week on a federal trial in Boston in which computer scientist Joseph Bates, founder of Singular Computing….
HPC News Bytes 20231120: SC23 Overview – Exascale Update, New AI Chips, Quantum Village, UCIe-PCIe-Ultra Ethernet
In this edition of the HPC News Bytes podcast, Shahin takes us on a rapid (5:04) tour of SC23, analyzing the key developments and new technologies that highlighted last week’s conference in Denver: Conference attendance expands to 14,000; exascale update and future; raft of new chips, many focused on AI; Quantum Village at SC23; UCIe, PCIe and Ultra Ethernet
SambaNova: New AI Chip Runs 5 Trillion Parameter Models
Specialty AI chip maker SambaNova Systems today announced the SN40L processor, which the company said will power SambaNova’s full stack large language model (LLM) platform, the SambaNova Suite. Manufactured by TSMC, the SN40L can serve a 5 trillion parameter model, with 256k+ sequence length possible on a single system node, according to the company.
@HPCpodcast: Evaluating Specialty AI Chips at Argonne’s AI Testbed
Our latest episode of @HPCpodcast delves into the Cambrian explosion of AI specialty chips – of which there so many have been released on the HPC-AI market that it’s hard to discern which chip is right for what workload. Hence the AI Testbed at the Argonne Leadership Computing Facility. Shahin and Doug spoke with Venkat Vishwanath….
Tenstorrent Selects Arteris IP for HPC RISC-V Chiplets
CAMPBELL, Calif. – May 2, 2023 – Arteris, Inc. (Nasdaq: AIP), a provider of system IP designed to accelerate system-on-chip (SoC) creation, today announced that Tenstorrent, the Toronto-based AI chip startup, has licensed Ncore and FlexNoC interconnect IP for its AI chiplet systems. According to Arteris, the flexible network-on-chip (NoC) interconnect meets the demanding time-to-market […]
Ng and Keller to Keynote at AI Hardware and Edge AI Summit, Sept. 12-14
March 17, 2023 — Two leading names in AI, Jim Keller of Tenstorrent and Andrew Ng of Landing AI, will deliver keynote addresses at this year’s AI Hardware & Edge AI Summit, Sept. 12-14, in Santa Clara. Registration for the conference is here. Keller, well-known HPC, AI and data center chip architecture and CEO of […]
Tattile Selects AI Chip Maker Hailo for Smart LPR Cameras for ITS
Mairano, Italy & Tel Aviv, Israel, September 14, 2022 – Italian automatic number plate recognition (ANPR) specialist, Tattile, part of TKH Group, announced today its technology partnership with leading edge AI chip maker Hailo to power their next generation of high-end cameras: the “SMART+.” The Hailo-8 AI processor will be integrated into Tattile’s new product line, […]
Untether AI Unveils At-Memory Compute Architecture at Hot Chips
PALO ALTO — Untether AI, an at-memory computation company for artificial intelligence (AI) workloads, today announced at the HOT CHIPS 2022 conference its next-generation architecture for accelerating AI inference workloads called speedAI devices, with an internal codename “Boqueria.” At 30 TeraFlops per watt (TFlops/W) and 2 PetaFlops of performance, the speedAI architecture sets a new […]
Habana Gaudi AI Chip Joins cnvrg.io’s Metacloud Marketplace of AI Infrastructure Solutions
SANTA CLARA, Calif., Dec. 8, 2021 — cnvrg.io, an operating system for artificial intelligence (AI) and machine learning (ML) built by data scientists, today announced that Habana’s Gaudi AI Accelerator is available on cnvrg.io’s Metacloud marketplace of AI infrastructure solutions. cnvrg.io Metacloud enables AI developers to run AI/ML workloads on a mix of infrastructure and hardware choices, […]