Lambda Launches Inference API

Dec. 13, 2024 — AI company Lambda today announced its Inference API, which the company said enables access to LLMs through a serverless API for “a fraction of a cent.” The company said Lambda Inference API offers low-cost, scalable AI inference with such models as Meta’s recently released Llama 3.3 70B Instruct (FP8) at $0.20 […]

Lambda Launches Grant for AI Researchers, Expands Research Program

SAN JOSE, Dec. 9, 2024 — AI developer cloud Lambda today announced a research grant to power AI researchers’ most GPU-dependent workloads. Applications for the grant are now open and will be accepted on a rolling basis. AI researchers can apply here. Lambda said it expects to sponsor hundreds of researchers in the coming year, […]

Lambda Launches Nvidia-Based Cloud Clusters for AI Model Training

SAN JOSE, July 24, 2024 — GPU cloud company Lambda has unveiled Lambda 1-Click Clusters, designed for AI engineers’ and researchers’ short-term access to multi-node GPU clusters in the cloud for large-scale AI model training. Lambda said the launch marks the first time such access to NVIDIA H100 Tensor Core GPUs on 2 to 64 […]

Deep Learning GPU Cluster

In this whitepaper, “Deep Learning GPU Cluster,” our friends over at Lambda walk you through the Lambda Echelon multi-node cluster reference design: a node design, a rack design, and an entire cluster-level architecture. This document is for technical decision-makers and engineers. You’ll learn about the Echelon’s compute, storage, networking, power distribution, and thermal design. This is not a cluster administration handbook; it is a high-level technical overview of one possible system architecture.
