Oct. 27, 2023 — Datasaur, a natural language processing (NLP) data-labeling platform, today launched LLM Lab, an interface designed for data scientists and engineers to build and train custom LLM models like ChatGPT. The product will provide a wide range of features for users to test different foundation models, connect to their own internal documents, […]
Hyperion: HPC Community’s Interest in LLMs Has ‘Exploded,’ with Complexity, Cost Concerns
HPC-AI industry analyst firm Hyperion Research said its new study on Large Language Models in the HPC community shows that interest in LLMs has exploded in the last six months driven by unique capabilities of the technology to answer queries, generate concise summaries, and even produce unique works of fiction….
Large Language Models: The Largeness, the Power and the ‘Emergent’ Mystery
Large language models fit the classic model of a red-hot technology in an early stage of commercial viability: there’s more talk about it than knowledge, and FOMO – the fear that your competitors are implementing it at your peril – is helping to drive explosive demand. There’s also an allure and mystery around LLMs: some of the awe-inspiring “zero shot” things they do surprise even the data scientists who trained the models (more on this below). Against this backdrop….
Code Intelligence Launches LLM-Powered AI-Assistant for Software Security
Bonn, Germany, September 13th, 2023 – Code Intelligence today announced CI Spark, an LLM-powered AI-assistant for software security testing. CI Spark automatically identifies attack surfaces and suggests test code for them, enabling developers to reduce the manual effort needed to generate powerful white-box tests from multiple hours down to a few minutes, the company said, adding […]
HPC News Bytes 20230911: NVIDIA LLM Inferencing; Honeywell and Quantinuum; TSMC in Silicon Photonics; Microsoft Copilot AI Indemnification
As we reflect on the events of 9/11/2001 (it’s still living hell no matter how long ago it happened), let’s quickly (4:52) review last week’s HPC news highlights, including: NVIDIA TensorRT-LLM for faster AI inferencing; Honeywell integrates quantum-hardened encryption keys from Quantinuum; TSMC enters the silicon photonics arena; Microsoft to defend Copilot AI customers; Hyperion Research hosts HPC User Forum
MLCommons: MLPerf Results Show AI Performance Gains
Today ML Commons announced new results from two industry-standard MLPerf benchmark suites: Training v3.0, which measures the performance of training machine learning models, and Tiny v1.1, which measures how quickly a trained neural network can process new data for extremely low-power devices in the smallest form factors. To view the results and to find additional […]
Generative AI: Databricks to Acquire MosaicML for $1.3B
San Francisco-based data and AI startup Databricks today announced a $1.3 billion deal to acquire generative AI platform MosaicML, whose large language models (MPT-7B and MPT-30B) have more than 3.3 million downloads. The goal of the acquisition: reduce the time and cost of large language model training for generative AI, the companies said. Databricks, which […]