[SPONSORED GUEST ARTICLE] What sets Quobyte’s File Query Engine apart is its integration with the file system’s distributed metadata architecture. Unlike solutions that require separate database layers, Quobyte’s engine operates ….
Faster AI and HPC Workflows with Quobyte’s New File Query Engine
Quobyte Software-Defined Storage Brings Its HPC and Hyperscaler Heritage to AI Workloads
Quobyte comes from an HPC heritage as well as providing storage solutions for hyperscalers. “We wanted to bring HPC to the enterprise with the software approach that made the ….
Facial Recognition, Video Analytics, Geospatial and Other AI Techniques Aiding Ukraine’s Investigations of Russian War Crimes
…. investigators in Ukraine are engaged in the grim task of gathering evidence for possible war crime trials of Russian soldiers. For that, the investigators are aided by advanced machine learning technology, including facial recognition, geospatial data analytics, image analysis and analysis of streaming video from security cameras.
Atlan Raises $50M Series B to Build Collaboration Hub for Data Teams
NEW YORK, March 9, 2022 — Atlan announced that it has closed its $50M Series B round led by Insight Partners, Salesforce Ventures, and Sequoia Capital India. Several founders in the data space participated in the round including Fivetran founder Taylor Brown and ThoughtSpot founder Ajeet Singh. Atlan creates a unified discovery and collaboration experience, bringing context and […]
Beyond Discoverability: Metadata to Drive Your Data Management
Terrell Russell from iRODS gave this talk at SC19. “The Integrated Rule-Oriented Data System (iRODS) is open source data management software used by research organizations and government agencies worldwide. iRODS is released as a production-level distribution aimed at deployment in mission critical environments. It virtualizes data storage resources, so users can take control of their data, regardless of where and on what device the data is stored.”
Panel Discussion: Metadata and Archiving at Scale
In this video from the HPC User Forum, Henry Newman from Seagate Government Solutions leads a panel discussion on Metadata and Archiving at Scale. “Metadata is the key to keeping track of all this unstructured scientific data. It is “data about data.” It makes scientific data easy to find, track, share, move and manage – at low cost. Unfortunately, today’s high capacity storage systems only provide bare bones system consisting of as little as file name, owner and creation/access timestamps. Data intensive scientific workflows need supplemental enhanced metadata, along with access rights and security safeguards.”
Metadata Used in Science
Metadata is the key to keeping track of all this unstructured scientific data. It is “data about data.” In the case of scientific data, it is structured data (written in a prescribed schema or order) that describes what the data is, how it was derived, and where it is located.
Video: Scalability Testing of DNE2 in Lustre 2.7
Keeping Up with the Growth of Scientific Data
“Metadata, or data about data, lets scientists find the valuable data they are looking for. Metadata especially helps find value in data that’s been created by others, no matter when or where. Without rich metadata, scientists increasingly risk spending their time just looking for data, or worse, losing it – instead of exploiting that data for analysis and discovery.”
Lustre Metadata Performance and Solutions from Seagate
“Alongside the increasingly high demands of streaming bandwidth in HPC storage solutions, there is a growing need for higher levels of metadata performance for various applications and workloads. The Lustre parallel file system provides a distributed namespace, divided across multiple metadata servers, that allows the metadata throughput to scale with increasing servers. This presentation addresses meeting the increasing requirements for high performance metadata in Lustre environments with the ultimate aim of reducing the time to results and improving overall efficiency.”