Google Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing

Over at GigaOM, Derrick Harris writes that Google has developed a data warehousing system called Mesa that is designed to handle real-time data while maintaining performance even if an entire data center goes offline. As an ACID-compliant database, Mesa is built for speed, scale, and reliability.

Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google’s Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and scalability for large data and query volumes. Specifically, Mesa handles petabytes of data, processes millions of row updates per second, and serves billions of queries that fetch trillions of rows per day. Mesa is geo-replicated across multiple datacenters and provides consistent and repeatable query answers at low latency, even when an entire datacenter fails.

Google engineers are presenting a paper on Mesa at the Very Large Data Bases (VLDB) conference next month in China.