New 500TB TPC-H* benchmarks demonstrate 17X faster query performance vs. Snowflake, 36X faster than Google BigQuery, and 17X faster than Databricks.
Oracle today announced the general availability of MySQL HeatWave Lakehouse, delivering an industry-first by enabling customers to query data in object storage as fast as querying data inside the database. MySQL HeatWave Lakehouse supports various object store file formats such as CSV, Parquet, and export files from other databases. It can combine object storage file data and MySQL database transactional data in the same query. Object store files are queried directly by HeatWave without copying the data into the MySQL database. As a result, MySQL HeatWave Lakehouse sets new standards for scalability and performance for query processing, speed of loading data, cluster provisioning time, and automation to query data in object storage.
In an industry first, Oracle makes lakehouse performance identical to database query performance.
“More than 80 percent of data is stored in file systems, and that number is growing. Customers want to integrate and analyze this varied external data with their internal transactional data, but it’s often too complex or too expensive to process,” said Edward Screven, chief corporate architect Oracle. “MySQL HeatWave Lakehouse makes it easy for customers to get valuable real-time insights by combining their data in object storage with database data while gaining significantly higher query performance and much faster data loading at a lower cost.”
Querying data in object storage is as fast as querying data inside the database
As demonstrated by a 10 TB TPC-H* benchmark, querying data in object storage in popular file formats with MySQL HeatWave Lakehouse is as fast as querying data in the MySQL database. This is made possible by MySQL Autopilot, a built-in capability of MySQL HeatWave that provides machine learning-powered automation, which learns from the execution of queries and improves the execution plan of future queries. MySQL Autopilot is an innovation in MySQL HeatWave that is unavailable anywhere else. AMD EPYC™ processors power MySQL HeatWave on Oracle Cloud Infrastructure (OCI).
“The AMD and MySQL HeatWave engineering teams are closely collaborating to optimize MySQL HeatWave for AMD EPYC processors to take advantage of new processor capabilities,” said Forrest Norrod, executive vice president and general manager of the Data Center Solutions Business Group, AMD. “Thanks to this collaboration, MySQL customers running MySQL HeatWave on AMD EPYC CPU-powered OCI instances benefit from an outstanding price-performance advantage for their business-critical workloads, including real-time analytics on massive amounts of data stored in object storage.”
Best performance for lakehouse use cases
As demonstrated by a 500 TB TPC-H* benchmark, the query performance of MySQL HeatWave Lakehouse is:
1. 9X faster than Amazon Redshift
2. 17X faster than Snowflake
3. 17X faster than Databricks
4. 36X faster than Google BigQuery
The performance to load data from the object store with MySQL HeatWave Lakehouse is as follows:
1. 9X faster than Amazon Redshift
2. 6X faster than Databricks
3. 2X faster than Snowflake
4. 8X faster than Google BigQuery
MySQL HeatWave’s unrivaled performance results from its scale-out architecture, enabling massive parallelism to provision the cluster, load data, and process queries with up to 512 nodes. In addition, enhancements to MySQL Autopilot automate metadata creation for object files and dynamically adapt to the performance of the underlying object store to provide the best performance in any OCI region.
MySQL HeatWave is the only cloud service that provides transaction processing, real-time analytics, machine learning, data lake querying, and machine learning-based automation within a single MySQL database service. A core part of Oracle’s distributed cloud strategy, MySQL HeatWave is available in OCI, natively on Amazon Web Services, as part of the Oracle Database Service for Azure, and in customers’ data centers with OCI Dedicated Region.
“With the availability of Oracle MySQL HeatWave Lakehouse, our customers will further experience enormous improvements in performance, automation, and cost. India’s enterprises of all sizes can gain much deeper insights into their business performance in real-time that no one else in the industry can offer. With MySQL Heatwave Lakehouse, in a single query, customers can query transactional data in the MySQL database and combine it with data in the object store. It also makes it easy for customers to get valuable real-time insights,” said Saravanan Palanivel, vice president, Cloud Engineering, Oracle India.