Denodo Lakehouse Accelerator
Note
This feature is only available with the subscription bundle Enterprise Plus. To find out the bundle you have, open the About dialog of Design Studio. See more about this in the section Denodo Platform - Subscription Bundles.
The Denodo Platform includes the Denodo Lakehouse Accelerator (formerly known as Denodo Embedded MPP), which embeds a Massively Parallel Processing (MPP) engine to improve performance in environments that store data in object storage. For this purpose, Denodo embeds a customized version of Presto, an open source parallel SQL query engine that excels at accessing data lake content.
To deploy the Denodo Lakehouse Accelerator cluster, pull the images from Denodo’s Harbor Registry (https://harbor.open.denodo.com) or load them from the Denodo Connect available at the Support Site. Then follow the instructions in the Denodo Lakehouse Accelerator Guide. The Denodo Connect “Denodo Lakehouse Accelerator” includes an MPP engine based on Presto that has been customized to interact with the Denodo Platform. In addition, the deployment process includes a final step that creates a new special data source in Denodo called “embedded_mpp”.
It also configures the Denodo query optimizer to consider this embedded MPP for query acceleration.
The data source “embedded_mpp” is located in a new database “admin_denodo_mpp”. It can be used for multiple purposes:
Explore object storage such as Amazon S3, Azure Data Lake Storage or HDFS and create base views over data stored in Parquet or Delta Lake format (see Object Storage Data in Open Table Formats (Parquet, Iceberg and Delta Lake)).
Read data in Parquet and Delta Lake format from the object storage with the power of a massively parallel processing engine.
Load data into the object storage using Parquet or Iceberg format (see Create Iceberg Tables in the Denodo Lakehouse Accelerator).
Cache data from Denodo server views. The Denodo server can also use the “embedded_mpp” data source as a cache (see Configure the Denodo Server to Use the Denodo Lakehouse Accelerator as Cache Using Iceberg Tables).
Accelerate queries. The embedded MPP of the Denodo Lakehouse Accelerator allows the query optimizer to apply new Embedded MPP Acceleration techniques that have been specially designed for queries accessing this kind of data.
