Azure Data Lake Gen 2¶
Before deploying the Denodo Embedded MPP on Azure Kubernetes Service check Denodo Embedded MPP Azure Checklist to make sure you have everything you need.
There are three options to deploy a Denodo Embedded MPP that will access Data Lake Storage gen2 datasets:
The recommended one: Add MSI Tenant ID and Client ID in the
kubectl create secret
command.Used when the Denodo Embedded MPP will run in Azure Kubernetes Service and will access Data Lake Storage gen2 using Azure Managed Identities.
You have to provide the following properties to the
kubectl create secret
command before the Embedded MPP is deployed:ENV Variable
Description
AZURE_OAUTH_MSI_TENANT_ID
MSI Tenant ID
AZURE_OAUTH_CLIENT_ID
Client ID
kubectl create secret generic mpp-credentials --from-literal=METASTORE_DB_PASSWORD=hive --from-literal=AZURE_OAUTH_MSI_TENANT_ID=azureoauthmsitenantid --from-literal=AZURE_OAUTH_CLIENT_ID=azureoauthclientid
It’s also necessary to set true the
objectStorage.azure.managedIdentities.enabled
property in values.yamlRun
helm install
sentencehelm install prestocluster prestocluster/
Provide OAuth2 client credentials to the
kubectl create secret
command.You have to provide the following properties to the
kubectl create secret
command before the Embedded MPP is deployed:ENV Variable
Description
AZURE_OAUTH_DIRECTORY_ID(*)
URL of OAuth endpoint
AZURE_OAUTH_CLIENT_ID
Client ID
AZURE_OAUTH_CLIENT_SECRET
Client Secret
(*) Note that AZURE_OAUTH_DIRECTORY_ID is
https://login.microsoftonline.com/${env.AZURE_OAUTH_DIRECTORY_ID}/oauth2/token
kubectl create secret generic mpp-credentials --from-literal=METASTORE_DB_PASSWORD=hive --from-literal=AZURE_OAUTH_DIRECTORY_ID=azureoauthdirectoryid --from-literal=AZURE_OAUTH_CLIENT_ID=azureoauthclientid --from-literal=AZURE_OAUTH_CLIENT_SECRET=azureoauthclientsecret
It’s also necessary to set true the
objectStorage.azure.oauth2ClientCredentials.enabled
property in values.yamlRun
helm install
sentencehelm install prestocluster prestocluster/
Provide the Azure credentials for the Shared Key authentication method to the
kubectl create secret
command before the Embedded MPP is deployed:ENV Variable
Description
ABFS_STORAGE_KEY
Azure Data Lake Gen 2 Storage Key
It’s also necessary to set true the
objectStorage.azure.sharedKey.enabled
and add the Azure Data Lake Gen 2 Storage Account in theobjectStorage.azure.sharedKey.account
properties invalues.yaml
.Run
helm install
sentencehelm install prestocluster prestocluster/