Azure Data Lake Gen 2¶
Before deploying the Denodo Embedded MPP on Azure Kubernetes Service check Denodo Embedded MPP Azure Checklist to make sure you have everything you need.
There are three options to deploy a Denodo Embedded MPP that will access Data Lake Storage gen2 datasets:
The recommended one: Add MSI Tenant ID and Client ID in the
kubectl create secretcommand.Used when the Denodo Embedded MPP will run in Azure Kubernetes Service and will access Data Lake Storage gen2 using Azure Managed Identities.
You have to provide the following properties to the
kubectl create secretcommand before the Embedded MPP is deployed:ENV Variable
Description
AZURE_OAUTH_MSI_TENANT_ID
MSI Tenant ID
AZURE_OAUTH_CLIENT_ID
Client ID
kubectl create secret generic mpp-credentials --from-literal=METASTORE_DB_PASSWORD=hive --from-literal=AZURE_OAUTH_MSI_TENANT_ID=azureoauthmsitenantid --from-literal=AZURE_OAUTH_CLIENT_ID=azureoauthclientid
It’s also necessary to set true the
objectStorage.azure.managedIdentities.enabledproperty in values.yamlRun
helm installsentencehelm install prestocluster prestocluster/
Provide OAuth2 client credentials to the
kubectl create secretcommand.You have to provide the following properties to the
kubectl create secretcommand before the Embedded MPP is deployed:ENV Variable
Description
AZURE_OAUTH_DIRECTORY_ID(*)
Microsoft Directory ID or Tenant ID
AZURE_OAUTH_CLIENT_ID
Client ID
AZURE_OAUTH_CLIENT_SECRET
Client Secret
(*) Note that AZURE_OAUTH_DIRECTORY_ID is
https://login.microsoftonline.com/${env.AZURE_OAUTH_DIRECTORY_ID}/oauth2/tokenkubectl create secret generic mpp-credentials --from-literal=METASTORE_DB_PASSWORD=hive --from-literal=AZURE_OAUTH_DIRECTORY_ID=azureoauthdirectoryid --from-literal=AZURE_OAUTH_CLIENT_ID=azureoauthclientid --from-literal=AZURE_OAUTH_CLIENT_SECRET=azureoauthclientsecret
It’s also necessary to set true the
objectStorage.azure.oauth2ClientCredentials.enabledproperty in values.yamlRun
helm installsentencehelm install prestocluster prestocluster/
Provide the Azure credentials for the Shared Key authentication method to the
kubectl create secretcommand before the Embedded MPP is deployed:ENV Variable
Description
ABFS_STORAGE_KEY
Azure Data Lake Gen 2 Storage Key
It’s also necessary to set true the
objectStorage.azure.sharedKey.enabledand add the Azure Data Lake Gen 2 Storage Account in theobjectStorage.azure.sharedKey.accountproperties invalues.yaml.Run
helm installsentencehelm install prestocluster prestocluster/
