USER MANUALS

HTTP Path

Use an HTTP path when the data has to be obtained by sending an HTTP request to a server. This page explains everything related to the HTTP paths:

Find information about the Filters tab in Compressed or Encrypted Data Sources; filters work the same way for any type of path (local, HTTP, FTP…).

Configuration

With HTTP paths you can either send a request with the HTTP method GET, PUT, POST, DELETE or PATH.

There are two ways of adding parameters to the body of a POST, PUT or PATH request:

  1. URL parameters: if the URL contains query parameters, they will be removed from the URL and sent in the body of the request as the values of an HTML form. The “Content-type” header of the HTTP request will be “application/x-www-form-urlencoded”.

    For example, if the URL is http://acme/customer?first_name=John&last_name=Smith at runtime, the Server will send the HTTP request to http://acme/customer (without the query parameters) and the body of the request will be first_name=John&last_name=Smith.

  2. Post body: the contents of the text area below this option will be sent in the body of the HTTP request. Enter the content type of the body in the Content type box.

To add a header to the HTTP requests sent to retrieve the data, click HTTP headers. In the “HTTP Headers” dialog, click New and enter the name and the value of the header.

The following elements of an HTTP path can contain interpolation variables (the section Paths and Other Values with Interpolation Variables explains what interpolation variables are):

  1. The URL. It can contain one or more interpolation variables. For example, https://acme.com/department?id=@{department_id}

  2. The HTTP headers. Both the names and their values can be interpolation variables.

  3. The “Post body” can contain one or more interpolation variables. This is useful if you want to send a POST request and one or more values of the body need to be set at runtime, when the data source is queried. For example, let us say that the body of the POST request is the following:

    <employee_info>
       <last_name>@last_name</last_name>
       <first_name>@first_name</first_name>
    </employee_info>
    
HTTP POST request with XML Body with an interpolation variable

HTTP POST request with XML Body with an interpolation variable

The XML contains the string @first_name and @last_name. The character “@” indicates that first_name and last_name are interpolation variables, which means that at runtime, the strings @first_name and @last_name will be substituted by a value when this data source is queried.

When you create a base view over this data source, the view will have three extra fields: department_id, last_name and first_name. The queries to this view will always have to provide a value for these fields in the WHERE clause.

For example, if the query is

SELECT *
FROM view
WHERE department_id = 2 AND first_name = 'John' AND last_name = 'Smith'

the Server will send a request to https://acme.com/department?id=@{department_id} with this body:

<employee_info>
    <last_name>Smith</last_name>
    <first_name>John<first_name>
</employee_info>

Any query involving a base view created over this data source that does not provide a value for these three fields will fail.

Note

You have to escape the characters @, { and } when they are not part of the name of an interpolation variable. They are escaped with the character \ and the character \ with itself (\\).

<employee_info_request>
  <department_name>department \@1 \{ACCT\}</department_name>
</employee_info_request>

If the body of the request is loaded with Load file, the Tool escapes all these characters automatically.

The benefit of using interpolation variables is that the HTTP request to the Web server is not static and can be different for every query.

If the definition of the HTTP path has interpolation variables and you click “Test connection”, you will have to provide the value of the interpolation variables. You also have to do this when creating a base view over this data source. The Administration Tool will display a dialog like the following to provide the value of the variable.

Providing the value of an interpolation variable for an HTTP path

Providing the value of an interpolation variable for an HTTP path

In this dialog, select the URI parameter check box if the value of the variable is the value of a query parameter of the URL. If selected, the value of the variable will be escaped accordingly. Otherwise, if the variable is part of the URL, the value will be escaped as any other part of the URL. For example, if the URL is http://acme/recipe?name=@dish, when you provide the value of the variable “dish”, select the check box “URI parameter”. That way, if the value of dish is “Mac&Cheese”, the Server will send a request to http://acme/recipe?name=Mac%26cheese. Note that “&” has been properly encoded by replacing “&” with “%26”. If dish is not marked as a “URI parameter”, the URL will be http://acme/recipe?name=Mac&cheese, which in this case is not correct because as the URI has “&” after “Mac”, the Web server will treat “cheese” as another query parameter.

When the interpolation variable does not belong to the URL of the path, leave the “URI parameter” check box cleared.


The Connection timeout field enables the configuration of a timeout, in milliseconds, for the connection to be established. If the connection cannot be stablished within this time, the data source will cancel the request, and the query will fail.

Select the check box Check certificates if you are in one of these scenarios:

  • The service uses SSL/TLS (i.e. the URL starts with https) and you want Virtual DataPort to validate that the certificate presented by this service was issued by a Certificate Authority (CA) trusted by the Java Virtual Machine (JVM) included with the Denodo Platform. This validation will be performed for every connection established with the service.

    If the certificate presented by this service was not issued by a trusted CA or it was self-signed, but you still want Virtual DataPort to validate it, import the certificate into the list of trusted certificates of the JVM. The section Importing the Certificates of Data Sources (SSL Connections) of the Installation Guide explains how to do this.

  • Also, select this check box if the service requires SSL client authentication.

Clearing the check box has two implications:

  1. Virtual DataPort will accept any certificate presented by the service without checking who issued it.

  2. And, all the requests will fail if the service requires SSL client authentication.

Select the check box Generate output on empty content to allow the creation of a base view with this data source even if the http endpoint does not return data. The resulting base view will include a column named result, but this column will not contain any data when a query is executed.

Select the check box Ignore HTTP route errors to configure which HTTP errors the data source has to ignore. When the data source retrieves data from a source and one of the selected HTTP errors occur, the data source will fail silently and will not return rows. If any other error occurs, the data source will return an error, which will make the query to fail.

Configuring an HTTP path to ignore the 404 HTTP errors

Configuring an HTTP path to ignore the 404 HTTP errors

The difference between this option and Ignore route errors is that with Ignore route errors, the data source ignores any error that occurs when accessing the source.

Proxy Settings

In the Proxy tab, you can set a proxy configuration for this data source or use the Default configuration of the Server (see section Default Configuration of HTTP Proxy).

Authentication in HTTP Paths

The supported authentication methods for HTTP connections are:

If you select the check box Pass-through session credentials (available for the authentication methods “Basic”, “Digest”, “NTLM” and “SPNEGO (Kerberos)”), when a client executes a query that involves this data source, the credentials used to send a request to the service are the credentials of the user that executes the query; not the credentials of the fields “Login” and “Password”. When this option is selected, the credentials of the fields “Login” and “Password” are used only when creating base views over this data source, to send a request to the service and analyze the output of the URL.

The section SPNEGO (Kerberos) explains in detail the behavior of Virtual DataPort when the authentication method is “SPNEGO (Kerberos)” and “Pass-through session credentials” is selected.

Warning

Be careful when enabling the cache on views that involve data sources with pass-through credentials enabled. The appendix Considerations When Configuring Data Sources with Pass-Through Credentials explains the issues that may arise.

Mutual Authentication

When establishing an SSL/TLS communication (e.g. with “https”), the client (in this case, Denodo) verifies the identity of the service by checking if the certificate used by this service is signed by a certification authority (CA). With “mutual authentication” (also known as two-way SSL/TLS), the client (in this case, Denodo server) also uses a certificate for authentication instead of user and password or an OAuth token.

To use this feature, you need a key store file that contains the private key to access the service. This file has to be in the formats PKCS#12 or Java Key Store (JKS).

To enable this authentication method on an HTTP route, follow these steps:

  1. In the “Edit HTTP connection” dialog, click the Authentication tab.

  2. In the Authentication list, select Mutual (two-way SSL).

  3. In Certificate password, enter the password of the file with the private key.

  4. Click Load certificate and select the file with the private key.

    If the certificate is valid, the tool will display the issuer of the certificate, the expiration date of the certificate, etc.

Note

If you want Virtual DataPort to validate the certificate sent by the service, select Check certificates in the Configuration tab. In order for this validation to succeed, the certificate used by the service has to be signed by a Certification Authority (CA). Otherwise, you have to import the certificate into the TrustStore of the Denodo server or the communication will fail.

Mutual Authentication wizard

Mutual Authentication wizard

OAuth Authentication

OAuth is an authorization framework that allows third-party applications (in this case, Virtual DataPort), to access resources on a server on behalf of a resource owner.

The main benefit is that you do not need to share your username and password with third-party applications in order to authorizing them to access your data.

The following subsections explain how to use the wizards that help you obtain the credentials needed to connect to a service with OAuth 1.0a or OAuth 2.0 authentication.

Note

Before creating the data sources in Virtual DataPort, you have to register Virtual DataPort as an application in the service that you want to access.

Note

We recommend creating a single data source for all the views that retrieve data from the same OAuth-authenticated service. The reason is that, if at any point, the OAuth credentials change, you will only have to change them in one data source. To do this, you can create the data source with an interpolation variable in the URL (http://service.com/@OBJECT_TYPE/ <http://service.com/@OBJECT_TYPE/>)

OAuth 1.0a

Note

If you are not sure if your service uses OAuth 1.0a or 2.0, most likely it uses OAuth 2.0, not this one.

This section explains how to configure an “HTTP Client” route to retrieve data from a service with OAuth 1.0a authentication. The Tool provides the OAuth 1.0a credentials wizard to help you obtain these credentials.

Follow these steps:

  1. In the “Edit HTTP connection” dialog, click the Authentication tab.

  2. Select OAuth 1.0a in the Authentication list.

  3. Enter the Client identifier and the Client shared secret provided by the service.

  4. Select the Signature method. The HMAC-SHA1 signature is the most used, so usually is the right option.

  5. If you already have the Access token and the Access token secret, enter them in the boxes below and click Ok.

    If you do not have these tokens, click launch the OAuth 1.0a credentials… to open the wizard that will help you obtain them.

OAuth 1.0a credentials wizard

OAuth 1.0a credentials wizard

  1. Enter the Temporary credential request URL, the Resource owner authorization URL and the Token request URL

    The documentation of the service you are accessing must provide these details.

  2. Select the Callback URL. When you get to the step 2 of the wizard, you will have to open a URL in your browser. In this URL, the service displays a page where you have to authorize Virtual DataPort to access your data. If you proceed, you will obtain the Verification code, which Virtual DataPort will use to send an HTTP request to the service. The response will contain the Access token and the Access token secret.

    The Callback URL determines how the service will return the Verification code.

    Note

    Depending on the service, you cannot select any option. Some of them force you to use a specific redirect URL, others only allow oob, etc.

    1. oob: with this option, the wizard will request the service to display the Verification code in your browser after the authentication process.

    2. If you select the second or the third option, the service will redirect your browser to this URL and it will add the parameter code to it. The value of this parameter is the Verification code.

      The default URL (http://localhost:9090/oauth/1.0a/callbackURL.jsp) points to a JSP located in the Apache Tomcat embedded with the Denodo Platform, which will display the value of the code parameter in a box that makes it easier to copy it.

      If you have to indicate another callback URL, you will have to extract manually the value of the code parameter from the URL.

  3. Click Generate the authorization URL. Virtual DataPort will request a Temporary token and with it, it will generate the Authorization URL.

  4. Click Open URL. If the browser is not launched, copy the URL and open it manually.

    In this URL, you have to authorize the Virtual DataPort server to retrieve data from the service.

  5. After authorizing Virtual DataPort to access your data, the service returns the Verification code. Enter this code in the Paste the verification code text field.

    If the Callback URL is oob, you have to type the value. If you have selected the default URL, you can copy it and paste it into this box.

  6. Click Obtain the OAuth 1.0a credentials. The Server will request the OAuth tokens using all the details you have provided and the Verification code.

  7. Click Ok to close the wizard.

    The wizard will fill the text areas “OAuth access token” and “OAuth access token secret”.

  1. Click Ok to close the “Edit HTTP Connection” dialog and then, Save to create the data source.

To use this wizard independently, you can do so by clicking on OAuth 1.0a wizard on the menu Tools > OAuth credentials wizards of the Administration Tool.

You may need to use this wizard when using a custom wrapper whose input parameters are OAuth credentials.

OAuth 2.0

This section explains how to configure an “HTTP Client” route to retrieve data from a service with OAuth 2.0 authentication. The Tool provides the OAuth 2.0 credentials wizard to help you obtain these credentials.

Note

If you are not sure if your service uses OAuth 1.0a or 2.0, most likely it uses OAuth 2.0.

The Tool provides the OAuth 2.0 credentials wizard to help you obtain these credentials.

Configuring OAuth 2.0 authentication for a data source

Configuring OAuth 2.0 authentication for a data source

Follow these steps:

  1. In the “Edit HTTP connection” dialog, click the Authentication tab.

  2. Select OAuth 2.0 in the Authentication list.

  3. Select the appropriate Authentication grant:

    1. Authorization code grant. This is the safest option because you do not have to enter your user name and password for the service. You only have to obtain an “access token” and “refresh token”, which you can do with the help of the wizard of this dialog. An additional benefit of this grant is that generally - it depends on the service - you can limit the operations this data source will be able to do (e.g. only allow read access to the data). In addition, if the access token or the refresh token ever get compromised, they can be revoked without having to change the password of your user account in the service.

    2. Resource owner password credentials

    3. Client credentials grant

    The second and third options are easier to configure because you do not have to obtain the access token nor the refresh token. However, they do not have the benefits of the first option.

    Check the documentation of this service to see what options are available.

    These options are described in detail in the standard (RFC 6749 - The OAuth 2.0 Authorization Framework).

  4. Enter the Client identifier and the Client secret provided by the service.

  5. Enter the User identifier and the User password (only if you selected Resource owner password credentials).

  6. Select one of the options of Authentication method used by the authorization server. This controls how Virtual DataPort will send the credentials to the service when requesting a new OAuth access token. The options are:

    1. Include the client credentials in the body of the request: Virtual DataPort will add the credentials to the body of the request, in the parameters client_id and client_secret.

    2. Send credentials using the HTTP Basic authentication scheme: Virtual DataPort will send the credentials of the user in the Authentication header of the HTTP request.

    These two options are described in the section “2.3.1. Client Password” of the OAuth 2.0 specification (RFC 6749 - The OAuth 2.0 Authorization Framework).

    Although the first option is more common, some services require the second one.

  7. If you already have the OAuth access token, enter it in the Access token box and select the appropriate Request signing method. If you also have the Refresh token, enter it in the Refresh token box enter the value of the Token endpoint URL and, if you know it, the number of seconds until the access token expires.

    If you do not have the access token and it will be provided at runtime instead of being stored in the data source, select Access token value is an interpolation variable and, in the box below, enter a name for the variable. At runtime, the queries to the base views of this data source will have to provide a value for this variable. This value will be the access token used to connect to the source. This option is useful if the source requires OAuth 2.0 authentication but does not fully implement the standard. In this case, you can develop a stored procedure that obtains this token and pass it to the base view.

    If you do not have the access token and want to obtain it from the source, click launch the OAuth 2.0 credentials… to open the wizard that will help you obtain it.

OAuth 2.0 credentials wizard

OAuth 2.0 credentials wizard

  1. Enter the Token endpoint URL.

  2. Only if you selected Authorization code grant, enter the Authorization server URL.

  3. Only if you selected Authorization code grant, select the Redirect URI. When you get to the step 2 of the wizard, you will have to open a URL in your browser. In this URL, the service displays a page where you have to authorize Virtual DataPort to access your data.

    If you proceed, the service will redirect your browser to the Redirect URI and it will add several parameters to it. Virtual DataPort will use the values of these parameters to send an HTTP request to the service. The response will contain the Access token and maybe, the Refresh token.

  4. Click the button image5 for each scope you want to add and enter its name.

    Scopes are “privileges” defined by the service, which control the data that the application can request.

    For example, Twitter defines several scopes and depending on the scopes requested in this wizard, Virtual DataPort will be able to retrieve your tweets, but may not post new ones on your behalf.

Only do steps e. to h. if you selected the Authorization code grant in the previous dialog.

  1. Usually, you can leave the Set the “state” request parameter selected. However, if the process of obtaining the OAuth credentials fails, check that the service allows setting this parameter.

  2. Click on Generate the authorization URL.

    Virtual DataPort will generate a URL with all the parameters you have provided.

  3. Click on Open URL.

    If the browser is not launched, copy the URL and open it manually.

    In this URL, you have to authorize the Virtual DataPort server to retrieve data from the service.

  4. After authorizing the application, the service will redirect you to a URL. Paste this URL in the text field of step 3.

  5. Click on Obtain the OAuth 2.0 credentials.

    The Server will request the OAuth credentials using all the details you have provided and the parameters of the URL you have pasted in the previous step.

  6. Click Ok to close the wizard.

    The wizard will fill the text areas and text fields with the information returned by the service.

    Not all the services provide a Refresh token, so this text area may be empty.

  1. Select the Request signing method. Virtual DataPort has to sign each request with the Access token. Usually, all OAuth services allow the “Authorization” request header method, which consists on adding a special HTTP header to the request. If the service does not support this method, you can select the other methods defined by the standard:

    1. Form-encoded body parameter: send the token in the body of the request (only available with HTTP POST requests)

    2. URL query parameter (“access_token”): the token is sent in the parameter access_token of the URL.

    3. Or, add the token as a query parameter with a name different from “access_token” (URL custom query parameter).

  2. If you do not have the refresh token and it will be provided at runtime instead of being stored in the data source, select Refresh token value is an interpolation variable and, in the box below, enter a name for the variable. At runtime, the queries to the base views of this data source will have to provide a value for this variable. This value will be used to refresh the access token if necessary. This option is useful if the source requires OAuth 2.0 authentication but does not fully implement the standard. In this case, you can develop a stored procedure that obtains this token and pass it to the base view.

  3. Some REST APIs with OAuth 2.0 authentication require clients to send additional parameters. Click extra parameters of the refresh token requests to add these parameters. Denodo will add them to these requests:

    1. In the requests to obtain an access token for the first time. That is, it will add this parameter to the URL generated in the OAuth Credentials Wizard, when you click Obtain the OAuth 2.0 credentials.

    2. In the requests the data source sends to refresh the access token. That is, when the current access token expires and the data source has to obtain a new one, using the refresh token.

    For instance, when connecting to Microsoft services on the cloud with OAuth 2.0 authentication, you may need to add the parameter resource. Its value is the identifier of the application you want to connect to. This is a requirement of this API.

    In this dialog, if you select the check box Encrypted next to the value of the parameter, the data source stores this value encrypted.

  4. If you select Pass-through session credentials, select one of the following pass-through strategies:

    Important

    The selected strategy is only relevant with pass-through session credentials when the user logged in to Virtual DataPort with OAuth authentication.

    • Token exchange flow (RFC 8693). The data source will use the Token exchange flow (defined in the RFC 8693) to retrieve the access token to establish the connection.

    • On-behalf-of flow (Azure AD). The data source will use the Microsoft proprietary On-behalf-of flow to retrieve the access token to establish the connection.

    • OAuth Token pass-through. The data source will connect to the source with the same access token the client application used to log in to Virtual DataPort.

    Besides, there will be some fields with default values and it will show new fields:

    • Authentication grant will be selected as Resource owner password credentials.

    • Authentication method used by the authorization server will be selected as Include the client credentials in the body of the request.

    • Request signing method will be selected as “Authorization” request header.

    • Scope will appear in the end and you must enter the desired scopes to get a new access token in case the access token of the session expires.

    Note

    If the user will authenticate through Denodo SSO, it is important to configure the Denodo Security Token to include the original OAuth token (see Denodo Security Token Configuration File) to make sure that Virtual DataPort server retrieves the token that will be used in the pass-through strategy.

  1. Click Ok to close Edit HTTP Connection and then, Save to create the data source.

Usually, you only need to launch the OAuth 2.0 wizard from the dialogs “Create JSON data source” or “Create XML data source”. However, if you need to use this wizard independently, you can do so by clicking on OAuth 2.0 wizard on the menu Tools > OAuth credentials wizards of the Administration Tool.

You may need to use this wizard when using a custom wrapper whose input parameters are OAuth credentials.

SPNEGO (Kerberos)

When the authentication method of the data source is “SPNEGO (Kerberos)”, Virtual DataPort will use a Kerberos ticket to add an authentication header to the HTTP requests sent to the service.

If you clear the check box Pass-through session credentials, the Server will use the values of the “Login” and “Password” boxes to connect to the Key Distribution Center (KDC) and request a Kerberos service ticket.

If you select the check box Pass-through session credentials, Virtual DataPort will use the credentials of the client to obtain a Kerberos service ticket, on behalf of the client that is executing the query that involves this data source. The exact behavior of Virtual DataPort depends on the authentication method used by the client:

  1. The client connects to the Virtual DataPort server using Kerberos authentication: the Server will request a service ticket to the Key Distribution Center (KDC) on behalf of the client that executes the query, using the ticket-granting ticket (TGT) obtained when this client opened the connection to the Server. Then, it will use this service ticket to add an authentication header to the HTTP requests sent to the service.

  2. The client connects to the Virtual DataPort server using standard authentication: the Server will request a service ticket to the KDC using the user name and password of the client that executes the query. Take into account the following:

    1. If the Virtual DataPort server is running on Windows but the host does not belong to a Windows domain, define the system properties “java.security.krb5.realm” and “java.security.krb5.kdc” as explained in the section Enabling Kerberos Authentication Without Joining a Kerberos Realm of the Installation Guide.

    2. If the Virtual DataPort server is running on Linux, you need the system to have a krb5.ini file. See the section Providing a Krb5 File for Kerberos Authentication of the Installation Guide for more information about how to check if there is already one in your system.

Connection Pool

By default, data sources with HTTP paths maintain a pool of connections to the source, this way requests will be executed faster. This connection pool can be disabled by setting the following properties:

-- This property indicates if connections are checked when they are retrieved from the pool.
-- (Default value: false)
SET 'com.denodo.parser.connection.http.testOnBorrow' = 'true';

-- This property indicates the time in milliseconds that a connection can remain idle at the pool available for being used to execute a new request.
-- When this timeout is reached, then the connection will be removed from the pool in the next checking. This property only applies when "testOnBorrow" is true.
-- (Default value: 10000)
SET 'com.denodo.parser.connection.http.minEvictableIdleTimeMillis' = 0;
Add feedback