USING THE DATA MARKETPLACE

The Denodo Data Marketplace (also known as Denodo Data Catalog) is a web based self service tool included in Denodo Platform that lets both technical and business users query, search and browse information and metadata stored in a Virtual DataPort server. With this tool, users can generate new knowledge and pave the way to make better decisions.

Denodo Architecture

Scenario

In this tutorial, we are going to show this use case:

The IT/Data department in our company frequently receives requests for data access. These requests are often unclear about the source where the data is stored, which causes delays in processing them. In many cases, the lack of understanding of the underlying systems by business users results in the requests taking longer to fulfill than necessary.

For solving that use case, following this tutorial you will learn how to:

  • Use Denodo Data Marketplace for exploring the data assets available in the company
  • Enrich the Data Marketplace metadata, adding Tags, Domains, and Categories
  • Manage Data Products Graphically
  • Use Generative AI for getting que required data
  • Collaborate with other users of the Data Marketplace

Before starting this tutorial, please ensure to complete the configuration steps performed in the previous tutorial for Data Marketplace Administration, which will allow you to fully explore the features of the Data Marketplace that we will go through in this tutorial.

If you have followed the previous tutorial, in your Design Studio you will have something similar to this:

Design Studio - Elements tree

Metadata Search

To start, open the Denodo Platform Control Center, and start the Virtual DataPort server and the Data Marketplace. Once it changes the status to "Running", click the Data Marketplace link to open the Web tool (by default: https://127.0.0.1:9090/denodo-data-catalog).

  • Now login to the Data Marketplace using the standard login details (admin/admin):

Data Marketplace login page

Our first example is from the Data Marketplace home screen. Let's use the scenario of the Business Analyst to explore a simple use case, by searching for clients, by typing in client and hitting enter.

Data Marketplace search bar

Here we have the results of our search. This search will seek views or web services that contain the query terms in the element's metadata, such as:

  • The name and description.
  • The categories and tags assigned to the view.

Data Marketplace search results


For example, let's click on the view bv_crm_client to be taken to the summary of the selected view, showing more information like the values of any custom properties it has assigned:

Data Marketplace view details

Content Search

We can also use the Index feature to explore data using the Content Search function.

  1. In the Data Marketplace, click on the Back link link to return to the Search page, and select the following options in the Filters:

    Select Content (this option appears only after configuring the index following the steps of the previous tutorial)
    Database: tutorial
  2. Click on Apply filters button

Data Marketplace content search selection

  1. Now, type James into the search field and hit Enter to run the search:

Data Marketplace content search

  1. The search will return all the views that include the string James in their data sets. Click the Preview results link to expand the results to show the field that matches the search.

Data Marketplace content search - results

  1. You can also click on the bv_crm_client view name and see the indexed data. In this Search tab, you can search the index directly.

Data Marketplace view details - search 1

  1. For example, we can now search Jack, and the results from the Index are returned.

Data Marketplace view details - search 2

Completed! We have done a search in the Virtual DataPort metadata and content. In the next section, we will explore other advanced functions of the Data Marketplace!

In this section, we will explore the features of the Data Marketplace to enrich the metadata of the data assets available.

A useful feature of the Data Marketplace is the ability to display view metadata, such as the View Description, as well as the Field Descriptions. By default, this metadata is obtained from Virtual DataPort server but the Data Marketplace allows users to update the view and field descriptions.

In our example we are going to add descriptions to the fields of the bv_crm_client view, to allow more specific discovery of this view. Let's see how to modify that information.

View and Field Descriptions

  1. Navigate to the Summary page of the bv_crm_client view and click on the Edit icon icon of the Description field.

Data Marketplace view details - edit description

  1. Add the appropriate description to the View (for example, Base view of the Client table on the CRM data source) and click Ok.

Data Marketplace view details - edit description window

  1. Similarly, add a description to fields by navigating to the Schema tab and click on the Edit icon icon under each field.

    client_id: Unique client identifier
    name: First name of the customer as captured on the CRM
    surname: Surname of the customer as captured in the CRM
    client_type: Indicates the client type, either 01 or 02

Data Marketplace view details - field descriptions

  1. The view now displays the added descriptions. These descriptions are saved in the Data Marketplace metadata.

You can use the Denodo Data Marketplace to add tags and categories to views.

Tags & Categories are useful to allow users to search with more accuracy through the Data Marketplace. While the amount of Data Sources and Views is small in our tutorial, it will pay off over the long term to maintain good Categorization and Tagging habits to allow users to navigate the Data Marketplace more easily.

Categories

  1. Navigate to Administration > Set-up and Management
  2. In the Administration window under Marketplace Management, click on Categories option

Data Marketplace Categories Management

  1. Click the + Add Category icon.

Data Marketplace - Add Category

  1. Create a category with the following details:

    Name: Customer
    Description: Data products related to customers

Data Marketplace - Create Category wizard

  1. Create another category with the following details:
  • Name: CRM
  • Description: CRM System
  • Choose Customer category checkbox to select it as parent category

Data Marketplace - Create Category wizard with parent category 1

  1. Create a final category with the following details:
  • Name: Billing
  • Description: Customer Billing
  • Choose Customer category checkbox again

Data Marketplace - Create Category wizard with parent category 2

We now have a useful set of categories to link to our Views.

Data Marketplace - Categories list

Tags

  1. Navigate to Administration > Set-up and Management.
  2. In the Administration window under Marketplace Management, click on the Tags option.
  3. Click the + Add Tag icon and create a new Tag with the following details:
  • Name: #public
  • Description: Data that is readily available for public use

Data Marketplace - Create tag 1

  1. Create another tag with the following details:
  • Name: #confidential
  • Description: Sensitive data that could negatively impact operations or execution processes

Data Marketplace - Create tag 2

We now have a useful set of tags to link to our Views.

Data Marketplace - Tag list

  1. We can now navigate to the bv_crm_client view and click on the Actions > Add Tag / Add Category options in the Summary tab.

Data Marketplace - View actions

  1. Click on Add Category, select CRM, and then click on Ok.

Data Marketplace - View actions - add category

  1. Now click on the Add Tag button, select #confidential, and click Ok.

Data Marketplace - View actions - add tag

  1. We have now added this view to the Customer > CRM category and have tagged it with the #confidential tag.

Data Marketplace - View details with categories and tags

Browse using Tags & Categories

Finally, let's do a quick review on how to browse your views and web services by tags, go to Browse > Tags.

  1. In the sidebar, you will see the list of tags available in the Data Marketplace.

Data Marketplace - Browse tags

  1. Click the tag #confidential to see the elements that have been assigned with this selected tag (select the Views tab).

Data Marketplace - Browse by confidential tag

  1. Similarly you can browse by categories, going to Browse > Categories!

We have now seen how the effective use of the Categories and Tags can enable powerful data exploration.

In the next sections we are now going to explore other features that offer more in-depth interrogation of a view in the Data Marketplace. This includes:

  • Querying a view, Saving the query, filtering results and exporting results to a file
  • Creating new fields with the Data Preparation wizard
  • Exploring view relationships and data lineage

From the previous section, we have selected our bv_crm_client view. We can now explore the contents of this view.

From the Query tab, ad-hoc queries can be run against the view (the query is created graphically). For example, select the following fields and drag them into the Output columns area.

  • client_id, name, surname, client_type

Data Marketplace - Query View

Now click Execute, to get the results:

Data Marketplace - Query View - results


Of course, the Data Marketplace allows exporting the results! You can select CSV, HTML, Excel or Tableau as output format by clicking the Export iconbutton.

Data Marketplace - Export results

More Options Available When Querying a View

If we want to filter the results of the view, and, for example, order the results by the surname, we can easily do so. Click the Definition link to bring back the query options.

Data Marketplace - Query View - return to definition

Begin by dragging the field by which we want to filter, for example dragging field client_type to the Filters section. We will now need to select an operator ( = ) and the value ('02').

Data Marketplace - Query View - filter wizard

We could also add the surname field to the Order By section for which we want to order the results by, and click the arrow to change the Order By to descending order.

Data Marketplace - Query View - add filters and order by

Now click Execute. The results now are filtered to only include results for customer_type = '02', and the results are ordered by the surname field.

Data Marketplace - Query View - results after filter

You can click on the VQL icon button to see the query that is being executed in Denodo Platform!

Adding New Output Fields

Let us consider the scenario where we want to combine the name and surname fields into a new full_name field. We can do this by concatenating the name with the surname following these steps:

  1. Go back to the Definition of the query. In the Output columns section, click on three dots and then click on Add option.

Data Marketplace - Query View - Add new field

  1. In the "New output field" dialog, click on the Edit button beside the Field name column and provide field name as full_name and Expression as concat(name,' ',surname)

Data Marketplace - Query View - Add new field wizard

  1. Click on Execute. Now our results include the newly created full_name field.

Data Marketplace - Query View - Results with new fields

  1. If we would like to save this query for later use, we can click Save icon.

Data Marketplace - Query View - Save query

  1. This will save the query under the Query > My Queries section of the top menu.

Data Marketplace - My queries

Relationships

The Denodo Data Marketplace is able to show the associations/relationships between the views. This is useful for the business user to understand how certain views are related.

This information is shown under the Relationships tab.

Data Marketplace - View details - Relationships

Data Lineage

The Data Lineage tab displays a tree graph with all the data sources and views used to build the current view.

If we click on one of the fields under View fields, we will be able to see the lineage of a specific field. This is especially useful when dealing with complicated derived views, as we will explore later.

Data Marketplace - View details - Data lineage

By clicking on a node, you can see the details of the corresponding data source or view (e.g. Name, Type, Description, Projected fields, Join conditions, etc).

Data Marketplace - View details - Data lineage details

Lineage of Complex Views

Let us now view the lineage of a more complex view.

Return to the Search page and search for iv_client_with_bills (ensure you are searching in the Metadata). Open this view and navigate to the Data lineage tab and select the primary_phone field.

Data Marketplace - View details - Data lineage complex

We can now see the value of the Data lineage tab, where we can identify the lineage of the primary_phone field including the data source where this field comes from and all of the operations involved to transform the field.

In the next section we will explore the Assisted Query feature which lets you explain your needs in natural language via the Natural language query input.


This section explores the Assisted Query feature which lets users explain their data needs in natural language invoking external LLM services. From the previous tutorial, we have already configured this feature to use OpenAI's public API, so let's dive in.

Assisted Query for views

  1. Open again the bv_crm_client view and navigate to the Assisted Query tab
  2. In the Natural language query input, specify the following instruction:
  • Give me the name of clients with client type 01

Data Marketplace - Assisted query - Ask

  1. Click the Ask button
  2. Now you can see that a VQL SELECT query is generated based on the natural language input, with the corresponding query explanation

Data Marketplace - Assisted query - Review VQL query

  1. Click the Execute button to see the results of the query, this is great! We have used natural language for querying our views!

Data Marketplace - Assisted query - Execute generated query

Finally, let's explore how users can collaborate in the Data Marketplace to allow Data Stewards to better communicate with their business users.

In this section, we will review:

  • Endorsements
  • Warnings
  • Deprecation notes to views and web services
  • Request access to a data product

Endorsements

The endorsements are the comments added by users on a view or a web service to show their support. A user can only endorse a view or web service once, meaning, when a new comment is written, the previous endorsement will be replaced.

  1. To create endorsement, navigate to the Summary tab of the bv_crm_client view and click on the Action > Endorse option.

Data Marketplace - View details - Endorse

  1. In the Endorse dialog, provide the details which you would like other users to see. For example, add the details as follows:
    "This Client view is a key component of our model. It is associated with Address and Client Type views to give expanded information about each client."
  2. Click Ok to save the endorsement.

Data Marketplace - View details - Endorse wizard

  1. In the Summary tab, the Endorsements label displays the number of endorsements on this view and their authors.

Data Marketplace - View details - Endorsements

Warnings

Warnings are used to write and display the "advise against" messages on views and web services by users. A user can write only one warning against a view or web service.

  1. To create a warning message, go to the Summary tab of the Client view, click on Actions > Warn option.

Data Marketplace - View details - Warn

  1. In the Warn dialog, add the following warning information:
    "This view will be updated with delta records once in a week"
  2. Click Ok to save the warning message.

Data Marketplace - View details - Warn wizard

  1. In the Summary tab, the Warnings label displays the number of warnings on this view and their authors.

Data Marketplace - View details - Warnings

Deprecation

Deprecations are used for informing users that it is obsolete and should not be used anymore. A user can write only one deprecation about a view or web service.

  1. To deprecate a view, go to the Summary tab of the Client view, and Click on Actions > Deprecate option.

Data Marketplace - View details - Deprecate

  1. In the Deprecate dialog, we will add the following deprecation notes:
    "This view will be deprecated from next cycle. Users will be notified about the latest view by the end of this month."
  2. Click Ok to save the deprecation note.

Data Marketplace - View details - Deprecate wizard

  1. Next time when you open the view, you will see the ⚠ icon in the toolbar and a notification will pop up every time you click on the icon or access the view.

Data Marketplace - View details - Deprecation notification

Request Workflows

To provide workflow capabilities in the Data Marketplace, it has introduced some types of requests that will allow users to initiate cases on elements that will be attended by an administrator from a data stewardship perspective.

Most frequent types of requests: access requests, report data quality issues, request changes and ask general questions.

Let's review one of them, requesting access to a view.

Create access request

  1. Log in to the Data Marketplace with the user tutorial / denodo.
  2. Open the bv_crm_client view, and notice that there are only limited tabs available to this user. Right now, the user can only see the metadata of the view. The user does not have permission to execute and retrieve the data.

Data Marketplace - View details - Restricted user

  1. To request for an access, create a request by clicking on the Action > Request Access button

Data Marketplace - View details - Request access

  1. In the pop up window, specify the details of the request, and click Ok

Data Marketplace - View details - Request access wizard

Granting access request

  1. Log in to the Data Marketplace as an admin user. This user has permission to manage access requests and grant execute permissions to the view
  2. Navigate to the Administration > Request management page
  3. In this page, you can see all the requests that have been created, including the one we have created in the previous step.

Data Marketplace - Request Management

  1. Click the id of the request for bv_crm_client view, and change the status to In Progress, clicking on the Accept iconicon

Data Marketplace - Request Management wizard

  1. In the Messages tab, you can send a message to inform the requester that the request is being worked on.

Data Marketplace - Request Management - Add comment

  1. In the Design Studio, the admin user can now grant access to the bv_crm_client view for the restricted role

Design Studio - add privileges to a role

  1. Still as an admin user, log back into the Data Marketplace, and update the request in the Administration > Request management page

Data Marketplace - Request Management - Complete request

Data Marketplace - Request Management - Complete request adding a comment

  1. When you log back into the Data Marketplace as tutorial user, navigate to the bv_crm_client view, you can now see the Query tab and be able retrieve the data for this view.

Data Marketplace - View details - Query

GREAT! We have now seen how collaborative features help users in the Data Marketplace.

In this tutorial, we have only had a limited number of Views, Data Sources, Tags and Categories, but it is clear that through the use of the Data Marketplace, business users will be able to explore the companies data, easily and quickly, with minimal overhead on the IT team. We have also learnt about how the feature packs included in the Data Marketplace can be used and how it helps users in a collaborative environment.

Thanks!