Data Virtualization Basics

Folders, data sources, combination and execution

Welcome to these Basic Tutorials about the Denodo Platform. Here you will find a complete set of lessons that describe briefly the main topics you need to know when you start your Data Virtualization project.

All new technologies require some kind of coaching before starting to use them at your own. To demonstrate the features of the Denodo Data Virtualization software we are going to play with several sample data sources included in the Tutorial files (you can adapt them to your specific needs if you want).

Problem to solve

Before starting your project, first, you have to stop and understand the problem. Let's suppose we are facing the following scenario:

  • You are working in a company that has the information about its customers (CRM) stored in a MySQL database.
  • The complete billing information for any customer is exposed in an internal Web Service.
  • End users do not want to use different applications to get available information of customers (CRM, Sales application, ...).
  • IT department does not like the idea of create a specific application for this business need. IT wants to reuse this customer global view, if possible, in any other current application.

We said in the previous paragraph: "End users do not want to use different applications to get all the information of customers", so the problem here is to connect disparate data to create a single view of the customer. And that single view has to be reusable (it is an IT requisite).

If you follow a bad design, not reusing components, creating ad-hod code/applications... surely your systems will grow as in the image below:

Why Data Virtualization

The main idea is to have a single virtual layer to connect disparate data and to provide unified access to consuming applications. These applications will use the semantic components defined in the virtual layer reusing them when needed. This way your applications will be independent from the physical sources where the data is stored.

At the end of the Tutorial you will notice that:

  • It is easy to generate Data Services.
  • Your Data Services are independent of the physical source.
  • You have a single point to control your data sources.
  • Small development cycles.
  • No code is required!
  • You started solving a simple need (refined generating new versions quickly)
  • Your model can be reused by all clients
  • ...

Before starting the Tutorial, be sure you have all the necessary materials. Check the Installation & Bootstrapping section.

If you already have installed all the components, please, go ahead to the First Steps section.