How to integrate Amazon S3 with the Denodo Platform
Last modified on: 28 Oct 2019
You can translate the document:
There are two main options to access Amazon S3 from Denodo:
- Using the Denodo Distributed File System Custom Wrapper available at the Denodo Support Site we are able to access delimited files, as well as non-standard files (Avro, Map files, Sequence files) stored in HDFS and AWS S3. This option allows to access delimited files (CSV), Avro, Map files, Sequence files and Parquet files. In the case of Parquet Files, the wrapper is able to push down predicate evaluations and column projections in order to reduce the amount of data needed to be transferred to the Denodo server in scenarios where data is being filtered. Any other file format stored in S3, such as JSON or XML can’t be accessed through these custom wrapper. When possible this is the recommended option, for more information check How to integrate Amazon S3 with Denodo Distributed File System Custom Wrapper.
- Using the Amazon S3 REST API. This option allows to access file types not supported by the HDFS Custom Wrappers, such as JSON, XML or Excel. On the other hand, this method is more manual as Amazon S3 REST API has a proprietary authentication schema. For more information check How to connect to the Amazon S3 REST API from Denodo.