Pentaho Data Integration (PDI)

Pentaho Data Integration is a powerful, metadata-driven ETL tool designed to bridge the gap between business and IT, turning your company’s data into increased profits.

Which features does Pentaho Data Integration support?

If you want to know how Pentaho Data Integration (PDI) scores on different selection criteria and exactly which functionality it provides, we did all the work for you. Vendors will often tell you only the strengths of their products. Our vendor-neutral survey reveals all the features, including the weak points.

Review of Pentaho Data Integration in our survey (PARTIAL results)
  • Basic functionality
  • Connectivity
  • Future prospects


"Pentaho also provides in-Hadoop execution. We see this as the beginning of a trend, where the “L” in ETL will lose significance. It is making big steps in terms of product maturity and also has a respectable user base." Read more in our 100% vendor neutral ETL Tools & Data Integration Survey.

Information from the vendor

Open source product

Pentaho Data Integration is an open source product that has improved significantly in recent years. Because we now consider it a mature product, it has been added to the Business Intelligence Integrated category. Kettle/Pentaho Data Integration is an open source ETL product that is free to download, install and use. It is therefore impossible to know how many customers or installations there are.

It also offers a community edition

Pentaho is not expensive, and also offers a community edition which can be downloaded free of charge.

Whole series of available plug-ins

Pentaho has fostered an active development community that supplies Pentaho with a whole series of available plug-ins, some free, some commercial products. This also means that some of the functionality isn’t in the basic product but relies on relationships with third parties.

Does the ETL tool support “Changed Data Capture”?

Yes, via triggers and via the Merge Rows step, which compares an incoming set of records with a reference set and flags each record as ‘insert’, ‘change’, ‘delete’ or ‘unchanged’.
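To make the flagging idea concrete, here is a minimal Python sketch of a merge-and-flag comparison. It is not Pentaho's implementation: the function name `merge_rows`, the dictionary-based lookup and the sample records are all assumptions made for illustration, and the flag names simply follow the wording used above.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Dict[str, object]


def merge_rows(
    reference: Iterable[Row],
    incoming: Iterable[Row],
    key: str,
    compare_fields: List[str],
) -> Iterator[Tuple[str, Row]]:
    """Yield (flag, row) pairs describing how `incoming` differs from `reference`.

    Hypothetical helper for illustration only; flag names follow the text above.
    """
    # Index the reference rows by key so each incoming row can be looked up.
    ref_by_key = {row[key]: row for row in reference}
    seen = set()

    for row in incoming:
        k = row[key]
        seen.add(k)
        old = ref_by_key.get(k)
        if old is None:
            yield "insert", row        # key not present in the reference set
        elif any(old.get(f) != row.get(f) for f in compare_fields):
            yield "change", row        # key present, a compared field differs
        else:
            yield "unchanged", row     # key present, all compared fields equal

    # Reference rows that never appeared in the incoming set were deleted.
    for k, old in ref_by_key.items():
        if k not in seen:
            yield "delete", old


# Made-up sample data for the sketch.
reference = [
    {"id": 1, "name": "Ann"},
    {"id": 2, "name": "Bob"},
    {"id": 3, "name": "Cy"},
]
incoming = [
    {"id": 1, "name": "Ann"},
    {"id": 2, "name": "Robert"},
    {"id": 4, "name": "Dee"},
]

flags = [(flag, row["id"]) for flag, row in merge_rows(reference, incoming, "id", ["name"])]
print(flags)
```

A downstream load step could then route each record by its flag, for example skipping ‘unchanged’ rows and applying only inserts, changes and deletes to the target table.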

Is there functionality available within the ETL process to check the data quality using external products or, for example, fuzzy logic?

Yes, by calling DB, SQL or Java procedures, or by using the JavaScript component which contains various data quality functions.
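As a rough illustration of what a fuzzy data quality check can look like, the Python sketch below flags values that are nearly, but not exactly, identical. It uses `difflib.SequenceMatcher` from the Python standard library as a stand-in; it does not reproduce any Pentaho function, and the helper names, the 0.85 threshold and the sample names are assumptions.

```python
from difflib import SequenceMatcher
from typing import List, Tuple


def similarity(a: str, b: str) -> float:
    """Return a similarity score between 0.0 and 1.0, ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def find_suspect_duplicates(
    values: List[str], threshold: float = 0.85
) -> List[Tuple[str, str, float]]:
    """Return pairs of values whose similarity meets the (assumed) threshold."""
    suspects = []
    for i, left in enumerate(values):
        # Compare each value only with the values after it, so every pair is checked once.
        for right in values[i + 1:]:
            score = similarity(left, right)
            if score >= threshold:
                suspects.append((left, right, round(score, 2)))
    return suspects


# Made-up sample data for the sketch.
customers = ["Jon Smith", "John Smith", "Alice Jones"]
suspects = find_suspect_duplicates(customers)
print(suspects)
```

The pairwise comparison is quadratic in the number of values, so a check like this is better suited to small reference lists or pre-grouped records than to a full fact table.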


“Pentaho is one of the few products that run on a Mac”

Rick van der Linden, senior analyst and author of the ETL Tools & Data Integration Survey, said about Pentaho Data Integration (PDI): “Pentaho is one of the few products that run on a Mac. Pentaho also prides itself on its deep Hadoop integration.” Find out more and order the ETL Tools & Data Integration Survey 2015 now.

100% vendor independent research

In the ETL Tools & Data Integration Survey 2015 you’ll find the list of ETL tools on the market, with an expert review of each ETL solution, many comparison graphs and a comparison matrix covering all the features. It also includes a thorough, 100% vendor-independent evaluation of Pentaho Data Integration and all the major ETL platforms.
