Webvar
Pentaho Data Integration - logo

Pentaho Data Integration

Ingest, Blend, Cleanse and Prepare Diverse Data From Any Source, in Any Environment - With No Code.
awsPurchase this listing from Webvar in AWS Marketplace using your AWS account. In AWS Marketplace, you can quickly launch pre-configured software with just a few clicks. AWS handles billing and payments, and charges on your AWS bill.

About

For Private Offer Pricing, please contact:PrivateOfferPricing@pentaho.com

Datasheet:Pentaho Data Integration

With Pentaho Data Integration - Managing the enormous volumes, variety, and velocity of data is simplified

By allowing data preparation from any source and automating your data pipeline, Pentaho Data Integration allows you to curate data better for your business user. This software delivers business analytics to end users faster with visual tools that reduce time and complexity - without writing SQL or coding in Java or Python. Organizations immediately gain real value from their various data sources in the cloud or on premises, including files, relational databases, big data sets and more.

Turn Data Into Actionable Insights

More than just ETL (Extract, Transform, Load), Pentaho Data Integration is a codeless data orchestration tool that blends diverse data sets into a single source of truth as a basis for analysis and reporting. Effortlessly managed in a drag-and-drop graphical interface, so you can easily track where it's coming from, where it's going and how it's transforming.

Data Processing Performance and Productivity

PDI speeds performance time, reduces the complexity of integrating big data sources, and provides:

Code-free data transformation

Template-based approach to rapidly onboard data sources into Hadoop

Scalability, Simplicity, and Self-Service

With broad connectivity to any data type and high-performance Spark and MapReduce execution, PDI simplifies and speeds the process of integrating existing databases with new sources of data.

Intuitive, drag-and-drop designer

Rich library of prebuilt components

Powerful orchestration capabilities

Integration and Extensibility

API Integration: Comprehensive REST and SOAP APIs

Plugin Architecture: Extend capabilities with a rich plugin ecosystem

Third-Party Tool Integration: BI tools, databases, etc

Broad Connectivity and Data Delivery

PDI offers broad connectivity to a variety of diverse data, including structured, unstructured and semi-structured data.

Relational database management system (RDBMS): Oracle, IBM DB2, MySQL, Microsoft SQL Server, Postgres, IBM MQ

Spark and Hadoop: Cloudera, Hortonworks, Amazon EMR, MapR (HPE Ezmeral Data Fabric), Microsoft Azure HDInsights, and Elastic Search

NoSQL databases and object stores: MongoDB, Cassandra, HBase, Hitachi Content Platform, AWS S3, Google Cloud Storage, Microsoft Azure ADLS Gen 2

Analytic databases: Redshift, Snowflake, Vertica, Greenplum, Teradata, SAP HANA, Amazon Redshift, Google Big Query

Business applications: SAP, Salesforce, Google Analytics

Files: XML, JSON, Microsoft Excel, CSV, txt, Avro, Parquet, ORC, EBCDIC (mainframe), unstructured files with metadata, including audio, video and visual files

Related Products

How it works?

Search

Search 25000+ products and services vetted by AWS.

Request private offer

Our team will send you an offer link to view.

Purchase

Accept the offer in your AWS account, and start using the software.

Manage

All your transactions will be consolidated into one bill in AWS.

Create Your Marketplace with Webvar!

Launch your marketplace effortlessly with our solutions. Optimize sales processes and expand your reach with our platform.