Data Integration Tools 2015
The data integration tools listed here are broken down into platforms that are part of a larger suite of products, platforms that are independent and those that are open source. Some products are listed in their open source community editions and the enhanced, supported editions.
Data Integration as Part of Larger Product Suite
Actian DataConnect is part of the Actian big data analytics platform and delivers sophisticated, highly scalable data integration, profiling and matching capabilities that excel in big data environments. With Actian DataConnect you can integrate, migrate, sync, validate, standardize or enrich your data while maintaining data quality in every facet of your business. This includes:
- Streamline data integration using a convenient browser-based UI and visual design process and drag & drop link-style mapper – with no need to maintain custom-coding
- Monitoring integration server and execution status with message-driven integrations
- Deploying anywhere – with on-premise, cloud or hybrid options Scaling to any volume of data and connect to any endpoint Taking advantage of end-to-end lifecycle management
IBM’s data integration solutions enable you to understand, cleanse, monitor, transform and deliver data, as well as to collaborate to bridge the gap between business and IT. IBM provides capabilities for delivering data in real time to business applications, whether through bulk (extract, transform, load (ETL)), virtual (federated) or incremental (change data capture) data delivery.
Information Builders iWay Integration Suite is part of an extensive BI and analytics platform. The iWay Integration Suite allows for direct access to all of your data, so you can design your architecture to address the unique information needs of all your users. iWay accelerates the deployment and reduces the risk of all types of data integration projects – including extract, transform, and load (ETL); enterprise information integration (EII) initiatives; and web services deployments.
- End-to-end integration of a wide variety of sources, including cloud-based information, social systems, and big data
- Support for real-time and batch integration
- Flexible extract, transform, and load (ETL) and message-based styles of integration
Oracle Data Integration is part of the broader Oracle range of products and delivers pervasive and continuous access to timely and trusted data across heterogeneous systems. Its comprehensive capabilities include real-time and bulk data movement, transformation, bi-directional replication, metadata management, data services, and data quality for customer and product domains.
Pentaho data integration is part of the Pentaho BI suite of products and prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, “analytics ready” data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.
QlikView Expressor is metadata management “the QlikView way” — a disruptive approach to data management. It is simple and descriptive, not complex and prescriptive. Consistently capture and manage metadata as you build analytic apps, rather than be locked into a semantic layer up front.
SAS Data Integration Studio provides a powerful visual design tool for building, implementing and managing data integration processes regardless of data sources, applications, or platforms. An easy-to-manage, multiple-user environment enables collaboration on large enterprise projects with repeatable processes that are easily shared. The creation and management of data and metadata are improved with extensive impact analysis of potential changes made across all data integration processes. It enables users to quickly build and edit data integration, to automatically capture and manage standardized metadata from any source, and to easily display, visualize, and understand enterprise metadata and your data integration processes, and is a component in a number of SAS software offerings, including SAS Data Management Advanced
Independent Data Integration Products
Adeptia Suite covers data integration, application integration, B2B integration and BPM. Adeptia ETL Suite is a graphical, easy to use software that supports ANY TO ANY conversion. It consists of three distinct components. It has a web-based “Design Studio” that provides wizard-driven, graphical ability to document data rules as they relate to validations, mapping and edits. This tool includes a library of functions which can be pre-created and reused again and again. Data Mapper has a “preview” capability to see actual source and target data, while the rules are being specified, if the source data file is available. The second component is the “Central Repository” where all the rules and mapping objects are saved. The third component is the “Run-time Execution Engine” where the mapping rules and data flow transactions are executed on incoming data files and messages.
Apatar provides connectivity to many popular applications and data sources (Oracle, MS SQL, MySQL, Sybase, DB2, MS Access, PostgreSQL, XML, InstantDB, Paradox, BorlandJDataStore, Csv, MS Excel, Qed, HSQL, Compiere ERP, SalesForce.Com, SugarCRM, Goldmine, any JDBC data sources and more). Supports bi-directional integration, is platform independent and can be used without coding via the Visual Job Designer. An on-demand version supports Salesforce and QuickBooks.
Centerprise Data Integrator provides a powerful, scalable, high-performance, and affordable integration platform designed for ease and is robust enough to deal with complex data integration challenges. The complex data mapping capabilities make it a good platform for overcoming the challenges of complex hierarchical structures such as XML, electronic data interchange (EDI), web services, and more. The expanding library of Centerprise Connectors is preconfigured to provide a plethora of integration options, enabling high-speed integration and migration to quickly and easily integrate with, or migrate to, leading enterprise CRM and ERP applications, as well as connectors for SOAP and REST web services that can be used to connect to a wide range of web services, including search engines and social media platforms .
CloverETL product family comes in the free community edition with core functionality and three paid for versions that incrementally include more connectors, scheduling and automation, and parallel processing and big data support.
Elixir Data ETL is designed to provide on-demand, self-serviced data manipulation for business users as well as for enterprise level data processing needs. Its visual-modeling paradigm drastically reduces the time required to design, test and implement data extraction, aggregation and transformation – a critical process for any application processing, enterprise reporting and performance measurement, data mart or data warehousing initiatives. Ready for web-based deployment, Elixir Data ETL allows business users to quickly obtain the critical information for their business decisions and operational needs, freeing up the IT group to focus on enterprise level IT issues.
Informatica’s family of enterprise data integration products access and integrate data from any business system, in any format, and deliver that data throughout the enterprise at scale and at any speed. Powered by Vibe™, these enterprise data integration products enable your IT organization to scale with your business needs, dramatically lower costs, boost productivity, and reduce risk. At the same time, they enable business-IT collaboration and co-development to deliver on business demands for timely, relevant, trustworthy data. Informatica PowerCenter caters for highly scalable, high-performance enterprise data integration software. By leveraging Vibe, it serves as the foundation for all data integration projects – from departmental and project-based work, to enterprise integration initiatives, and beyond, for Integration Competency Centers (ICC). Informatica PowerCenter promotes reuse, flexibility, and consistent deployment of data integration best practices and empowers your IT organization to implement a single data integration approach without having to resort to hand coding.
Talend’s data integration products provide an extensible, highly-performant, open source set of tools to access, transform and integrate data from any business system in real time or batch to meet both operational and analytical data integration needs. With 800+ connectors, it integrates almost any data source. The broad range of use cases addressed include: massive scale integration (big data/ NoSQL), ETL for business intelligence and data warehousing, data synchronization, data migration, data sharing, and data services.
Syncsort products cover three main areas of functionality:
- DMX is full-featured data integration software that helps organizations extract, transform and load more data in less time
- DMX-h offers a unique approach to Hadoop Sort and Hadoop ETL, that lowers the barriers for wider adoption, helping organizations unleash the full potential of Hadoop. Eliminate the need for custom code, get smarter connectivity to all your data, and improve Hadoop’s processing efficiency.
- Syncsort MFX delivers the fastest and most resource-efficient mainframe sort, copy, join technology available, and is the only mainframe sort solution that offloads CPU cycles to zIIP engines.
Open Source
Apatar provides connectivity to many popular applications and data sources (Oracle, MS SQL, MySQL, Sybase, DB2, MS Access, PostgreSQL, XML, InstantDB, Paradox, BorlandJDataStore, Csv, MS Excel, Qed, HSQL, Compiere ERP, SalesForce.Com, SugarCRM, Goldmine, any JDBC data sources and more). Supports bi-directional integration, is platform independent and can be used without coding via the Visual Job Designer. An on-demand version supports Salesforce and QuickBooks.
Clover Editions are built on an Open Source Engine. The engine is a Java library and does not come with any User Interface components such as a Graph Designer. The developer / embedding application is responsible for managing graphs rather than using the Clover Designer or Server UI. However, your application does have access to most of powerful data transformation and ETL features that Clover uses throughout its own product range. The CloverETL Open Source Engine can be embedded in any application, commercial ones as well.
Jaspersoft ETL is easy to deploy and out-performs many proprietary and open source ETL systems. It is used to extract data from your transactional system to create a consolidated data warehouse or data mart for reporting and analysis.
KETL™ is a premier, open source ETL tool. The data integration platform is built with portable, java-based architecture and open, XML-based configuration and job language. KETL™ features successfully compete with major commercial products available today. Highlights include:
- Support for integration of security and data management tools
- Proven scalability across multiple servers and CPU’s and any volume of data
- No additional need for third party schedule, dependency, and notification tools
Pentaho’s Data Integration, also known as Kettle, delivers powerful extraction, transformation, and loading (ETL) capabilities. You can use this stand-alone application to visually design transforms and jobs that extract your existing data and make it available for easy reporting and analysis.
Talend Open Studio is a powerful and versatile set of open source products for developing, testing, deploying and administrating data management and application integration projects. Talend delivers the only unified platform that makes data management and application integration easier by providing a unified environment for managing the entire lifecycle across enterprise boundaries. Developers achieve vast productivity gains through an easy-to-use, Eclipse-based graphical environment that combines data integration, data quality, MDM, application integration and big data.