IBM InfoSphere DataStage vs Talend Open Studio comparison

Cancel
You must select at least 2 products to compare!
IBM Logo
10,952 views|9,105 comparisons
82% willing to recommend
Talend Logo
12,417 views|9,443 comparisons
97% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Oct 30, 2022

We performed a comparison between IBM InfoSphere Datastage and Talend Open Studio based on our users’ reviews in five categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: Users report that the initial setup and deployment of both solutions is straightforward.
  • Features: Users of both products are happy with their stability and scalability.

    Users of IBM Infosphere DataStorage like that the solution is robust, very user friendly, and has good drag-and-drop features. Users feel the solution is lacking virtualization features, is a bit dated, and needs to focus more on cloud technologies.

    Talend Open Studio reviewers like the solution’s ETL tools, flexibility, and integration capabilities. Users mention, however, that the solution consumes a lot of memory.
  • Pricing: Users feel IBM Infosphere Data Storage is an expensive solution. Talend Open Studio reviewers share mixed reviews on the pricing.
  • Service and Support: Overall, users are satisfied with the service and support of both solutions.
  • ROI: IBM Infosphere DataStorage users do not mention ROI. Users of Talend Open Studio report a positive ROI.

Comparison Results: Users feel that IBM Infosphere DataStorage needs to have a stronger focus on cloud technologies to be a more considerable option in today’s marketplace. In addition, its reviewers do not mention an ROI, whereas Talend Open Studio users report a positive one. For these reasons, Talend Open Studio wins out in this comparison.

To learn more, read our detailed IBM InfoSphere DataStage vs. Talend Open Studio Report (Updated: May 2024).
770,428 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities.""The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable.""The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms""The Hierarchical Data Stage is good.""It works with multiple servers and offers high availability.""DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too.""It is quite useful and powerful.""Compared to other ETL tools, DataStage has excellent debugging and development capabilities. And the availability of connectors, even though we sometimes have to opt for specific ones. Also, the availability of patches is good."

More IBM InfoSphere DataStage Pros →

"The initial setup was quite straightforward. The deployment took between two and three days.""The API integration and big data approach are very good because of how you extract data from JSP files or big data web repositories like MongoDB.""The most interesting aspect of the solution for us is that Talend Open Studio has a good balance between the features and the cost of the data management platform.""Talend is safe to use because it is very restrictive. It is easy to use when one learns how to manipulate data with SQL.""The standout feature for me is the user-friendly nature of the components.""The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models.""The initial setup of the product was very easy.""The rapidity of integration with data may be one of the valuable features."

More Talend Open Studio Pros →

Cons
"It doesn't have any big data connections. It would be good to have them because most of the systems are moving towards big data. There should also be a user-friendly way to interact with the cloud. Its loading process is very slow. It takes a lot of time for around 5 or 6 million records, and we are not able to provide real-time data to the vendors due to this delay. Its performance needs to be improved. It is also like a legacy system. It is not updated much. In higher versions, they only do small changes. We would like to have new features and new technologies.""What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.""It would be great if they can include some basic version of data quality checking features.""I want the tool to continue with the on-prem version, not the cloud one.""The solution should be more user-friendly.""The troubleshooting guide is very bad.""The documentation and in-application help for this solution need to be improved, especially for new features.""The setup is extremely difficult."

More IBM InfoSphere DataStage Cons →

"Talend should improve the log and error handling to better track the errors you find during development. Sometimes it's challenging to see what's causing an issue, and tracking that on Talend is complicated.""The technical support and documentation need a lot of work to come up to standard.""It doesn't have the ability to keep the repository of the source code (visual pipeline). It can be integrated with Git.""The profiling perspective needs improvement. Instead of using it in the studio, we are using a different tool which is also provided by Talend. It's redundant.""It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them.""In terms of what can be improved, the scheduling is not there in the sister version, while it is there in the cloud one, which is a paid version. If all kinds of scheduling could be available on the Open Studio that we generally use and practice on, that would be great. The scheduling of the data migration is currently not available in the sister version of Talend Open Studio that we are working on. It is available in the advanced version of the Talend. This is the one thing that can be improvised.""We don't get continuous replication of the data.""I think my biggest problem with the tool is that the errors are very hard to debug."

More Talend Open Studio Cons →

Pricing and Cost Advice
  • "High-cost of ownership: They could take a page from open source software."
  • "Pricing varies based on use, and it is not as costly as some competing enterprise solutions."
  • "Small and medium-sized companies cannot afford to pay for this solution."
  • "The cost is too high."
  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."
  • More IBM InfoSphere DataStage Pricing and Cost Advice →

  • "Pricing and licensing are fairly straightforward. It is reasonably priced and managed."
  • "Talend is free and you can download it."
  • "The paid version of this solution has a very high price, but even with the limitations, the Community version works fine."
  • "Price could be lower. It is getting too expensive when compared to some other solutions, which is actually a little bit concerning."
  • "There are many versions available and one is open-sourced which is free."
  • "The cost for one year for the ETL tools, not for the big data, is 6K per year. It is a good price."
  • "It does the job well for nothing — without cost. That's the advantage of this product."
  • "Talend Open Studio is priced too high."
  • More Talend Open Studio Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
    770,428 professionals have used our research since 2012.
    Questions from the Community
    Top Answer: My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For the… more »
    Top Answer: I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work… more »
    Top Answer:IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands… more »
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The… more »
    Top Answer:The product’s pricing is reasonable. It has an annual subscription.
    Ranking
    7th
    out of 101 in Data Integration
    Views
    10,952
    Comparisons
    9,105
    Reviews
    16
    Average Words per Review
    467
    Rating
    7.9
    5th
    out of 101 in Data Integration
    Views
    12,417
    Comparisons
    9,443
    Reviews
    15
    Average Words per Review
    559
    Rating
    7.9
    Comparisons
    Also Known As
    Open Studio
    Learn More
    Overview

    IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.

    The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool works with various types of patterns - extract, transform and load (ETL), and extract, load, and transform (ELT). The scalability of the platform is achieved by using parallel processing and enterprise connectivity.

    The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:

    • Designing data flows to extract information from multiple sources, transform the data, and deliver it to target databases or applications.

    • Delivery of relevant and accurate data through direct connections to enterprise applications.

    • Reduction of development time and improvement of consistency through prebuilt functions.

    • Utilization of InfoSphere Information Server tools for accelerating the project delivery cycle.

    IBM InfoSphere DataStage can be deployed in various ways, including:

    • As a service: The tool can be accessed from a subscription model, where its capabilities are a part of IBM DataStage on IBM Cloud Park for Data as a Service. This option offers full management on IBM Cloud.

    • On premises or in any cloud: The two editions - IBM DataStage Enterprise and IBM DataStage Enterprise Plus - can run workloads on premises or in any cloud when added to IBM DataStage on IBM Cloud Pak for Data as a Service.

    • On premises: The basic jobs of the tool can be run on premises using IBM DataStage.

    IBM InfoSphere DataStage Features

    The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:

    • AI services: The tool offers services such as data science, event messaging, data warehousing, and data virtualization. It accelerates processes through artificial intelligence (AI) and offers a connection with IBM Cloud Paks - the cloud-native insight platform of the solution.

    • Parallel engine: Through this feature, ETL performance can be optimized to process data at scale. This is achieved through parallel engine and load balancing, which maximizes throughput.

    • Metadata support: This feature of the product uses the IBM Watson Knowledge Catalog to protect companies' sensitive data and monitor who can access it and at what levels.

    • Automated delivery pipelines: IBM InfoSphere DataStage reduces costs by automating continuous integration and delivery of pipelines.

    • Prebuilt connectors: The feature for prebuilt connectivity and stages allows users to move data between multiple cloud sources and data warehouses, including IBM native products.

    • IBM DataStage Flow Designer: This feature offers assistance through machine learning design. The product offers its clients a user-friendly interface which facilitates the work process.

    • IBM InfoSphere QualityStage: The tool provides a feature that automatically resolves data quality issues and increases the reliability of the delivered data.

    • Automated failure detection: Through this feature, companies can reduce infrastructure management efforts, relying on the automated detection that the tool offers.

    • Distributed data processing: Cloud runtimes can be executed remotely through this feature while maintaining its sovereignty and decreasing costs.

    IBM InfoSphere DataStage Benefits

    This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:

    • Increased speed of workload execution due to better balancing and a parallel engine.

    • Reduction of data movement costs through integrations and seamless design of jobs.

    • Modernization of data integration by extending the capabilities of companies' data.

    • Delivery of reliable data through IBM Cloud Pak for Data.

    • Utilization of a drag-and-drop interface which assists in the delivery of data without the need for code.

    • Effective data manipulation allows data to be merged before being mapped and transformed.

    • Creating easier access of users to their data by providing visual maps of the process and the delivered data.

    Reviews from Real Users

    A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.

    Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.

    Talend Open Studio is a free, open source ETL tool for data integration and Big Data. The solution enables you to extract diverse datasets and normalize and transform them into a consistent format which can be loaded into a number of third-party databases and applications.

    Talend Open Studio Features

    Talend Open Studio has many valuable key features. Some of the most useful ones include:

    • Automatic identification of data types and potential errors
    • tMap module
    • Graphical conversion tools
    • Charts
    • Database SCD Tools
    • Business intelligence formats (Jasper, OLAP, SPSS, Splunk)
    • ETL and ELT support
    • Eclipse-based development tooling
    • Versioning
    • Large library of connectors
    • Data flow orchestration
    • File management without scripting
    • Data transformations

    Talend Open Studio Benefits

    There are several benefits to implementing Talend Open Studio. Some of the biggest advantages the solution offers include:

    • Reduces the time taken to develop the integration.
    • Provides a wide selection of source and target connectors.
    • Monitor and manage problematic deployments with ease.
    • Allows developers to have the lowest cost of ownership for any solution.
    • Improves collaboration between different teams who need access to data.
    • Automated data integration process synchronizes the data and eases real time and periodic reporting, which would be time-consuming if done manually.
    • Achieve better data quality because data matures and improves over time.

    Reviews from Real Users

    Below are some reviews and helpful feedback written by PeerSpot users currently using the Talend Open Studio solution.

    Elio B., Data Integration Specialist/CTO at Asset messages, says, "The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily."

    A Practice Head, Analytics at a tech services company mentions, “The data integration aspect of the solution is excellent. The product's data preparation features are very good. There's very useful data stewardship within the product. From a technical standpoint, the solution itself is pretty good. There are very good pre-built connectors in Talend, which is good for many clients or businesses, as, in most cases, companies are dealing with multiple data sources from multiple technologies. That is where a tool like Talend is extremely helpful.”

    Prerna T., Senior System Executive at a tech services company, comments, “The best thing I have found with Talend Open Studio is their major support for the lookups. With Salesforce, when we want to relate our child objects to their parent object, we need to create them via IDs. Then the upsert operation, which will allow you to relate a child object to the event, will have an external ID. That is the best thing which keeps it very sorted. I like that.”

    An Implementation Specialist, Individual Contributor at a computer software company, states, “I can connect with different databases such as Oracle Database or SQL Server. It allows you to extract the data from one database to another. I can structure the data by filtering and mapping the fields.” He also adds, “It is very user-friendly. You need to know the basics of SQL development or SQL queries, and you can use this tool.”

    PeerSpot user Badrakh V., Information System Architect at Astvision, explains, "The most valuable features are the ETL tools."

    Sample Customers
    Dubai Statistics Center, Etisalat Egypt
    Almerys, BF&M, Findus
    Top Industries
    REVIEWERS
    Computer Software Company50%
    Insurance Company14%
    Transportation Company7%
    Healthcare Company7%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Manufacturing Company11%
    Computer Software Company10%
    Insurance Company7%
    REVIEWERS
    Computer Software Company29%
    Insurance Company12%
    Financial Services Firm12%
    University12%
    VISITORS READING REVIEWS
    Computer Software Company16%
    Financial Services Firm14%
    Manufacturing Company7%
    Government7%
    Company Size
    REVIEWERS
    Small Business45%
    Midsize Enterprise6%
    Large Enterprise49%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise9%
    Large Enterprise75%
    REVIEWERS
    Small Business43%
    Midsize Enterprise24%
    Large Enterprise33%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise15%
    Large Enterprise65%
    Buyer's Guide
    IBM InfoSphere DataStage vs. Talend Open Studio
    May 2024
    Find out what your peers are saying about IBM InfoSphere DataStage vs. Talend Open Studio and other solutions. Updated: May 2024.
    770,428 professionals have used our research since 2012.

    IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews while Talend Open Studio is ranked 5th in Data Integration with 47 reviews. IBM InfoSphere DataStage is rated 7.8, while Talend Open Studio is rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". On the other hand, the top reviewer of Talend Open Studio writes "An open-source ETL tool, when deployed on-premises, requiring an easy installation phase". IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Informatica PowerCenter and IBM InfoSphere Information Server, whereas Talend Open Studio is most compared with SSIS, Talend Data Fabric, Talend Data Management Platform, AWS Glue and Informatica PowerCenter. See our IBM InfoSphere DataStage vs. Talend Open Studio report.

    See our list of best Data Integration vendors.

    We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.