We compared IBM InfoSphere DataStage and IBM Cloud Pak for Data based on our users' reviews across several parameters.
IBM InfoSphere DataStage is praised for its strong data integration, connectors, workflow management, ETL functionalities, and data quality controls. In contrast, IBM Cloud Pak for Data is commended for its analytics capabilities, user interface, data management tools, integration, scalability, governance, security, collaboration, and AI-driven features. Feedback on customer service, setup duration, pricing, and ROI varies between the two products.
Features: IBM InfoSphere DataStage is praised for its strong data integration capabilities, comprehensive set of connectors, efficient workflow management, and robust ETL functionalities. On the other hand, IBM Cloud Pak for Data is valued for its robust analytics capabilities, ease of use, comprehensive data management tools, seamless integration, and advanced data governance and security features. It also offers AI-driven capabilities like machine learning and predictive analytics.
Pricing and ROI: The available data provides no information on the setup cost of IBM InfoSphere DataStage, nor on the pricing and licensing of IBM Cloud Pak for Data. Likewise, there is no data from which to determine the ROI of either product.
Room for Improvement: IBM InfoSphere DataStage does not have specific areas for improvement identified in the available responses. Similarly, there is no specific feedback or review available for IBM Cloud Pak for Data on what needs improvement.
Deployment and customer support: The available summaries include no feedback on how long it takes to set up either IBM InfoSphere DataStage or IBM Cloud Pak for Data, so deployment time cannot be compared. Likewise, the reviews provide too little feedback to summarize the customer service and support of either product.
The summary above is based on 24 interviews we conducted recently with IBM InfoSphere DataStage and IBM Cloud Pak for Data users. To access the full transcripts of the reviews, download our report.
"Scalability-wise, I rate the solution a nine or ten out of ten."
"It is a scalable solution, and we have had no issues with its scalability in our company. I rate the solution's scalability a nine out of ten."
"Its data preparation capabilities are highly valuable."
"The most valuable feature of IBM Cloud Pak for Data is the Modeler flows. The ability to develop models using a graphical approach and the capability to connect to various sources, as well as the data virtualization capabilities, allow me to easily access and utilize data that is dispersed across different sources."
"You can model the data there, connect the data models with the business processes and create data lineage processes."
"The most valuable features are data virtualization and reporting."
"What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds. That's an advantage of IBM Cloud Pak for Data."
"DataStage allows me to connect to different data sources."
"The product is easy to deploy."
"As a data integration platform, it is easy to use. It is quite robust and useful for volumetric analysis when you have huge volumes of data. We have tested it for up to ten million rows, and it is robust enough to process ten million rows internally with its parallel processing. Its error logging mechanism is far simpler and easier to understand than other data integration tools. The newer version of InfoSphere has the data catalog and IDC lineage. They are helpful in the easy traceability of columns and tables."
"We can view what we want to do. We can transform data and put them on tables."
"DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too."
"Once you have Infosphere up and running properly, it is stable."
"The most valuable feature of the solution is the ability to incorporate very complex business rules in Data Stage."
"The most valuable feature is the product's versatility to inject data."
"The solution's scalability is really good...we are using multi-instance jobs where you can scale them easily."
"The solution could have more connectors."
"The tool depends on the control plane, an OpenShift container platform utilized as an orchestration layer...So, we have communicated this issue to IBM and asked if it is feasible to adapt the solution to work on a Kubernetes platform that we support."
"The product must improve its performance."
"Cloud Pak would be improved with integration with cloud service providers like Cloudera."
"The technical support could be a little better."
"One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output. Too many changes have been made, and my company has around one hundred thousand mappings, so my team has been putting more effort into alternative ways to do things. Another area for improvement in IBM Cloud Pak for Data is that it's more complicated to shift from on-premise to the cloud. Other vendors provide secure agents that easily connect with your existing setup. Still, with IBM Cloud Pak for Data, you have to perform connection migration steps, upgrade to the latest version, etc., which makes it more complicated, especially as my company has XML-based mappings. Still, the XML input and output capabilities of IBM Cloud Pak for Data have been discontinued, so I'd like IBM to bring that back."
"The solution's user experience is an area that has room for improvement."
"The interface could improve because sometimes it becomes slow. Sometimes there is a delay between clicks when using the software, which can make the development process slow. It can take a few seconds to complete one action, and then a few more seconds to do the next one."
"The interface needs improvement."
"The troubleshooting guide is very bad."
"The graphical user interface (GUI) feels a lot like the interfaces from the 1980s."
"I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers."
"There are three things that could improve - the cloud, monitoring and cloud integration. It's a solid product but not a modern one and of course it depends what you're looking for."
"Improvements for DataStage could include better integration with modern data sources like cloud solutions and documents, along with enhancing its capability to handle non-structured data."
"The interface needs work to be more user-friendly."
"So, there are some features that are missing. If I compare DataStage to Talend, Talend allows you to write custom code in Java or use these tools in your applications as well if you are building a job application. But in DataStage, it does not allow you to write custom code for any component."
IBM Cloud Pak for Data is ranked 17th in Data Integration with 11 reviews while IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews. IBM Cloud Pak for Data is rated 8.0, while IBM InfoSphere DataStage is rated 7.8. The top reviewer of IBM Cloud Pak for Data writes "A scalable data analytics and digital transformation tool that provides useful features and integrations". On the other hand, the top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". IBM Cloud Pak for Data is most compared with Azure Data Factory, Informatica Cloud Data Integration, Palantir Foundry, Denodo and IBM InfoSphere Information Server, whereas IBM InfoSphere DataStage is most compared with SSIS, Azure Data Factory, Talend Open Studio, Informatica PowerCenter and IBM InfoSphere Information Server. See our IBM Cloud Pak for Data vs. IBM InfoSphere DataStage report.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.