Cloudera Data Science Workbench vs KNIME comparison

Cloudera Data Science Workbench
2,070 views | 1,837 comparisons
66% willing to recommend
KNIME
10,966 views | 7,554 comparisons
93% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cloudera Data Science Workbench and KNIME based on real PeerSpot user reviews.

Find out what your peers are saying about Databricks, Microsoft, Alteryx and others in Data Science Platforms.
To learn more, read our detailed Data Science Platforms Report (Updated: April 2024).
771,157 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
Cloudera Data Science Workbench:
  • "I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
  • "The Cloudera Data Science Workbench is customizable and easy to use."

KNIME:
  • "We have been able to appreciate the considerable reduction in prototyping time."
  • "Clear view of the data at every step of ETL process enables changing the flow as needed."
  • "Stability is excellent. I would give it a nine out of ten."
  • "Since KNIME is a no-code platform, it is easy to work with."
  • "The product is very easy to understand even for non-analytical stakeholders. Sometimes we provide them with KNIME workflows and teach them how to run it on their own machine."
  • "It is very fast to develop solutions."
  • "It has allowed us to easily implement advanced analytics into various processes."
  • "I was able to apply basic algorithms through just dragging and dropping."

Cons
Cloudera Data Science Workbench:
  • "The tool's MLOps is not good. It's pricing also needs to improve."
  • "Running this solution requires a minimum of 12GB to 16GB of RAM."

KNIME:
  • "The solution is inconvenient when it comes to wrangling data that includes multiple steps or features because each step or feature requires its own icon."
  • "Though I can use KNIME in a 64-bit platform in the lab, it's missing some features. For example, from my laptop, I can use the image reader feature of KNIME. However, in the lab, the image reader node is missing."
  • "Data visualization needs improvement."
  • "KNIME could improve when it comes to large data markets."
  • "I'd like something that would make it easier to connect/parse websites, although I will fully admit that I'm not as proficient in KNIME as I would like to be, so it could be I'm just missing something."
  • "It could input more data acquisitions from other sources and it is difficult to combine with Python."
  • "The overall user experience feels unpolished. In particular: Data field type conversion is a real hassle, and date fields are a hassle; documentation is pretty poor; user community is average at best."
  • "It's pretty straightforward to understand. So, if you understand what the pipeline is, you can use the drag-and-drop functionality without much training. Doing the same thing in Python requires so much more training. That's why I use KNIME."

Pricing and Cost Advice
KNIME:
  • "It is free of cost. It is GNU licensed."
  • "KNIME desktop is free, which is great for analytics teams. Server is well priced, depending on how much support is required."
  • "KNIME is free as a stand-alone desktop-based platform but if you want to get a KNIME server then you can find the cost on their website."
  • "The price of KNIME is quite reasonable and the designer tool can be used free of charge."
  • "It's an open-source solution."
  • "The price for Knime is okay."
  • "At this time, I am using the free version of Knime."
  • "This is an open-source solution that is free to use."

    Questions from the Community
    Top Answer: I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to…
    Top Answer: The tool's MLOps is not good. It's pricing also needs to improve.
    Top Answer: We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in our…
    Top Answer: Since KNIME is a no-code platform, it is easy to work with.
    Top Answer: We're using the free academic license just locally. I went for KNIME because they have a free academic license. And to be honest, I never bothered to check the prices.
    Top Answer: KNIME is not good at visualization. I would like to see NLQ (Natural language query) and automated visualizations added to KNIME.
    Metric                      Cloudera Data Science Workbench   KNIME
    Ranking                     18th                              4th
    Views                       2,070                             10,966
    Comparisons                 1,837                             7,554
    Reviews                     1                                 21
    Average Words per Review    353                               501
    Rating                      6.0                               7.9
    Also Known As
    Cloudera Data Science Workbench: CDSW
    KNIME: KNIME Analytics Platform
    Overview

    Cloudera Data Science Workbench (CDSW) makes secure, collaborative data science at scale a reality for the enterprise and accelerates the delivery of new data products. With CDSW, organizations can research and experiment faster, deploy models easily and with confidence, and rely on the wider Cloudera platform to reduce the risks and costs of data science projects. Access any data anywhere: from cloud object storage to data warehouses, CDSW provides connectivity not only to CDH but also to the systems your data science teams rely on for analysis.

    KNIME is open-source analytics software for building data science workflows in a graphical interface, eliminating the need to write code. The solution has an inherently modular workflow approach that documents and stores the analysis process in the same order it was conceived and implemented, while ensuring that intermediate results are always available.

    KNIME supports Windows, Linux, and Mac operating systems and is suitable for enterprises of all sizes. With KNIME, you can perform functions ranging from basic I/O to data manipulation, transformation, and data mining. It consolidates the functions of the entire process into a single workflow. The solution covers all main data wrangling and machine learning techniques, and is based on visual programming.

    KNIME Features

    KNIME has many valuable features. Some of the most useful include:

    • Scalability through data handling (intelligent automatic caching of data in the background while maximizing throughput performance)
    • High extensibility via a well-defined API for plugin extensions
    • Intuitive user interface
    • Import/export of workflows
    • Parallel execution on multi-core systems
    • Command line version for "headless" batch executions
    • Activity dashboard
    • Reporting & statistics
    • Third-party integrations
    • Workflow management
    • Local automation
    • Metanode linking
    • Tool blending
    • Big Data extensions

    KNIME Benefits

    There are many benefits to implementing KNIME. Some of the biggest advantages the solution offers include:

    • Integrated Deployment: KNIME’s integrated deployment moves both the selected model and the entire data model preparation process into production simply and automatically. This allows for continuous optimization in production and saves time by eliminating errors.
    • Elastic and Hybrid Execution: KNIME’s elastic and hybrid execution helps you dynamically reduce costs while covering periods of high demand.
    • Metadata Mapping: KNIME enables complete metadata mapping of all aspects of your workflow. In addition, KNIME offers blueprint workflows for documenting the nodes, data sources, and libraries used, as well as runtime information.
    • Guided Analytics: KNIME’s guided analytics applications can be customized based on reusable components.
    • Powerful analytics, local automation, and workflow difference: KNIME uses advanced predictive and machine learning algorithms to provide you with the analytics you need. Combined with these analytics, KNIME’s automation capabilities and workflow difference give your organization the tools it needs to make better business decisions.
    • Supports enterprise-wide data science practices: The deployment and management functionalities of KNIME make it easy to productionize data science applications and services, and deliver usable, reliable, and reproducible insights for the business.
    • Helps you leverage insights gained from your data: Using KNIME ensures the data science process immediately reflects changing requirements or new insights.

    Reviews from Real Users

    Below are some reviews and helpful feedback written by PeerSpot users currently using the KNIME solution.

    An Emeritus Professor at a university says, “It can read many different file formats. It can very easily tidy up your data, deleting blank rows, and deleting rows where certain columns are missing. It allows you to make lots of changes internally, which you do using JavaScript to put in the conditional. It also has very good fundamental machine learning. It has decision trees, linear regression, and neural nets. It has a lot of text mining facilities as well. It's fairly fully-featured.”

    Benedikt S., CEO at SMH - Schwaiger Management Holding GmbH, explains, “All of the features related to the ETL are fantastic. That includes the connectors to other programs, databases, and the meta node function. Technical support has been extremely responsive so far. The solution has a very strong and supportive community that shares information and helps each other troubleshoot. The solution is very stable. The initial setup is pretty simple and straightforward.”

    Piotr Ś., Test Engineer at ProData Consult, says, “What I like the most is that it works almost out of the box with Random Forest and other Forest nodes.”

    Sample Customers
    Cloudera Data Science Workbench: IQVIA, Rush University Medical Center, Western Union
    KNIME: Infocom Corporation, Dymatrix Consulting Group, Soluzione Informatiche, MMI Agency, Estanislao Training and Solutions, Vialis AG
    Top Industries
    Cloudera Data Science Workbench, visitors reading reviews:
    • Financial Services Firm: 32%
    • Healthcare Company: 10%
    • Computer Software Company: 8%
    • Manufacturing Company: 7%
    KNIME, reviewers:
    • University: 25%
    • Comms Service Provider: 17%
    • Retailer: 14%
    • Government: 8%
    KNIME, visitors reading reviews:
    • Manufacturing Company: 12%
    • Financial Services Firm: 11%
    • Computer Software Company: 9%
    • Educational Organization: 8%
    Company Size
    Cloudera Data Science Workbench, visitors reading reviews:
    • Small Business: 9%
    • Midsize Enterprise: 10%
    • Large Enterprise: 81%
    KNIME, reviewers:
    • Small Business: 28%
    • Midsize Enterprise: 26%
    • Large Enterprise: 46%
    KNIME, visitors reading reviews:
    • Small Business: 19%
    • Midsize Enterprise: 14%
    • Large Enterprise: 67%

    Cloudera Data Science Workbench is ranked 18th in Data Science Platforms with 2 reviews while KNIME is ranked 4th in Data Science Platforms with 50 reviews. Cloudera Data Science Workbench is rated 7.0, while KNIME is rated 8.2. The top reviewer of Cloudera Data Science Workbench writes "Useful for data science modeling but improvement is needed in MLOps and pricing". On the other hand, the top reviewer of KNIME writes "A low-code platform that reduces data mining time by linking script". Cloudera Data Science Workbench is most compared with Databricks, Amazon SageMaker, Microsoft Azure Machine Learning Studio, Dataiku and SAS Enterprise Miner, whereas KNIME is most compared with RapidMiner, Microsoft Power BI, Alteryx, Dataiku and Weka.

    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.