We performed a comparison between Amazon EMR and Hortonworks Data Platform based on real PeerSpot user reviews.
Find out in this report how the two Hadoop solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"The solution helps us manage huge volumes of data."
"It allows users to access the data through a web interface."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"Amazon EMR is a good solution that can be used to manage big data."
"The solution is scalable."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
"Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
"The product offers a fairly easy setup process."
"It is a scalable platform."
"Ambari Web UI: user-friendly."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"Distributed computing, secure containerization, and governance capabilities are the most valuable features."
"Hortonworks should not be expensive at all to those looking into using it."
"The problem for us is it starts very slow."
"The initial setup was time-consuming."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"There is room for improvement in pricing."
"I would like to see more support for containers such as Docker and OpenShift."
"The cost of the solution is high and there is room for improvement."
"I work a lot with banking, IT and communications customers. Hortonworks must improve or must upgrade their services for these sectors."
"It would also be nice if there were less coding involved."
"It's at end of life and no longer will there be improvements."
"More information could be there to simplify the process of running the product."
"Deleting any service requires a lot of clean up, unlike Cloudera."
"The version control of the software is also an issue."
Amazon EMR is ranked 3rd in Hadoop with 20 reviews while Hortonworks Data Platform is ranked 6th in Hadoop with 25 reviews. Amazon EMR is rated 7.8, while Hortonworks Data Platform is rated 8.0. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Hortonworks Data Platform writes "Good for secure containerization, and governance capabilities ". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Amazon Redshift and AWS Lake Formation, whereas Hortonworks Data Platform is most compared with Apache Spark, Cloudera DataFlow and HPE Ezmeral Data Fabric. See our Amazon EMR vs. Hortonworks Data Platform report.
See our list of best Hadoop vendors.
We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.