We performed a comparison between Amazon EMR and Amazon Redshift based on real PeerSpot user reviews.
Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"The initial setup is straightforward."
"It allows users to access the data through a web interface."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"The initial setup is pretty straightforward."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The solution is pretty simple to set up."
"I found the Amazon Redshift computing services easy. I found the computing instances the most incredible in the solution."
"We have found Machine Learning use cases are very nice."
"Has a very user-friendly SQL editor and it's very easy to use the connectors."
"The most valuable features of Amazon Redshift are that its fast and efficient. We have lots of TBs of data and it's very fast."
"The product offers good support for the data lake."
"The solution's flexibility is its most valuable feature. It's also easy to scale and has relatively painless pricing."
"The initial setup is easy."
"Redshift's versioning and data security are the two most critical features. When migrating into the cloud, it's vital to secure the data. The encryption and security are there."
"The product must add some of the latest technologies to provide more flexibility to the users."
"The dashboard management could be better. Right now, it's lacking a bit."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"Modules and strategies should be better handled and notified early in advance."
"There is no need to pay extra for third-party software."
"The product's features for storing data in static clusters could be better."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"There is room for improvement in pricing."
"The initial setup is a complex process, especially for someone who is not familiar with nodes and configuring terms like RPUs."
"There is some missing functionality and sometimes it's so difficult to work in. We need to convert these functionalities using VACUUM inside Amazon Redshift and then it causes some complexity."
"There are physically too many pipelines for a company of this size to maintain. For a data scientist, it's very difficult to learn the data in all of these different environments."
"Infinite storage is available in Snowflake and is not available in Redshift."
"In the next release, a pivot function would be a big help. It could save a lot of time creating a query or process to handle operations."
"The technical support should be better in terms of their knowledge, and they should be more customer-friendly."
"Migrating data from other data sources can be challenging when you are working with multibyte character sets."
"The initial deployment was complex."
Amazon EMR is ranked 8th in Cloud Data Warehouse with 20 reviews while Amazon Redshift is ranked 4th in Cloud Data Warehouse with 59 reviews. Amazon EMR is rated 7.8, while Amazon Redshift is rated 7.8. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Amazon Redshift writes "Provides one place where we can store data, and allows us to easily connect to other services with AWS". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Apache Spark and Microsoft Azure Synapse Analytics, whereas Amazon Redshift is most compared with Snowflake, Teradata, AWS Lake Formation, Vertica and SAP BW4HANA. See our Amazon EMR vs. Amazon Redshift report.
See our list of best Cloud Data Warehouse vendors.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.