We performed a comparison between Amazon EMR and Snowflake based on real PeerSpot user reviews.
Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"The initial setup is straightforward."
"It allows users to access the data through a web interface."
"The solution helps us manage huge volumes of data."
"The initial setup is pretty straightforward."
"The cloning functionality has been the most valuable. I have been able to completely copy databases. The data sharing concept is also useful. As compared to, for example, SAP, Snowflake is a lot more open, and it allows a lot more connectivity for other providers than an SAP ecosystem."
"Snowflake's most valuable features are data enrichment and flattening."
"It has great flexibility whenever we are loading data and performs ELT (extract, load, transform) techniques instead of ETL."
"The most valuable features are the clustering, LS50, being able to change the size, the pay per use feature, the flexibility with many different sources and analytic applications."
"The most valuable feature has been the Snowflake data sharing and dynamic data masking."
"The feature that is really striking is the ability to translate the SQL workloads into the NoSQL version that can be used by Snowflake."
"My company wanted to have all our data in one single place and this what we use Snowflake for. Snowflake also allows us to build connectors to different data sources."
"The features that I have found most valuable are the ease of use, the rapidness, how quickly the solution can be implemented, and of course that it's been very easy to move from the on-premise world to the Cloud world because Snowflake is based on SQL also."
"The problem for us is it starts very slow."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"The product's features for storing data in static clusters could be better."
"There is room for improvement in pricing."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"Modules and strategies should be better handled and notified early in advance."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"There is no need to pay extra for third-party software."
"I see room for improvement when it comes to credit performance. The other thing I'd like to be improved is the warehouse facility."
"We are yet to figure out how to integrate tools, such as Liquibase, to release changes to our data warehouse model."
"The cost efficiency and monitoring of this solution could be improved. It's easy to spend a lot on Snowflake and it does offer monitoring tools but they're pretty basic."
"There is a scope for improvement. They don't currently support integration with some of the Azure and AWS native services. It would be good if they can enhance their product to integrate with these services."
"Product activation queries can't be changed while executing."
"There is room for improvement in Snowflake's integration with Python. We do a lot of SQL programming in Snowflake, but we go to a different tool to program when we have to in Python."
"The scheduling system can definitely be better because we had to use external airflow for that. There should be orchestration for the scheduling system. Snowflake currently does not support machine learning, so it is just storage. They also need some alternatives for SQL Query. There should also be support for Spark in different languages such as Python."
"The solution should offer an on-premises version also. We have some requirements where we would prefer to use it as a template."
Amazon EMR is ranked 8th in Cloud Data Warehouse with 20 reviews while Snowflake is ranked 1st in Cloud Data Warehouse with 92 reviews. Amazon EMR is rated 7.8, while Snowflake is rated 8.4. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Snowflake writes "Good usability, good data sharing and elastic compute features, and requires less DBA involvement". Amazon EMR is most compared with Cloudera Distribution for Hadoop, Azure Data Factory, Amazon Redshift, Apache Spark and Microsoft Azure Synapse Analytics, whereas Snowflake is most compared with BigQuery, Azure Data Factory, Teradata, Vertica and Oracle Autonomous Data Warehouse. See our Amazon EMR vs. Snowflake report.
See our list of best Cloud Data Warehouse vendors.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.