Amazon EMR vs Snowflake comparison

Amazon EMR (Amazon Web Services): 2,342 views | 2,016 comparisons | 85% willing to recommend
Snowflake (Snowflake Computing): 21,234 views | 11,994 comparisons | 96% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EMR and Snowflake based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed Amazon EMR vs. Snowflake Report (Updated: March 2024).
770,292 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
  • "The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
  • "Amazon EMR's most valuable features are processing speed and data storage capacity."
  • "We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
  • "When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
  • "The initial setup is straightforward."
  • "It allows users to access the data through a web interface."
  • "The solution helps us manage huge volumes of data."
  • "The initial setup is pretty straightforward."

More Amazon EMR Pros →

  • "The cloning functionality has been the most valuable. I have been able to completely copy databases. The data sharing concept is also useful. As compared to, for example, SAP, Snowflake is a lot more open, and it allows a lot more connectivity for other providers than an SAP ecosystem."
  • "Snowflake's most valuable features are data enrichment and flattening."
  • "It has great flexibility whenever we are loading data and performs ELT (extract, load, transform) techniques instead of ETL."
  • "The most valuable features are the clustering, LS50, being able to change the size, the pay per use feature, the flexibility with many different sources and analytic applications."
  • "The most valuable feature has been the Snowflake data sharing and dynamic data masking."
  • "The feature that is really striking is the ability to translate the SQL workloads into the NoSQL version that can be used by Snowflake."
  • "My company wanted to have all our data in one single place and this what we use Snowflake for. Snowflake also allows us to build connectors to different data sources."
  • "The features that I have found most valuable are the ease of use, the rapidness, how quickly the solution can be implemented, and of course that it's been very easy to move from the on-premise world to the Cloud world because Snowflake is based on SQL also."

More Snowflake Pros →

Cons
  • "The problem for us is it starts very slow."
  • "There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
  • "The product's features for storing data in static clusters could be better."
  • "There is room for improvement in pricing."
  • "We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
  • "Modules and strategies should be better handled and notified early in advance."
  • "Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
  • "There is no need to pay extra for third-party software."

More Amazon EMR Cons →

  • "I see room for improvement when it comes to credit performance. The other thing I'd like to be improved is the warehouse facility."
  • "We are yet to figure out how to integrate tools, such as Liquibase, to release changes to our data warehouse model."
  • "The cost efficiency and monitoring of this solution could be improved. It's easy to spend a lot on Snowflake and it does offer monitoring tools but they're pretty basic."
  • "There is a scope for improvement. They don't currently support integration with some of the Azure and AWS native services. It would be good if they can enhance their product to integrate with these services."
  • "Product activation queries can't be changed while executing."
  • "There is room for improvement in Snowflake's integration with Python. We do a lot of SQL programming in Snowflake, but we go to a different tool to program when we have to in Python."
  • "The scheduling system can definitely be better because we had to use external airflow for that. There should be orchestration for the scheduling system. Snowflake currently does not support machine learning, so it is just storage. They also need some alternatives for SQL Query. There should also be support for Spark in different languages such as Python."
  • "The solution should offer an on-premises version also. We have some requirements where we would prefer to use it as a template."

More Snowflake Cons →

Pricing and Cost Advice
  • "You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
  • "The cost of Amazon EMR is very high."
  • "The price of the solution is expensive."
  • "Amazon EMR's price is reasonable."
  • "There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
  • "There is no need to pay extra for third-party software."
  • "Amazon EMR is not very expensive."
  • "The product is not cheap, but it is not expensive."
  • More Amazon EMR Pricing and Cost Advice →
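
Several reviewers above describe EMR cost as a small service fee on top of the underlying instances. A rough way to reason about that is sketched below. The function name and both hourly rates are hypothetical placeholders chosen for easy arithmetic, not AWS prices; actual rates depend on instance type and region.

```python
def estimate_emr_cluster_cost(nodes: int, hours: float,
                              ec2_rate_per_hour: float,
                              emr_fee_per_hour: float) -> float:
    """Estimate cluster cost: each node pays the instance rate plus the EMR fee."""
    if nodes < 0 or hours < 0:
        raise ValueError("nodes and hours must be non-negative")
    per_node_hour = ec2_rate_per_hour + emr_fee_per_hour
    return round(nodes * hours * per_node_hour, 2)


if __name__ == "__main__":
    # Placeholder rates (assumptions), not real AWS prices.
    total = estimate_emr_cluster_cost(nodes=10, hours=8,
                                      ec2_rate_per_hour=0.20,
                                      emr_fee_per_hour=0.05)
    print(f"Estimated cost: ${total:.2f}")
```

With these placeholder numbers the instance charge makes up four fifths of the total, which matches the reviewers' observation that infrastructure is the major cost component.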

  • "Pricing can be confusing for customers."
  • "The whole licensing system is based on credit points. You can also make a license agreement with the company so that you buy credit points and then you use them. What you do not use in one year can be carried over to the next year."
  • "You pay based on the data that you are storing in the data warehouse and there are no maintenance costs."
  • "It is not cheap."
  • "The pricing for Snowflake is competitive."
  • "On average, with the number of queries that we run, we pay approximately $200 USD per month."
  • "Pricing is approximately $US 50 per DB. Terabyte is around $US 50 per month."
  • "The price of Snowflake is very reasonable."
  • More Snowflake Pricing and Cost Advice →
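
The credit-based pricing that reviewers mention can be approximated with simple arithmetic. The sketch below assumes the commonly documented credits-per-hour figures for standard warehouse sizes and a 60-second minimum charge each time a warehouse starts; both should be checked against current Snowflake documentation. The price per credit is a made-up placeholder, and the function names are hypothetical.

```python
# Assumed credits consumed per hour by standard warehouse size (doubles per step).
CREDITS_PER_HOUR = {"XSMALL": 1, "SMALL": 2, "MEDIUM": 4, "LARGE": 8, "XLARGE": 16}
MINIMUM_BILLED_SECONDS = 60  # assumed minimum charge per warehouse start


def estimate_credits(size: str, seconds_running: int) -> float:
    """Credits used by one warehouse run of the given length."""
    if seconds_running <= 0:
        return 0.0  # a warehouse that never ran consumes nothing
    billed_seconds = max(seconds_running, MINIMUM_BILLED_SECONDS)
    return CREDITS_PER_HOUR[size.upper()] * billed_seconds / 3600


def estimate_cost(size: str, seconds_running: int, price_per_credit: float) -> float:
    """Convert credits to currency using a caller-supplied price per credit."""
    return round(estimate_credits(size, seconds_running) * price_per_credit, 2)


if __name__ == "__main__":
    # 30 minutes on a MEDIUM warehouse at a placeholder price of 3.00 per credit.
    print(estimate_credits("medium", 1800))
    print(estimate_cost("medium", 1800, price_per_credit=3.00))
```

Storage is billed separately from compute, so an estimate like this covers only the query-processing side of the bill.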

    Questions from the Community
    Top Answer: Amazon EMR is a good solution that can be used to manage big data.
    Top Answer: As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.
    Top Answer: The best thing about Snowflake is its flexibility in changing warehouse sizes or computational power.
    Top Answer: The real-time streaming feature is limited with Snowflake and could be improved. Currently, Snowflake doesn't support unstructured data. With Snowflake, you need to be very particular about the type…
    Ranking
                                  Amazon EMR    Snowflake
    Ranking                       8th           1st
    Views                         2,342         21,234
    Comparisons                   2,016         11,994
    Reviews                       12            36
    Average Words per Review      346           464
    Rating                        7.8           8.3
    Also Known As
    Amazon EMR: Amazon Elastic MapReduce
    Snowflake: Snowflake Computing
    Overview
    Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances.

    Snowflake is a cloud-based data warehousing solution used for storing and processing data, generating reports and dashboards, and serving as a BI reporting source. It is also used for optimizing costs, working with financial data, and migrating data from on-premises systems to the cloud. The solution is often used as a centralized data warehouse that combines data from multiple sources.

    Snowflake has helped organizations improve query performance, store and process JSON and XML, consolidate multiple databases into one unified table, power company-wide dashboards, increase productivity, reduce processing time, and have easy maintenance with good technical support.

    Its platform is made up of three components:

    1. Cloud services - Snowflake uses ANSI SQL to empower users to optimize their data and manage their infrastructure, while Snowflake handles the security and encryption of stored data.
    2. Query processing - Snowflake's compute layer is made up of virtual cloud data warehouses that let you analyze data through requests. The warehouses do not compete for computing resources or affect one another's performance.
    3. Database storage - Snowflake automatically manages all parts of the data storage process, including file size, compression, organization, structure, metadata, and statistics.
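
Because the compute layer is separate from storage, a warehouse can be resized without touching the stored data. The sketch below only builds the SQL text for such a resize; the helper name and the warehouse name REPORTING_WH are hypothetical, the list of sizes is a subset, and running the statement would require a Snowflake client connection that is not shown here.

```python
import re

# Subset of warehouse sizes, listed here as an assumption for illustration.
VALID_SIZES = ("XSMALL", "SMALL", "MEDIUM", "LARGE", "XLARGE")
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_$]*$")


def resize_warehouse_sql(warehouse: str, size: str) -> str:
    """Build an ALTER WAREHOUSE statement that changes only the compute size."""
    size = size.upper()
    if size not in VALID_SIZES:
        raise ValueError(f"unsupported size: {size}")
    if not _IDENTIFIER.match(warehouse):
        raise ValueError(f"not a simple identifier: {warehouse!r}")
    return f"ALTER WAREHOUSE {warehouse} SET WAREHOUSE_SIZE = '{size}'"


if __name__ == "__main__":
    # Scale up before a heavy batch job, then back down afterwards.
    print(resize_warehouse_sql("REPORTING_WH", "large"))
    print(resize_warehouse_sql("REPORTING_WH", "xsmall"))
```

Validating the identifier before formatting keeps the helper from producing malformed or injected SQL when the warehouse name comes from user input.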

    Snowflake has many valuable features. Some of the most useful ones include:

    • Snowflake architecture provides nearly unlimited scalability and high speed because it uses a single elastic performance engine. The solution also supports unlimited concurrent users and workloads, from interactive to batch.
    • Snowflake makes automation easy and enables enterprises to automate data management, security, governance, availability, and data resiliency.
    • With seamless cross-cloud and cross-region connections, Snowflake eliminates ETL and data silos. Anyone who needs access to shared secure data can get a single copy via the data cloud. In addition, Snowflake makes remote collaboration and decision-making fast and easy via a single shared data source.
    • Snowflake’s Data Marketplace offers third-party data, which allows you to connect with Snowflake customers to extend workflows with data services and third-party applications.

    There are many benefits to implementing Snowflake. It helps optimize costs, reduce downtime, improve operational efficiency, and automate data replication for fast recovery, and it is built for high reliability and availability.

      Below are quotes from interviews we conducted with users currently using the Snowflake solution:

      Sreenivasan R., Director of Data Architecture and Engineering at Decision Minds, says, "Data sharing is a good feature. It is a majorly used feature. The elastic computing is another big feature. Separating computing and storage gives you flexibility. It doesn't require much DBA involvement because it doesn't need any performance tuning. We are not doing any performance tuning, and the entire burden of performance and SQL tuning is on Snowflake. Its usability is very good. I don't need to ramp up any user, and its onboarding is easier. You just onboard the user, and you are done with it. There are simple SQL and UI, and people are able to use this solution easily. Ease of use is a big thing in Snowflake."

      A director of business operations at a logistics company mentions, "It requires no maintenance on our part. They handle all that. The speed is phenomenal. The pricing isn't really anything more than what you would be paying for a SQL server license or another tool to execute the same thing. We have zero maintenance on our side to do anything and the speed at which it performs queries and loads the data is amazing. It handles unstructured data extremely well, too. So, if the data is in a JSON array or an XML, it handles that super well."

      A Solution Architect at a wholesaler/distributor comments, "The ability to share the data and the ability to scale up and down easily are the most valuable features. The concept of data sharing and data plumbing made it very easy to provide and share data. The ability to refresh your Dev or QA just by doing a clone is also valuable. It has the dynamic scale up and scale down feature. Development and deployment are much easier as compared to other platforms where you have to go through a lot of stuff. With a tool like DBT, you can do modeling and transformation within a single tool and deploy to Snowflake. It provides continuous deployment and continuous integration abilities. There is a separation of storage and compute, so you only get charged for your usage. You only pay for what you use. When we share the data downstream with business partners, we can specifically create compute for them, and we can charge back the business."
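
The Dev or QA refresh by cloning that this reviewer describes could look roughly like the sketch below. It only generates the SQL text; the helper name and the database names PROD_DB and DEV_DB are hypothetical, and the statement would still have to be executed through a Snowflake client, which is not shown.

```python
import re

_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_$]*$")


def clone_database_sql(source_db: str, target_db: str) -> str:
    """Build a statement that replaces target_db with a clone of source_db."""
    for name in (source_db, target_db):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"not a simple identifier: {name!r}")
    if source_db.upper() == target_db.upper():
        raise ValueError("source and target must differ")
    return f"CREATE OR REPLACE DATABASE {target_db} CLONE {source_db}"


if __name__ == "__main__":
    # Refresh a development copy from production (hypothetical names).
    print(clone_database_sql("PROD_DB", "DEV_DB"))
```

The guard against identical names matters because OR REPLACE would otherwise target the very database being cloned.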

      Sample Customers
      Amazon EMR: Yelp
      Snowflake: Accordant Media, Adobe, Kixeye Inc., Revana, SOASTA, White Ops
      Top Industries
      Amazon EMR
      REVIEWERS: Computer Software Company 27%, Wholesaler/Distributor 18%, Media Company 18%, Comms Service Provider 9%
      VISITORS READING REVIEWS: Financial Services Firm 23%, Computer Software Company 13%, Manufacturing Company 8%, Educational Organization 6%
      Snowflake
      REVIEWERS: Computer Software Company 29%, Financial Services Firm 20%, Healthcare Company 6%, Manufacturing Company 6%
      VISITORS READING REVIEWS: Educational Organization 27%, Financial Services Firm 13%, Computer Software Company 10%, Manufacturing Company 6%
      Company Size
      Amazon EMR
      REVIEWERS: Small Business 26%, Midsize Enterprise 26%, Large Enterprise 47%
      VISITORS READING REVIEWS: Small Business 16%, Midsize Enterprise 11%, Large Enterprise 72%
      Snowflake
      REVIEWERS: Small Business 24%, Midsize Enterprise 20%, Large Enterprise 55%
      VISITORS READING REVIEWS: Small Business 15%, Midsize Enterprise 34%, Large Enterprise 51%

      Amazon EMR is ranked 8th in Cloud Data Warehouse with 20 reviews while Snowflake is ranked 1st in Cloud Data Warehouse with 92 reviews. Amazon EMR is rated 7.8, while Snowflake is rated 8.4. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability". On the other hand, the top reviewer of Snowflake writes "Good usability, good data sharing and elastic compute features, and requires less DBA involvement". Amazon EMR is most compared with Cloudera Distribution for Hadoop, Azure Data Factory, Amazon Redshift, Apache Spark and Microsoft Azure Synapse Analytics, whereas Snowflake is most compared with BigQuery, Azure Data Factory, Teradata, Vertica and Oracle Autonomous Data Warehouse. See our Amazon EMR vs. Snowflake report.


      We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.