AWS Glue vs Matillion ETL comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
11,729 views|8,292 comparisons
92% willing to recommend
Matillion Logo
3,247 views|2,215 comparisons
95% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between AWS Glue and Matillion ETL based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed AWS Glue vs. Matillion ETL Report (Updated: March 2024).
772,649 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features.""Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you.""The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs.""The solution helps organizations gain flexibility in defining the structure of the data.""I like the fact that AWS Glue works with Python scripts.""The solution is stable and reliable.""The most valuable features currently are glue studio, jobs, and triggers.""Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."

More AWS Glue Pros →

"It's highly scalable. It takes upon itself the Redshift scalability, so it's very good.""It is pretty user-friendly, even for people who aren't super technical.""The simplicity of this tool is nice. It has a good graphical user interface. You can also do a lot of generic stuff in the tool. If there is good connectivity to a cloud database, such as Snowflake, and you can have a lot of Snowflake functionality in the tool.""It can scale to a great extent. It can handle the load that we are putting on it, which is about 5TBs.""It is an incredibly user-friendly and intuitive tool, making the learning curve quite smooth""Matillion ETL has great Git integration that is perfect and convenient to use.""The tool's middle-dimensional structure significantly simplifies obtaining the right data at the appropriate level. This feature makes deploying our applications easier since we utilize a single source without publishing data from various sources.""It has good integrations with Amazon Redshift and other AWS services."

More Matillion ETL Pros →

Cons
"The solution should offer features for streaming data in addition to batching data.""I would like to see a more robust interface on the no-code side. This would be nice to be able to split cells.""There should be more connectors for different databases.""In terms of performance, if they can further optimize the execution time for serverless jobs, it would be a welcome improvement.""We face performance issues when using AWS Glue for data transformation and integration.""The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3.""One area that could be improved is the ETL view. The drag-and-drop interface is not as user-friendly as some other ETL tools.""The setup and installation is a bit complex without advanced knowledge or training."

More AWS Glue Cons →

"The improvement area could be possible if the tool provides better integration capabilities with other ecosystems, including governance tools or data cataloging tools, as it is currently an area where the solution is lacking.""It needs integration with more data sources.""Ideally, I would like it to integrate with Secrets Manager as well as the AWS.""Unlike Snowflake which automatically takes care of upgrading to the latest version and includes additional features, with Matillion ETL we need to do this ourselves.""The cost of the solution is high and could be reduced.""One of the features that's in development is data privacy in the cloud, along with further SAP integration. For connectivity to SAP systems.""Sometimes, we have issues with the solution's stability and need to restart it for three weeks or more.""I am looking forward to seeing the expansion of the source range for their data loader product."

More Matillion ETL Cons →

Pricing and Cost Advice
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
  • More AWS Glue Pricing and Cost Advice →

  • "I have heard from my manager and other higher ups, "This product is cheaper than other things on the market," and they have done the research."
  • "It is cost-effective. Based on our use case, it's efficient and cheap. It saves a lot of money and our upfront costs are less."
  • "The prices needs to be lower."
  • "It was very easy to purchase through the AWS Marketplace, but it was also expensive."
  • "Purchasing it through the AWS Marketplace is pretty convenient. There is a little bit of back and forth in terms of the licensing based on the machine size, but it seems to have worked out well. it is convenient to have it all as part of our AWS billing."
  • "It is not necessarily a cheap solution. However, it's reasonable priced, especially with the smaller machines that we run it on."
  • "The AWS pricing and licensing are a cost-effective solution for data integration needs."
  • "It was procured through the AWS Marketplace because it keeps things simple. They offer retail-like checkout and bill through your existing Amazon Web Services account."
  • More Matillion ETL Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    772,649 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or… more »
    Top Answer:The new version with the Productivity Cloud is very simple. It's easy to use, navigate, and understand.
    Top Answer:The pricing depends on what edition the customer opts for. For example, a standard edition and then business critical of different editions. Each of those has a different cost per unit, which is… more »
    Top Answer:One of the features that's in development is data privacy in the cloud, along with further SAP integration. For connectivity to SAP systems.
    Ranking
    1st
    Views
    11,729
    Comparisons
    8,292
    Reviews
    32
    Average Words per Review
    419
    Rating
    7.8
    4th
    Views
    3,247
    Comparisons
    2,215
    Reviews
    13
    Average Words per Review
    687
    Rating
    8.6
    Comparisons
    Also Known As
    Matillion ETL for Redshift, Matillion ETL for Snowflake, Matillion ETL for BigQuery
    Learn More
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. The feature controls job resources, removing them depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authority of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.

    Matillion ETL is a powerful tool for extracting, transforming, and loading large amounts of data from various sources into cloud data warehouses like Snowflake. Its ability to load data dynamically and efficiently using metadata is a standout feature, as is its open-source ETL with good performance and high efficiency. 

    The solution has a graphical interface for jobs, is easily adjustable and extensible, and allows for scheduling and error reporting. Matillion ETL has helped organizations move to a cloud-based solution, bridge the gap between on-premises and on-cloud, and perform complex migration projects.

    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    Thrive Market, MarketBot, PWC, Axtria, Field Nation, GE, Superdry, Quantcast, Lightbox, EDF Energy, Finn Air, IPRO, Twist, Penn National Gaming Inc
    Top Industries
    REVIEWERS
    Computer Software Company47%
    Financial Services Firm18%
    Pharma/Biotech Company12%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company14%
    Manufacturing Company8%
    Insurance Company7%
    REVIEWERS
    Manufacturing Company33%
    Financial Services Firm33%
    Healthcare Company8%
    Computer Software Company8%
    VISITORS READING REVIEWS
    Computer Software Company16%
    Financial Services Firm14%
    Manufacturing Company9%
    Government8%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise13%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise72%
    REVIEWERS
    Small Business25%
    Midsize Enterprise33%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise69%
    Buyer's Guide
    AWS Glue vs. Matillion ETL
    March 2024
    Find out what your peers are saying about AWS Glue vs. Matillion ETL and other solutions. Updated: March 2024.
    772,649 professionals have used our research since 2012.

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Matillion ETL is ranked 4th in Cloud Data Integration with 24 reviews. AWS Glue is rated 7.8, while Matillion ETL is rated 8.6. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Matillion ETL writes "Efficient data integration and transformation with seamless cloud-native integration". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, Informatica Cloud Data Integration, SSIS and Palantir Foundry, whereas Matillion ETL is most compared with Snowflake, Azure Data Factory, SSIS, Informatica PowerCenter and Informatica Cloud Data Integration. See our AWS Glue vs. Matillion ETL report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.