We performed a comparison between Amazon Kinesis and Apache Spark Streaming based on real PeerSpot user reviews.
Find out in this report how the two Streaming Analytics solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"The most valuable feature is that it has a pretty robust way of capturing things."
"The solution's technical support is flawless."
"Amazon Kinesis's main purpose is to provide near real-time data streaming at a consistent 2Mbps rate, which is really impressive."
"Amazon Kinesis has improved our ROI."
"Setting Amazon Kinesis up is quick and easy; it only takes a few minutes to configure the necessary settings and start using it."
"Kinesis is a fully managed program streaming application. You can manage any infrastructure. It is also scalable. Kinesis can handle any amount of data streaming and process data from hundreds, thousands of processes in every source with very low latency."
"Its scalability is very high. There is no maintenance and there is no throughput latency. I think data scalability is high, too. You can ingest gigabytes of data within seconds or milliseconds."
"From my experience, one of the most valuable features is the ability to track silent events on endpoints. Previously, these events might have gone unnoticed, but now we can access them within the product range. For example, if a customer reports that their calls are not reaching the portal files, we can use this feature to troubleshoot and optimize the system."
"The solution is better than average and some of the valuable features include efficiency and stability."
"Apache Spark Streaming is versatile. You can use it for competitive intelligence, gathering data from competitors, or for internal tasks like monitoring workflows."
"As an open-source solution, using it is basically free."
"The solution is very stable and reliable."
"Apache Spark Streaming's most valuable feature is near real-time analytics. The developers can build APIs easily for a code-streaming pipeline. The solutions have an ecosystem of integration with other stock services."
"It's the fastest solution on the market with low latency data on data transformations."
"Apache Spark Streaming has features like checkpointing and Streaming API that are useful."
"The platform’s most valuable feature for processing real-time data is its ability to handle continuous data streams."
"Amazon Kinesis involved a more complex setup and configuration than Azure Event Hub."
"Could include features that make it easier to scale."
"In general, the pain point for us was that once the data gets into Kinesis there is no way for us to understand what's happening because Kinesis divides everything into shards. So if we wanted to understand what's happening with a particular shard, whether it is published or not, we could not. Even with the logs, if we want to have some kind of logging it is in the shard."
"One area for improvement in the solution is the file size limitation of 10 Mb. My company works with files with a larger file size. The batch size and throughput also need improvement in Amazon Kinesis."
"For me, especially with video streams, there's sometimes a kind of delay when the data has to be pumped to other services. This delay could be improved in Kinesis, or especially the Kinesis Video Streams, which is being used for different use cases for Amazon Connect. With that improvement, a lot of other use cases of Amazon Connect integrating with third-party analytic tools would be easier."
"It would be beneficial if Amazon Kinesis provided document based support on the internet to be able to read the data from the Kinesis site."
"If there were better documentation on optimal sharding strategies then it would be helpful."
"Something else to mention is that we use Kinesis with Lambda a lot and the fact that you can only connect one Stream to one Lambda, I find is a limiting factor. I would definitely recommend to remove that constraint."
"The cost and load-related optimizations are areas where the tool lacks and needs improvement."
"The solution itself could be easier to use."
"We would like to have the ability to do arbitrary stateful functions in Python."
"There could be an improvement in the area of the user configuration section; it should be less developer-focused and more business user-focused."
"It was resource-intensive, even for small-scale applications."
"Integrating event-level streaming capabilities could be beneficial."
"The service structure of Apache Spark Streaming can improve. There are a lot of issues with memory management and latency. There is no real-time analytics. We recommend it for use cases where a five-second latency is acceptable, but not for millisecond-latency, IoT-based, or anomaly-detection use cases. Flink as a service is much better."
"The initial setup is quite complex."
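Several of the Kinesis reviewers above mention shards: one could not tell what was happening to data in a particular shard, and another asked for better documentation on sharding strategies. As a rough illustration only, the sketch below shows how a partition key can be mapped to a shard. It relies on the documented behavior that Kinesis Data Streams applies an MD5 hash to the partition key to obtain a 128-bit integer and routes the record to the shard whose hash key range contains that value. The evenly split ranges and the helper names (`even_shard_ranges`, `hash_key_for`, `shard_index_for`) are assumptions made for this example; a real stream's ranges should be read from the service rather than computed.

```python
import hashlib

# The Kinesis hash key space covers 0 .. 2^128 - 1.
MAX_HASH_KEY = 2**128 - 1


def even_shard_ranges(shard_count):
    """Split the hash key space into contiguous, evenly sized ranges.

    Assumption: the stream was never resharded unevenly. Real ranges
    should come from the stream's own shard listing.
    """
    size = 2**128 // shard_count
    ranges = []
    for i in range(shard_count):
        start = i * size
        # The last range absorbs any remainder so the space is fully covered.
        end = MAX_HASH_KEY if i == shard_count - 1 else (i + 1) * size - 1
        ranges.append((start, end))
    return ranges


def hash_key_for(partition_key):
    """Return the MD5 digest of the partition key as a 128-bit integer."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big")


def shard_index_for(partition_key, ranges):
    """Return the index of the range that contains the key's hash value."""
    value = hash_key_for(partition_key)
    for index, (start, end) in enumerate(ranges):
        if start <= value <= end:
            return index
    raise ValueError("hash value is not covered by any range")


if __name__ == "__main__":
    ranges = even_shard_ranges(4)
    for key in ("customer-1", "customer-2", "device-42"):
        print(key, "-> shard index", shard_index_for(key, ranges))
```

Logging the computed shard index next to each published record is one way to trace where a given key ends up, which may help with the visibility problem the reviewers describe.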
Amazon Kinesis is ranked 1st in Streaming Analytics with 24 reviews, while Apache Spark Streaming is ranked 8th in Streaming Analytics with 9 reviews. Both solutions are rated 8.0. The top reviewer of Amazon Kinesis writes "Used for media streaming and live-streaming data". On the other hand, the top reviewer of Apache Spark Streaming writes "Easy integration, beneficial auto-scaling, and good open-sourced support community". Amazon Kinesis is most compared with Azure Stream Analytics, Amazon MSK, Confluent, Apache Flink and Databricks, whereas Apache Spark Streaming is most compared with Spring Cloud Data Flow, Azure Stream Analytics, Apache Pulsar, Confluent and Starburst Enterprise.
We monitor all Streaming Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.