bicyclesetr.blogg.se

Amazon Redshift Spectrum vs Athena






Over the past year, AWS announced two serverless database technologies: Amazon Redshift Spectrum and Amazon Athena. With both services claiming to run queries on unstructured data stored in Amazon S3 without having to load or transform it, and both offering similar pricing, it wasn't very clear how they differ or which to choose.

How is Amazon Redshift Spectrum different from Amazon Athena?

Most of the discussion of this question is centered around the technical differences. Rather than looking at this question from a technical perspective, I thought exploring it as a buying question might be useful. So how do you decide if using Amazon Redshift Spectrum or Amazon Athena makes sense? Here are four questions you can ask yourself to help get a sense of which is best for your case.

If you are already a Redshift customer, using Redshift Spectrum can help you balance the need for adding capacity to the system. This can save you money, since you can lifecycle data out of Redshift to S3. For example, say you have a 100 GB transactional table of infrequently accessed data. Why pay to store that in Redshift when moving it to S3 and querying it with Spectrum is an option? The benefit of this approach is offloading data so you can be more efficient with local storage in Redshift. Be advised that you are still paying "per query" via Spectrum, the same as you would be charged in Athena. As an existing Redshift user, I would be less inclined to use Athena because of my existing investment in Redshift and any ancillary data operations that process data into it.

If you are not a Redshift customer, then it becomes more interesting. Assuming you have objects on S3 that Athena can consume, you might start with Athena rather than spinning up Redshift. Remember that access to Spectrum requires an active, running Redshift instance: Redshift Spectrum is not an option without Redshift. Access to the "Redshift + Redshift Spectrum" tandem has costs that might not be worthwhile (right now). Athena might make more sense given that fact.

Do My Analytic Tools Support Amazon Athena?

It might be the case that your analytic tool of choice does not support Athena, but does support Redshift. Some tools have added support: for example, Tableau officially released support for Athena in version 10.3. However, there are many tools that don't support Athena. The flip side is they also don't support Spectrum. So this gets back to the first point (1) around what your current stack includes. If you went down the Athena path, your tool choices are currently more limited than with Redshift. The Redshift path gives you more analytics options at the moment.

Can I Use Both Amazon Athena and Redshift Spectrum?

Yes! If Athena or Spectrum are candidates for your workflows, then you are likely structuring your data in a manner that could support either tool. Why? Athena and Spectrum can both access the same object on S3. I can query a 1 TB Parquet file on S3 in Athena the same as in Spectrum. If your data is optimized on S3 in the Apache Parquet format, then you are well positioned for Athena AND Spectrum. The transition between the two becomes somewhat trivial. What becomes critical is cost and performance considerations related to the file format you employ. An unoptimized format (e.g., CSV) is perfect if you have cash to burn. If your data is stored in Apache Parquet files, it is also trivial to switch contexts between Spectrum and Athena. The use cases for using both might be limited, but they are not mutually exclusive choices if the need arises.

This post goes into the Apache Parquet topic in more detail: How to Be a Hero with Powerful Parquet, Google, and Amazon Athena

Amazon Athena Now Has a REST API

Amazon recently released a REST API for Athena: Amazon Athena adds API/CLI, AWS SDK support, and audit logging with AWS CloudTrail. This might open some interesting new use cases for Athena. I can see people building custom UI wrappers with D3.js and similar frameworks.

Summary

Amazon is continuing to scale its product offerings at an accelerated rate. It can be difficult to keep up from product, technical, and business value perspectives. Amazon Redshift Spectrum and Amazon Athena are evolutions of the AWS solution stack.
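To put rough numbers on the file-format point made in this post (both services charge per query, so CSV gets expensive), here is a minimal Python sketch. It assumes a price of 5 USD per terabyte scanned; that figure, the function names, and the 10% scan ratio for Parquet are illustrative assumptions rather than values from this post, so check current AWS pricing and measure your own queries before relying on it.

```python
# Rough per-query cost sketch for Athena or Redshift Spectrum.
# ASSUMPTION: both services bill by data scanned at this rate (USD per TB).
PRICE_PER_TB_USD = 5.0


def query_cost_usd(tb_scanned: float, price_per_tb: float = PRICE_PER_TB_USD) -> float:
    """Return the estimated cost of one query that scans `tb_scanned` terabytes."""
    if tb_scanned < 0:
        raise ValueError("tb_scanned must be non-negative")
    return tb_scanned * price_per_tb


def monthly_cost_usd(tb_scanned_per_query: float, queries_per_month: int) -> float:
    """Return the estimated monthly cost for a repeated query."""
    if queries_per_month < 0:
        raise ValueError("queries_per_month must be non-negative")
    return query_cost_usd(tb_scanned_per_query) * queries_per_month


if __name__ == "__main__":
    csv_tb = 1.0      # a CSV query has to read the whole 1 TB object
    parquet_tb = 0.1  # HYPOTHETICAL: Parquet query reads ~10% (compression + column pruning)
    queries = 200     # HYPOTHETICAL monthly query count

    print(f"CSV:     {monthly_cost_usd(csv_tb, queries):.2f} USD/month")
    print(f"Parquet: {monthly_cost_usd(parquet_tb, queries):.2f} USD/month")
```

Because the estimate depends only on bytes scanned, the same arithmetic applies whether the query runs in Athena or in Spectrum; the file format, not the engine, drives the difference.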





