Upstream

The Big Data Blog

Eran Levy

Director of Marketing at Upsolver

Recent Posts

Data Lake as a Service: Is There a GUI-based Data Lake?

Mar 1, 2020 2:11:00 PM / by Eran Levy posted in Big Data, Data Lake, ETL, SQL, AWS S3, Amazon S3, Apache Spark

 

Recent surveys have shown that the data lake market is expected to grow to $20.1 billion by 2024, with a growing number of organizations looking to deploy a data lake in the coming years. However, despite growing interest in big data initiatives, many organizations run into a roadblock: building a data lake is a complex, manual process that requires skilled personnel who are in short supply.

Read More

Custom Partitioning for Embedded Analytics with Athena

Feb 13, 2020 7:18:50 PM / by Eran Levy posted in Amazon Athena, Partitioning, Amazon S3, Glue Data Catalog

 

Read More

Data Architecture for AWS Athena: 6 Examples to Learn From

Feb 5, 2020 4:43:18 PM / by Eran Levy posted in Amazon Athena, Data Engineering, Data Lake ETL, Amazon Redshift, Amazon Kinesis, Amazon S3, Apache Parquet

 

Read More

Apache Spark Limitations & the Self-service Alternative

Jan 23, 2020 3:41:15 PM / by Eran Levy posted in Data Engineering, Apache Spark, DevOps

 

Read More

14 Best Data Engineering Podcasts, Blogs and Websites

Jan 16, 2020 11:38:51 AM / by Eran Levy posted in Apache Kafka, Data Engineering, Blogs, Podcasts, Netflix, DevOps, NoSQL, Uber

 

Read More

Upsolver Lookup Tables: A Decoupled Alternative to Cassandra

Jan 9, 2020 3:16:54 PM / by Eran Levy posted in Streaming Data, Lookup Tables, Apache Cassandra, Schema Discovery

 

Read More

In 2020, the Data Lake is Ripe for Self-service

Jan 2, 2020 3:58:08 PM / by Eran Levy posted in Data Engineering, ETL, IoT, Data Lake ETL, Machine learning, Apache Spark

Read More

7 Best Practices for High-performance Data Lakes

Dec 19, 2019 2:43:31 PM / by Eran Levy posted in Big Data, Industry Trends, Data Lake ETL, Streaming processing

 

Read More

5 Exciting Redshift and Athena Announcements from re:Invent 2019

Dec 12, 2019 2:14:18 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Amazon Redshift, re:Invent

 

Read More

Benchmarking AWS Athena vs BigQuery: Performance, Price, Data Freshness

Dec 6, 2019 1:48:36 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Google BigQuery

With the cloud wars heating up, Google and AWS tout two directly competing serverless query tools: Amazon Athena, an interactive query service that runs over Amazon S3, and Google BigQuery, a high-performance, decoupled database. In this document we will take a closer look at these two services and compare their real-world performance when executing a series of SQL queries against the same dataset.

Read More