
Upstream

The Big Data Blog

In 2020, the Data Lake is Ripe for Self-service

Jan 2, 2020 3:58:08 PM / by Eran Levy posted in Data Engineering, ETL, IoT, Data Lake ETL, Machine learning, Apache Spark


7 Best Practices for High-performance Data Lakes

Dec 19, 2019 2:43:31 PM / by Eran Levy posted in Big Data, Industry Trends, Data Lake ETL, Streaming processing

5 Exciting Redshift and Athena Announcements from re:Invent 2019

Dec 12, 2019 2:14:18 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Amazon Redshift, re:Invent

Benchmarking AWS Athena vs BigQuery: Performance, Price, Data Freshness

Dec 6, 2019 1:48:36 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Google BigQuery

With the cloud wars heating up, Google and AWS tout two directly competing serverless querying tools: Amazon Athena, an interactive query service that runs over Amazon S3, and Google BigQuery, a high-performance, decoupled database. In this document we will take a closer look at these two services and compare their real-world performance executing a series of SQL queries against the same dataset.


Intro to AWS Data Lakes: Components & Architecture

Nov 29, 2019 11:49:25 AM / by Eran Levy posted in Data Lake, Amazon Athena, Amazon Web Services, Amazon S3

The following article is based on a presentation given by Roy Hasson, Senior Business Development Manager at Amazon Web Services, as part of our recent joint webinar - Frictionless Data Lake ETL for Petabyte-scale Streaming Data. You can watch the full presentation and webinar for free right here.


Using Schema Discovery to Explore Kafka/Kinesis Streams

Nov 28, 2019 3:55:55 PM / by Eran Levy posted in Apache Kafka, Data Lake ETL, Amazon Kinesis, Schema

Streaming Data on AWS: Tools and Resources You Need to Know

Nov 25, 2019 4:43:42 PM / by Eran Levy posted in Big Data, Database, Amazon Athena, Data Lake ETL, AWS S3

ETL Your Kinesis Data to AWS Athena in Minutes with UpSQL

Nov 20, 2019 4:00:26 PM / by Roy Hegdish posted in Big Data, Data Lake, Amazon Athena, AWS S3, UpSQL

7 Guidelines for Ingesting Big Data to Data Lakes

Nov 14, 2019 3:10:48 PM / by Eran Levy posted in Big Data, Data Lake, S3, Data ingestion

Joining Streams and Big Tables on S3: NoSQL vs UpSQL vs Spark

Nov 13, 2019 3:09:14 PM / by Eran Levy posted in Big Data, Database, Data Lake ETL, Click Streams, UpSQL, Amazon S3