
Upstream

The Big Data Blog

Eran Levy

Director of Marketing at Upsolver
Recent Posts

14 Best Data Engineering Podcasts, Blogs and Websites

Jan 16, 2020 11:38:51 AM / by Eran Levy posted in Apache Kafka, Data Engineering, Blogs, Podcasts, Netflix, DevOps, NoSQL, Uber

 

Upsolver Lookup Tables: A Decoupled Alternative to Cassandra

Jan 9, 2020 3:16:54 PM / by Eran Levy posted in Streaming Data, Lookup Tables, Apache Cassandra, Schema Discovery

 

In 2020, the Data Lake is Ripe for Self-service

Jan 2, 2020 3:58:08 PM / by Eran Levy posted in Data Engineering, ETL, IoT, Data Lake ETL, Machine learning, Apache Spark

7 Best Practices for High-performance Data Lakes

Dec 19, 2019 2:43:31 PM / by Eran Levy posted in Big Data, Industry Trends, Data Lake ETL, Streaming processing

 

5 Exciting Redshift and Athena Announcements from re:Invent 2019

Dec 12, 2019 2:14:18 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Amazon Redshift, re:Invent

 

Benchmarking AWS Athena vs BigQuery: Performance, Price, Data Freshness

Dec 6, 2019 1:48:36 PM / by Eran Levy posted in Amazon Athena, Data Lake ETL, Google BigQuery

With the cloud wars heating up, Google and AWS tout two directly competing serverless querying tools: Amazon Athena, an interactive query service that runs over Amazon S3, and Google BigQuery, a high-performance, decoupled database. In this document we take a closer look at these two services and compare their real-world performance when executing a series of SQL queries against the same dataset.

Intro to AWS Data Lakes: Components & Architecture

Nov 29, 2019 11:49:25 AM / by Eran Levy posted in Data Lake, Amazon Athena, Amazon Web Services, Amazon S3

The following article is based on a presentation given by Roy Hasson, Senior Business Development Manager at Amazon Web Services, as part of our recent joint webinar - Frictionless Data Lake ETL for Petabyte-scale Streaming Data. You can watch the full presentation and webinar for free right here.

Using Schema Discovery to Explore Kafka/Kinesis Streams

Nov 28, 2019 3:55:55 PM / by Eran Levy posted in Apache Kafka, Data Lake ETL, Amazon Kinesis, Schema

 

Streaming Data on AWS: Tools and Resources You Need to Know

Nov 25, 2019 4:43:42 PM / by Eran Levy posted in Big Data, Database, Amazon Athena, Data Lake ETL, AWS S3

 

7 Guidelines for Ingesting Big Data to Data Lakes

Nov 14, 2019 3:10:48 PM / by Eran Levy posted in Big Data, Data Lake, S3, Data ingestion

 
