Upstream

The Big Data Blog

Upsolver Announces SQL-based ETL (Extract, Transform, Load) for Cloud Data Lakes to Democratize Big Data

Oct 16, 2019 4:24:55 PM / by Upsolver Team posted in Big Data, ETL, SQL, Data Lake ETL, Machine learning, Amazon Web Services

 

6 Data Preparation Tips for Querying Big Data in AWS Athena

Oct 2, 2019 4:44:22 PM / by Upsolver Team posted in Big Data, Amazon Athena, ETL, AWS S3

 

Comparing Stream Processors: Apache Kafka vs Amazon Kinesis

Sep 25, 2019 6:18:15 PM / by Upsolver Team posted in Apache Kafka, AWS S3, Streaming processing, Amazon Kinesis

 

11 Alternatives to Alooma on AWS for Streaming and App Data

Sep 18, 2019 3:37:58 PM / by Eran Levy

 

4 Key Components of a Streaming Data Architecture (with Examples)

Sep 11, 2019 3:42:06 PM / by Eran Levy

Streaming data is becoming a core component of enterprise data architecture due to the explosive growth of data from non-traditional sources such as IoT sensors, security logs and web applications.

 

Streaming technologies are not new, but they have matured considerably in recent years. The industry is moving from painstaking integration of open-source Spark/Hadoop frameworks towards full-stack solutions that provide an end-to-end streaming data architecture built on the scalability of cloud data lakes.

 

Want to see how leading organizations design their big data infrastructure? Check out these 4 real-life examples of streaming architectures.

Batch, Stream, and Micro-batch Processing: A Cheat Sheet

Sep 5, 2019 4:48:25 PM / by Eran Levy posted in Batch processing, Streaming processing, Batch ETL, Amazon Redshift, Google BigQuery

 

Data Lake ETL for IoT Data: From Streams to Analytics

Aug 29, 2019 4:48:45 PM / by Eran Levy posted in Database, Event Streams, IoT, SQL, Data Lake ETL

 

Orchestrating Streaming and Batch ETL for Machine Learning

Aug 20, 2019 4:38:56 PM / by Eran Levy posted in Schemaless, ETL, Event Streams, Data Lake ETL, Data infrastructure, User personalization

 

Partitioning Data on S3 to Improve Performance in Athena/Presto

Aug 13, 2019 2:04:33 PM / by Eran Levy posted in Data Lake Platform, Amazon Athena, Data Lake ETL, AWS S3, Partitioning

 

Getting Data Lake ETL Right: 6 Guidelines for Evaluating Tools

Aug 6, 2019 2:52:11 PM / by Eran Levy posted in Database, Data Lake Platform, SQL, Data Lake ETL, Data infrastructure

 
