The Big Data Blog

Solving the Upserts Challenge in Data Lakes

4 Guiding Principles for Modern Data Lake Architecture

Data Lake as a Service: Is There a GUI-based Data Lake?

How (and Why) to Analyze CloudWatch Logs In AWS Athena

Protecting PII & Sensitive Data on S3 with Tokenization

Custom Partitioning for Embedded Analytics with Athena

Data Architecture for AWS Athena: 6 Examples to Learn From

Apache Spark Limitations & the Self-service Alternative

14 Best Data Engineering Podcasts, Blogs and Websites

Upsolver Lookup Tables: A Decoupled Alternative to Cassandra

In 2020, the Data Lake is Ripe for Self-service

7 Best Practices for High-performance Data Lakes

5 Exciting Redshift and Athena Announcements from re:Invent 2019

Benchmarking AWS Athena vs BigQuery: Performance, Price, Data Freshness

Intro to AWS Data Lakes: Components & Architecture

Using Schema Discovery to Explore Kafka/Kinesis Streams

Streaming Data on AWS: Tools and Resources You Need to Know

ETL Your Kinesis Data to AWS Athena in Minutes with UpSQL

7 Guidelines for Ingesting Big Data to Data Lakes

Joining Streams and Big Tables on S3: NoSQL vs UpSQL vs Spark

Top Spark Alternatives by Use Case: ETL, ML, Data Discovery, BI

Athena or Redshift? 4 Questions to Decide

Joining Impression and Click Streams in Minutes Using UpSQL

Upsolver Announces SQL-based ETL (Extract, Transform, Load) for Cloud Data Lakes to Democratize Big Data

6 Data Preparation Tips for Querying Big Data in AWS Athena

Comparing Stream Processors: Apache Kafka vs Amazon Kinesis

11 Alternatives to Alooma on AWS for Streaming and App Data

4 Key Components of a Streaming Data Architecture (with Examples)

Batch, Stream, and Micro-batch Processing: A Cheat Sheet

Data Lake ETL for IoT Data: From Streams to Analytics

Orchestrating Streaming and Batch ETL for Machine Learning

Partitioning Data on S3 to Improve Performance in Athena/Presto

Getting Data Lake ETL Right: 6 Guidelines for Evaluating Tools

Real-time Machine Learning: Hype vs Reality

Databricks Delta Lake vs Data Lake ETL: Overview and Comparison

Introducing Upsolver Notebooks: Data Enrichment with SQL

IoT Analytics: Challenges, Applications, and Innovations

Upsolver Receives 2019 Rising Star and Premium Usability Awards from Finances Online

A Data Lake Approach to Event Stream Analytics

ETL Pipelines for Kafka Data: Choosing the Right Approach

4 Challenges of Using Databases for Streaming Data (and a Solution)

Big Data Infrastructure: When to Build, When to Buy

Kafka vs. RabbitMQ: Architecture, Performance & Use Cases

How to Improve AWS Athena Performance: The Complete Guide

7 Popular Stream Processing Frameworks Compared

Cloud Data Lake vs On-Premises Data Lake: What You Need to Know

3 Steps To Reduce Your Elasticsearch Costs By 90 - 99%

Integrate Upsolver with Git for Carefree Change Management and Review

Top 7 Trends in Streaming Data for 2019

What’s New in Upsolver - December 2018 Edition

Problems with Small Files on HDFS / S3? Make Them Bigger

Upsolver Data Security: An Overview

4 Examples of Data Lake Architectures on Amazon S3

Apache Kafka with and without a Data Lake

Understanding Data Lakes and Data Lake Platforms

Let's face it: Big Data sucks.

5 Signs You've Outgrown Your Data Warehouse

Spark Streaming VS Upsolver on Twitter Data