Tag: delta lake

Guide to Migrating from Databricks Delta Lake to Apache Iceberg

Big Data March 28, 2024

In the fast-changing world of big data processing and analytics, the management of extensive datasets serves as a foundational pillar for companies...

Exploring real-time streaming for generative AI Applications | Amazon Web Services

Big Data March 25, 2024

Use Amazon Athena with Spark SQL for your open-source transactional table formats | Amazon Web Services

Big Data January 24, 2024

AWS Lake Formation 2023 year in review | Amazon Web Services

Big Data January 18, 2024

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation | Amazon Web Services

Big Data January 17, 2024

With Amazon EMR 6.15, we launched AWS Lake Formation based fine-grained access controls (FGAC) on Open Table Formats (OTFs), including Apache Hudi, Apache Iceberg,...

Introducing Apache Hudi support with AWS Glue crawlers | Amazon Web Services

Big Data November 22, 2023

Apache Hudi is an open table format that brings database and data warehouse capabilities to data lakes. Apache Hudi helps data engineers manage complex challenges, such as...

Spark on AWS Lambda: An Apache Spark runtime for AWS Lambda | Amazon Web Services

Big Data October 30, 2023

Spark on AWS Lambda (SoAL) is a framework that runs Apache Spark workloads on AWS Lambda. It’s designed for both batch and event-based workloads,...

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi | Amazon Web Services

Big Data September 13, 2023

The Analytics specialty practice of AWS Professional Services (AWS ProServe) helps customers across the globe with modern data architecture implementations on the AWS Cloud....

Modern Data Engineering with MAGE: Empowering Efficient Data Processing

Big Data June 20, 2023

In today’s data-driven world, organizations across industries are dealing with massive volumes of data, complex pipelines, and the need for efficient data processing. Traditional...

Testing and Monitoring Data Pipelines: Part One – DATAVERSITY

Big Data May 26, 2023

Suppose you’re in charge of maintaining a large set of data pipelines from cloud storage or streaming data into a data warehouse. How can...

Data Warehouse vs. Data Lakehouse – DATAVERSITY

Big Data May 23, 2023

The phrase “data warehouse vs. data lakehouse” offers an exciting topic for ongoing debate in the global Data Management world. While businesses have relied on traditional data warehouses...

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Big Data March 28, 2023

In a data warehouse, a dimension is a structure that categorizes facts and measures in order to enable users to answer business questions. To...

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 2: AWS Glue Studio Visual Editor

Big Data March 20, 2023

In the first post of this series, we described how AWS Glue for Apache Spark works with Apache Hudi, Linux Foundation Delta Lake, and...

Latest Intelligence

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started

Big Data January 26, 2023

Generative Data Intelligence

Delta Lake: A Comprehensive Introduction

Top analytics announcements of AWS re:Invent 2022

Introducing native Delta Lake table support with AWS Glue crawlers

Delta Lake in Action – Quick Hands-on Tutorial for Beginners

Basic Tenets of Delta Lake

How a Delta Lake is Process with Azure Synapse Analytics