Tag: AWS Database Migration Service

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Big DataFebruary 14, 2023

Organizations have chosen to build data lakes on top of Amazon Simple Storage Service (Amazon S3) for many years. A data lake is the...

Automate schema evolution at scale with Apache Hudi in AWS Glue

Big DataFebruary 7, 2023

In the data analytics space, organizations often deal with many tables in different databases and file formats to hold data for different business functions....

Handle UPSERT data operations using open-source Delta Lake and AWS Glue

Big DataJanuary 30, 2023

Many customers need an ACID transaction (atomic, consistent, isolated, durable) data lake that can log change data capture (CDC) from operational data sources. There...

How SikSin improved customer engagement with AWS Data Lab and Amazon Personalize

AIJanuary 25, 2023

This post is co-written with Byungjun Choi and Sangha Yang from SikSin. SikSin is a technology platform connecting customers with restaurant partners serving their...

Migrate Google BigQuery to Amazon Redshift using AWS Schema Conversion tool (SCT)

Big DataDecember 14, 2022

Amazon Redshift is a fast, fully-managed, petabyte scale data warehouse that provides the flexibility to use provisioned or serverless compute for your analytical workloads....

Convert Oracle XML BLOB data to JSON using Amazon EMR and load to Amazon Redshift

Big DataAugust 29, 2022

In legacy relational database management systems, data is stored in several complex data types, such XML, JSON, BLOB, or CLOB. This data might contain...

Accelerate your data warehouse migration to Amazon Redshift – Part 5

Big DataMarch 21, 2022

This is the fifth in a series of posts. We’re excited to share dozens of new features to automate your schema conversion; preserve your investment in existing scripts, reports, and applications; accelerate query performance; and potentially simplify your migrations from legacy data warehouses to Amazon Redshift. Check out the all the posts in this series: […]

How the Georgia Data Analytics Center built a cloud analytics solution from scratch with the AWS Data Lab

Big DataMarch 2, 2022

This is a guest post by Kanti Chalasani, Division Director at Georgia Data Analytics Center (GDAC). GDAC is housed within the Georgia Office of Planning and Budget to facilitate governed data sharing between various state agencies and departments. The Office of Planning and Budget (OPB) established the Georgia Data Analytics Center (GDAC) with the intent […]

Create a low-latency source-to-data lake pipeline using Amazon MSK Connect, Apache Flink, and Apache Hudi

Big DataMarch 1, 2022

During the recent years, there has been a shift from monolithic to the microservices architecture. The microservices architecture makes applications easier to scale and quicker to develop, enabling innovation and accelerating time to market for new features. However, this approach causes data to live in different silos, which makes it difficult to perform analytics. To […]

12Page 2 of 2

Generative Data Intelligence