Tag: AWS Glue

Run Trino queries 2.7 times faster with Amazon EMR 6.15.0 | Amazon Web Services

Trino is an open source distributed SQL query engine designed for interactive analytic workloads. On AWS, you can run Trino on Amazon EMR, where...

Scale AWS Glue jobs by optimizing IP address consumption and expanding network capacity using a private NAT gateway | Amazon Web Services

As businesses expand, the demand for IP addresses within the corporate network often exceeds the supply. An organization’s network is often designed with some...

Gain insights from historical location data using Amazon Location Service and AWS analytics services | Amazon Web Services

Many organizations around the world rely on the use of physical assets, such as vehicles, to deliver a service to their end-customers. By tracking...

Build a RAG data ingestion pipeline for large-scale ML workloads | Amazon Web Services

For building any generative AI application, enriching the large language models (LLMs) with new data is imperative. This is where the Retrieval Augmented Generation...

Measure performance of AWS Glue Data Quality for ETL pipelines | Amazon Web Services

In recent years, data lakes have become a mainstream architecture, and data quality validation is a critical factor to improve the reusability and consistency...

Run an audience overlap analysis in AWS Clean Rooms | Amazon Web Services

Advertisers, publishers, and advertising technology providers are actively seeking efficient ways to collaborate with their partners to generate insights about their collective datasets. One...

Implementing Near-Real-Time Analytics with Amazon Redshift Streaming Ingestion and Amazon MSK: Best Practices from Amazon Web Services

Amazon Web Services (AWS) offers a wide range of services for data analytics, including Amazon Redshift and Amazon Managed Streaming for Apache Kafka (MSK)....

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion | Amazon Web Services

Organizations often need to manage a high volume of data that is growing at an extraordinary rate. At the same time, they need to...

Build a pseudonymization service on AWS to protect sensitive data: Part 2 | Amazon Web Services

Part 1 of this two-part series described how to build a pseudonymization service that converts plain text data attributes into a pseudonym or vice...

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg | Amazon Web Services

As enterprises collect increasing amounts of data from various sources, the structure and organization of that data often need to change over time to...

How BMO improved data security with Amazon Redshift and AWS Lake Formation | Amazon Web Services

This post is cowritten with Amy Tseng, Jack Lin and Regis Chow from BMO. BMO is the...

Data governance in the age of generative AI | Amazon Web Services

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach....
