Zephyrnet Logo

Tag: Amazon EMR

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation | Amazon Web Services

With Amazon EMR 6.15, we launched AWS Lake Formation based fine-grained access controls (FGAC) on Open Table Formats (OTFs), including Apache Hudi, Apache Iceberg,...

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service | Amazon Web Services

Many organizations, small and large, are working to migrate and modernize their analytics workloads on Amazon Web Services (AWS). There are many reasons for...

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1 | Amazon Web Services

We’re living in the age of real-time data and insights, driven by low-latency data streaming applications. Today, everyone expects a personalized experience in any...

Build efficient ETL pipelines with AWS Step Functions distributed map and redrive feature | Amazon Web Services

AWS Step Functions is a fully managed visual workflow service that enables you to build complex data processing pipelines involving a diverse set of...

Experience the new and improved Amazon SageMaker Studio | Amazon Web Services

Launched in 2019, Amazon SageMaker Studio provides one place for all end-to-end machine learning (ML) workflows, from data preparation, building and experimentation, training, hosting, and...

Announcing zero-ETL integrations with AWS Databases and Amazon Redshift | Amazon Web Services

As customers become more data driven and use data as a source of competitive advantage, they want to easily run analytics on their data...

Your guide to AWS Analytics at AWS re:Invent 2023 | Amazon Web Services

Join the AWS Analytics team at AWS re:Invent this year, where new ideas and exciting innovations come together. For those in the data...

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark | Amazon Web Services

This post is co-written with Preshen Goobiah and Johan Olivier from Capitec. Apache Spark is a widely-used open source distributed processing system renowned for...

Introducing Amazon MWAA support for Apache Airflow version 2.7.2 and deferrable operators | Amazon Web Services

Amazon Managed Workflow for Apache Airflow (Amazon MWAA) is a managed service that allows you to use a familiar Apache Airflow environment with improved...

Use IAM runtime roles with Amazon EMR Studio Workspaces and AWS Lake Formation for cross-account fine-grained access control | Amazon Web Services

Amazon EMR Studio is an integrated development environment (IDE) that makes it straightforward for data scientists and data engineers to develop, visualize, and debug...

GoDaddy benchmarking results in up to 24% better price-performance for their Spark workloads with AWS Graviton2 on Amazon EMR Serverless | Amazon Web Services

This is a guest post co-written with Mukul Sharma, Software Development Engineer, and Ozcan IIikhan, Director of Engineering from GoDaddy. GoDaddy empowers everyday entrepreneurs...

Spark on AWS Lambda: An Apache Spark runtime for AWS Lambda | Amazon Web Services

Spark on AWS Lambda (SoAL) is a framework that runs Apache Spark workloads on AWS Lambda. It’s designed for both batch and event-based workloads,...

Latest Intelligence

spot_img
spot_img