Tag: hdfs

Dive deep into security management: The Data on EKS Platform | Amazon Web Services

Big Data April 29, 2024

The construction of big data applications based on open source software has become increasingly uncomplicated since the advent of projects like Data on EKS,...

Automate large-scale data validation using Amazon EMR and Apache Griffin | Amazon Web Services

Big Data April 4, 2024

Exploring real-time streaming for generative AI Applications | Amazon Web Services

Big Data March 25, 2024

20 Technologies in Data Science for Professionals

AI February 5, 2024

Preprocess and fine-tune LLMs quickly and cost-effectively using Amazon EMR Serverless and Amazon SageMaker | Amazon Web Services

AIFebruary 1, 2024

Large language models (LLMs) are becoming increasing popular, with new use cases constantly being explored. In general, you can build applications powered by LLMs...

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation | Amazon Web Services

Big DataJanuary 17, 2024

With Amazon EMR 6.15, we launched AWS Lake Formation based fine-grained access controls (FGAC) on Open Table Formats (OTFs), including Apache Hudi, Apache Iceberg,...

Top 20 Data Engineering Project Ideas [With Source Code]

Big DataSeptember 20, 2023

Data engineering plays a pivotal role in the vast data ecosystem by collecting, transforming, and delivering data essential for analytics, reporting, and machine learning....

Query your Apache Hive metastore with AWS Lake Formation permissions | Amazon Web Services

Big DataJuly 20, 2023

Apache Hive is a SQL-based data warehouse system for processing highly distributed datasets on the Apache Hadoop platform. There are two key components to...

Get started managing partitions for Amazon S3 tables backed by the AWS Glue Data Catalog | Amazon Web Services

Big DataJune 22, 2023

Large organizations processing huge volumes of data usually store it in Amazon Simple Storage Service (Amazon S3) and query the data to make data-driven...

10 Best Data Analytics Projects

Big DataMay 21, 2023

Introduction Not a single day passes without us getting to hear the word “data.” It is almost as if our lives revolve around it. Don’t...

How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMR | Amazon Web Services

Big DataMay 16, 2023

In today’s digital age, logging is a critical aspect of application development and management, but efficiently managing logs while complying with data protection regulations...

12 Page 1 of 2

Generative Data Intelligence

Tag: hdfs

Dive deep into security management: The Data on EKS Platform | Amazon Web Services

Top News

Automate large-scale data validation using Amazon EMR and Apache Griffin | Amazon Web Services

Exploring real-time streaming for generative AI Applications | Amazon Web Services

20 Technologies in Data Science for Professionals

Latest Intelligence

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

Unlock the True Potential of Your Data with ETL and ELT Pipeline

Build a data lake with Apache Flink on Amazon EMR

Scaling Data Management Through Apache Gobblin

Build your Apache Hudi data lake on AWS using Amazon EMR – Part 1

How GoDaddy built a data mesh to decentralize data ownership using AWS Lake Formation

Understanding the Concepts of Teradata