DEV Community

# dataengineering

Posts

Why Data Partitioning Is Harder Than It Looks

1
Comments
2 min read
Part 2: Snowflake's Autonomous Future

Comments
8 min read
Collecting Africa’s Energy Insights:

3
Comments
4 min read
Containerization for Data Engineering: A Practical Guide with Docker and Docker Compose

Comments
5 min read
Making JSON Compression Searchable — SEE (Schema-Aware Encoding)

1
Comments
2 min read
Apache Iceberg Dev List Digest (Sept 15–19, 2025)

Comments
3 min read
Data Engineering with Docker: A Hands-On Guide to Containerization

7
Comments 2
3 min read
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks

3
Comments 1
10 min read
Understanding the Basics of Linux Operating System

Comments
1 min read
Why you need to learn Apache Airflow - right now

Comments
3 min read
Building a True Dual-Destination Analytics Pipeline: Real-Time Streaming with S3 Backup and Recovery

1
Comments
8 min read
Apache Kafka Deep Dive: Concepts, Applications, and Production

Comments
4 min read
A Dive into Apache Iceberg™'s Metadata

Comments
4 min read
Building an Automated YouTube Analytics Dashboard with Airflow, PySpark, MinIO, PostgreSQL & Grafana

7
Comments
5 min read
Composable Analytics with Agents: Leveraging Virtual Datasets and the Semantic Layer

1
Comments
3 min read