DEV Community

# dataengineering

Posts

How I built a 39x compression pipeline with AES-256-GCM in Python (and why the dictionary is everything)

1
Comments
2 min read
I Built a Corrupt Archive Recovery Engine in Rust — Because Every Tool I Tried Just Gave Up

1
Comments
4 min read
I Analyzed 1 Million dev.to Articles (2022–2026): Here’s What the Data Reveals

1
Comments
4 min read
SQL Window Functions Don't Have to Be Scary 🪟

1
Comments
3 min read
Delivery ETA Prediction Without the “AI Hype”: Sensible Features, Model Evaluation, and How to Avoid Bias

Comments
8 min read
From Clicks to Data: The Invisible Journey of Your Data

2
Comments
6 min read
DAY 9 - Recommendation System

Comments
2 min read
Data Engineers: What If Your BigQuery Function Could Return Multiple Tables?

1
Comments
2 min read
Batch Processing with Apache Spark

Comments
1 min read
A Beginner's Guide to SQL Joins and Window Functions

1
Comments
6 min read
How to Test Data Pipelines Effectively

Comments
2 min read
My Non-Fiction Library: Books on Data Lakehouses, Apache Iceberg, AI, and Beyond

Comments
9 min read
How Data Analysts Transform Messy Data with DAX in Power BI

Comments
3 min read
DAY 7 - MLflow Tracking

Comments
1 min read
How We Generate AI Network Digests for MegaETH at MiniBlocks.io

1
Comments
8 min read