DEV Community
#dataengineering
Posts
What is the difference between ETL and ETL?
Cliffe Okoth
Apr 10
#architecture #data #dataengineering #learning
9 min read
dbt snapshots: moving from merges to native history
Philip Hern
Apr 10
#dbt #dataengineering #snowflake #snapshots
1 reaction
5 min read
PySpark to Pandas/scikit-learn: A Practical Migration Guide for Data Engineers Learning ML
Nyson Markus
Apr 10
#dataengineering #datascience #machinelearning #python
7 min read
Apache Parquet File Anatomy: Row Groups, Column Chunks, Pages, and Metadata Explained 🧱📦
Kumaravelu Saraboji Mahalingam
Apr 10
#dataengineering #apacheparquet #iceberg #analytics
8 min read
🚀 DB Explorer 3.0.1 — The AI‑First SQL Editor You’ll Want to Try
Ashish Srivastava
Apr 10
#sql #database #postgres #dataengineering
1 min read
My first data pipeline
Ajay M
Apr 10
#showdev #beginners #dataengineering #sideprojects
1 min read
ETL vs ELT: Which One Should You Use and Why?
John Wakaba
Apr 10
#architecture #beginners #data #dataengineering
1 reaction
6 min read
Entity Resolution at Scale: Matching Products Across Amazon, Reddit, and RTINGS
Daniel Rozin
Apr 10
#ai #webdev #dataengineering #tutorial
4 min read
Apache Data Lakehouse Weekly: April 3–9, 2026
Alex Merced
Apr 9
#news #data #dataengineering #opensource
7 min read
AWS Lake Formation: Why Your Data Lake Permissions Are Probably a Mess (And How to Fix That)
Soumyadeep Basu
Apr 9
#dataengineering #awsdatalake #aws
3 min read
ETL VS ELT: WHICH ONE SHOULD YOU USE AND WHY?
Wangeci Ndovu
Apr 10
#analytics #beginners #data #dataengineering
5 min read
Airflow vs Prefect vs Dagster: Picking the Right Orchestrator in 2026
DataStackX
Apr 9
#dataengineering #python #airflow #dagster
6 min read
Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know
Lawrence Murithi
Apr 9
#sql #luxdev #dataengineering
6 min read
Your Customer Table Has Duplicates You Can't See With SQL: How I Built a Cross-Platform Identity Resolution Layer for a Dark Kitchen Data Platform
SARAN TEJA MALLELA
Apr 9
#dataengineering #apachespark #kafka #deltalake
3 reactions
8 min read
How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C
NARESH-CN2
Apr 9
#python #performance #dataengineering #datascience
2 min read