Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
How to Get Filtered Amazon Reviews into a Pandas DataFrame in Under 50 Lines of Python
BAH123
BAH123
BAH123
Follow
Nov 16 '25
How to Get Filtered Amazon Reviews into a Pandas DataFrame in Under 50 Lines of Python
#
python
#
scraper
#
dataengineering
Comments
Add Comment
3 min read
Comparing CsvPath and SodaCL
David Kershaw
David Kershaw
David Kershaw
Follow
Nov 28 '25
Comparing CsvPath and SodaCL
#
data
#
dataengineering
#
sql
#
csv
Comments
Add Comment
4 min read
Star vs. Snowflake Schema
Wangare
Wangare
Wangare
Follow
Nov 14 '25
Star vs. Snowflake Schema
#
architecture
#
database
#
dataengineering
Comments
Add Comment
4 min read
The Bear Awakens: From Pure Speed to Massive Endurance (640 Million Rows Tested)
Alberto Cardenas
Alberto Cardenas
Alberto Cardenas
Follow
Dec 18 '25
The Bear Awakens: From Pure Speed to Massive Endurance (640 Million Rows Tested)
#
showdev
#
testing
#
dataengineering
#
performance
Comments
Add Comment
16 min read
Data Engineer — Người Kiến Tạo “Dòng Chảy Dữ Liệu” Trong Kỷ Nguyên Số
peter nguyen
peter nguyen
peter nguyen
Follow
Nov 14 '25
Data Engineer — Người Kiến Tạo “Dòng Chảy Dữ Liệu” Trong Kỷ Nguyên Số
#
architecture
#
career
#
dataengineering
Comments
Add Comment
2 min read
Sustainability in retail is a Software Problem Now
codecraft
codecraft
codecraft
Follow
Dec 19 '25
Sustainability in retail is a Software Problem Now
#
architecture
#
dataengineering
#
softwareengineering
Comments
Add Comment
2 min read
Join Data from Anywhere: The Streaming SQL Engine That Bridges Databases, APIs, and Files
Theodore P.
Theodore P.
Theodore P.
Follow
Dec 16 '25
Join Data from Anywhere: The Streaming SQL Engine That Bridges Databases, APIs, and Files
#
python
#
database
#
dataengineering
#
sql
8
reactions
Comments
1
comment
17 min read
Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study
Rose Wabere
Rose Wabere
Rose Wabere
Follow
Nov 13 '25
Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study
#
spark
#
pyspark
#
grafana
#
dataengineering
Comments
Add Comment
5 min read
Part 1: Database Concepts & Architecture
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 18 '25
Part 1: Database Concepts & Architecture
#
architecture
#
database
#
dataengineering
Comments
Add Comment
14 min read
AWS Glue ETL Jobs: Transform Your Data at Scale
Oteng Isaac
Oteng Isaac
Oteng Isaac
Follow
for
AWS Community Builders
Dec 7 '25
AWS Glue ETL Jobs: Transform Your Data at Scale
#
aws
#
dataengineering
#
etl
#
awsbigdata
1
reaction
Comments
Add Comment
4 min read
Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Nov 14 '25
Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine
#
opensource
#
apacheseatunnel
#
bigdata
#
dataengineering
Comments
Add Comment
4 min read
I Built an ETL Pipeline That Actually Thinks & And Cut Token Costs by 52% (And Here's What I Learned)
Seenivasa Ramadurai
Seenivasa Ramadurai
Seenivasa Ramadurai
Follow
Dec 17 '25
I Built an ETL Pipeline That Actually Thinks & And Cut Token Costs by 52% (And Here's What I Learned)
#
ai
#
dataengineering
#
performance
#
llm
1
reaction
Comments
Add Comment
17 min read
Beyond SQL: Solving Data Warehouse Performance Bottlenecks with Smart Algorithms, Not Just Bigger Clusters
Judy
Judy
Judy
Follow
Dec 17 '25
Beyond SQL: Solving Data Warehouse Performance Bottlenecks with Smart Algorithms, Not Just Bigger Clusters
#
algorithms
#
database
#
dataengineering
#
performance
5
reactions
Comments
Add Comment
13 min read
From Pandas to Upstream Control: The Evolution PyData Needs Next
David Aronchick
David Aronchick
David Aronchick
Follow
Nov 12 '25
From Pandas to Upstream Control: The Evolution PyData Needs Next
#
dataengineering
#
python
#
distributedsystems
#
machinelearning
Comments
Add Comment
6 min read
Statistics Day 2: Correlation Isn’t Causation — Here’s Why It Matters!
Chanchal Singh
Chanchal Singh
Chanchal Singh
Follow
Nov 13 '25
Statistics Day 2: Correlation Isn’t Causation — Here’s Why It Matters!
#
statistics
#
datascience
#
machinelearning
#
dataengineering
5
reactions
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account