DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

6
Comments
7 min read
SQL: is there a better way to code this?

SQL: is there a better way to code this?

Comments 1
1 min read
Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC

Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC

Comments
5 min read
How to Convert Excel to CSV in Python using Spire.XLS for Python

How to Convert Excel to CSV in Python using Spire.XLS for Python

Comments
4 min read
Building a Sales Database in PostgreSQL — Schema, Data & JOIN Examples

Building a Sales Database in PostgreSQL — Schema, Data & JOIN Examples

4
Comments
6 min read
Building Self-Healing, Reliable Data Pipelines That Think

Building Self-Healing, Reliable Data Pipelines That Think

Comments 1
4 min read
Interesting links - September 2025

Interesting links - September 2025

Comments
13 min read
Beyond the Browser: Crafting a Robust Web Scraping Pipeline for Dynamic Sports Data

Beyond the Browser: Crafting a Robust Web Scraping Pipeline for Dynamic Sports Data

Comments 1
3 min read
Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Comments
5 min read
10 Best Platforms to Learn Data Analytics in 2026

10 Best Platforms to Learn Data Analytics in 2026

1
Comments
4 min read
Apache Zookeeper: O coordenador de sistemas distribuĂ­dos

Apache Zookeeper: O coordenador de sistemas distribuĂ­dos

Comments
8 min read
Debezium: Capturando mudanças de dados em tempo real

Debezium: Capturando mudanças de dados em tempo real

Comments
3 min read
Change Data Capture (CDC): Capturando mudanças em tempo real

Change Data Capture (CDC): Capturando mudanças em tempo real

Comments
4 min read
Streams de Dados: Processamento de Informações em Tempo Real

Streams de Dados: Processamento de Informações em Tempo Real

Comments
3 min read
Designing Data-Intensive Applications — Chapter 1: Reliable, Scalable, and Maintainable Applications

Designing Data-Intensive Applications — Chapter 1: Reliable, Scalable, and Maintainable Applications

5
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.