DEV Community

# bigdata

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The Hidden Costs of Idle EMR Clusters (And How to Stop the Bleed)

The Hidden Costs of Idle EMR Clusters (And How to Stop the Bleed)

1
Comments
3 min read
Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained.

Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained.

Comments
3 min read
PySpark : The Big Brain of Data Processing

PySpark : The Big Brain of Data Processing

3
Comments
5 min read
Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained

Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained

Comments
5 min read
How to fuzzy-match 1M rows with dbt in under 10 minutes (2026 guide)

How to fuzzy-match 1M rows with dbt in under 10 minutes (2026 guide)

Comments
4 min read
🚀 Apache Spark Just Killed the Microbatch Barrier (And Why Flink Should Be Worried)

🚀 Apache Spark Just Killed the Microbatch Barrier (And Why Flink Should Be Worried)

1
Comments
3 min read
Data Modelling in Power BI: The Foundation Every Analyst Needs.

Data Modelling in Power BI: The Foundation Every Analyst Needs.

2
Comments
3 min read
Building a Transport Monitoring Dashboard with APIs 🚚📊

Building a Transport Monitoring Dashboard with APIs 🚚📊

1
Comments
7 min read
Apache Cloudberry 2.0: Rebuilding Storage for the Cloud-Native Era with PAX

Apache Cloudberry 2.0: Rebuilding Storage for the Cloud-Native Era with PAX

1
Comments
6 min read
The Human Blueprint of a Winning Scorecard

The Human Blueprint of a Winning Scorecard

1
Comments
5 min read
How to Choose Between Serverless and Dedicated Compute in Databricks

How to Choose Between Serverless and Dedicated Compute in Databricks

3
Comments
3 min read
Orchestrating Our Way Out of Chaos: How I Compared Airflow, Prefect, and Dagster (and Picked What to Ship)

Orchestrating Our Way Out of Chaos: How I Compared Airflow, Prefect, and Dagster (and Picked What to Ship)

2
Comments
6 min read
Part 3 | How Does Scheduling Actually “Start Running”?

Part 3 | How Does Scheduling Actually “Start Running”?

4
Comments
5 min read
How to Implement Data Modelling in Power BI

How to Implement Data Modelling in Power BI

2
Comments
2 min read
The future of Data Engineering in Databricks - From Pipelines to Intent

The future of Data Engineering in Databricks - From Pipelines to Intent

2
Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.