Navigate:
~$DBT0.4%

dbt: Data transformation using software engineering practices

SQL-based transformation framework for analytics data warehouses.

LIVE RANKINGS • 10:20 AM • STEADY
OVERALL
#189
22
DATA ENGINEERING
#7
2
30 DAY RANKING TREND
ovr#189
·Data#7
STARS
12.3K
FORKS
2.3K
7D STARS
+52
7D FORKS
+12
See Repo:
Share:

Learn more about dbt

dbt is a Python-based tool that compiles SQL select statements into executable transformations for data warehouses. It operates on a directed acyclic graph (DAG) model where each SQL query represents a node, and dbt automatically determines execution order based on declared dependencies between models. The tool supports multiple data warehouse backends and provides features for testing data quality, generating documentation, and version controlling transformation logic. Common use cases include building data marts, creating dimensional models, and establishing repeatable ELT (Extract, Load, Transform) pipelines in analytics workflows.

dbt

1

SQL-first approach

dbt uses standard SQL select statements as the primary interface for defining transformations, rather than requiring a proprietary language or visual interface. This allows analysts familiar with SQL to define complex data pipelines without learning additional frameworks.

2

Dependency management via DAG

dbt automatically constructs a directed acyclic graph from declared model relationships and determines the correct execution order. This eliminates manual scheduling logic and allows developers to reference upstream models using the ref() function rather than hardcoding table names.

3

Built-in testing and documentation

dbt includes native support for data quality tests and automatic documentation generation from model definitions and comments. These capabilities are integrated into the project structure and CI/CD workflows rather than requiring separate tools.


-- models/orders_summary.sql
-- Reference a staging model to build aggregated orders

select
    customer_id,
    count(*) as order_count,
    sum(order_total) as total_spent
from {{ ref('stg_orders') }}
group by customer_id

vv1.11.3

Fixed deadlock issues in concurrent batch execution and removed unnecessary deprecation warnings for Python models.

  • Avoid deadlock edgecases of concurrent microbatch/batch execution
  • Stop raising deprecation warnings for internal python model configs
  • Move click minimum to 8.3.0
vv1.10.19

Fixed catalog integration functionality to ensure proper initialization even when a manifest already exists.

  • Add addcatalogintegration call even if we have a pre-existing manifest
vv1.10.18

Introduced new configuration metadata methods and fixed a critical bug in the meta_require method implementation.

  • Implement config.metaget and config.metarequire
  • Adds omitted return statement to RuntimeConfigObject.meta_require method
  • Bump lower bound for dbt-common to 1.37.2

See how people are using dbt

Loading tweets...


[ EXPLORE MORE ]

Related Repositories

Discover similar tools and frameworks used by developers