About dbt artifacts
With every invocation, dbt generates and saves one or more artifacts. Several of these are JSONJSON (JavaScript Object Notation) is a minimal format for semi-structured data used to capture relationships between fields and values. files (semantic_manifest.json
, manifest.json
, catalog.json
, run_results.json
, and sources.json
) that are used to power:
They could also be used to:
- gain insights into your dbt Semantic Layer
- calculate project-level test coverage
- perform longitudinal analysis of run timing
- identify historical changes in tableIn simplest terms, a table is the direct storage of data in rows and columns. Think excel sheet with raw values in each of the cells. structure
- do much, much more
dbt has produced artifacts since the release of dbt-docs in v0.11.0. Starting in dbt v0.19.0, we are committing to a stable and sustainable way of versioning, documenting, and validating dbt artifacts.
When are artifacts produced? teamenterprise
Most dbt commands (and corresponding RPC methods) produce artifacts:
- semantic manifest: produced whenever your dbt project is parsed
- manifest: produced by commands that read and understand your project
- run results: produced by commands that run, compile, or catalog nodes in your DAG
- catalog: produced by
docs generate
- sources: produced by
source freshness
When running commands from the dbt Cloud CLI, all artifacts are downloaded by default. If you want to change this behavior, refer to How to skip artifacts from being downloaded.
Where are artifacts produced?
By default, artifacts are written to the /target
directory of your dbt project. You can configure the location using the target-path
flag.
Common metadata
All artifacts produced by dbt include a metadata
dictionary with these properties:
dbt_version
: Version of dbt that produced this artifact. For details about release versioning, refer to Versioning.dbt_schema_version
: URL of this artifact's schema. See notes below.generated_at
: Timestamp in UTC when this artifact was produced.adapter_type
: The adapter (database), e.g.postgres
,spark
, etc.env
: Any environment variables prefixed withDBT_ENV_CUSTOM_ENV_
will be included in a dictionary, with the prefix-stripped variable name as its key.invocation_id
: Unique identifier for this dbt invocation
In the manifest, the metadata
may also include:
send_anonymous_usage_stats
: Whether this invocation sent anonymous usage statistics while executing.project_name
: Thename
defined in the root project'sdbt_project.yml
. (Added in manifest v10 / dbt Core v1.6)project_id
: Project identifier, hashed fromproject_name
, sent with anonymous usage stats if enabled.user_id
: User identifier, stored by default in~/dbt/.user.yml
, sent with anonymous usage stats if enabled.
Notes:
- The structure of dbt artifacts is canonized by JSON schemas, which are hosted at schemas.getdbt.com.
- Artifact versions may change in any minor version of dbt (
v1.x.0
). Each artifact is versioned independently.
Related docs
- Other artifacts files such as
index.html
orgraph_summary.json
.