The dbt Analytics Engineer certification is a vendor-neutral, practitioner-oriented credential that measures your ability to design, build, and quality-assure the transformation (T) layer on top of a data warehouse. It tests SQL, modeling, testing, and automation across the board.
This article frames the certification against neighboring vendor certs and walks through the high-frequency topics, a study plan, key design and operations points, and exam-taking tactics — all from a practitioner's viewpoint.
The dbt Analytics Engineer certification targets people who can deliver reproducible modeling, trustworthy tests, and operationalized documentation at the center of a data transformation pipeline. It is aimed at analytics engineers, senior data analysts, and data engineers responsible for the T layer on top of a DWH.
Platform certifications such as Snowflake and Databricks go deep on platform features and operations. dbt, in contrast, is about the craft of transformation and applies across multiple DWHs. In practice, pairing a platform cert (for the underlying foundation) with the dbt cert (for transformation design, quality, and operations) is a solid combination.
| Certification | Primary Domain | Exam Characteristics | Primary Real-World Focus |
|---|---|---|---|
| dbt Analytics Engineer | Modeling / Testing / Documentation / Orchestration | Best-practice selection, concept-focused, vendor-neutral | Transformation (T) design, quality assurance, operational design |
| Databricks Data Engineer Associate | Lakehouse / Delta / ETL and optimization | Spark/Delta APIs and architectural understanding | Batch and streaming ETL, performance tuning |
| SnowPro Core | Snowflake architecture / Security / SQL | Service composition, features, and best practices | Account design, permissions, and cost optimization |
Where the Analytics Engineer sits (overlap of responsibilities)
The exam favors design judgement and best practices over rote command memorization. Mapping topics down to real work units makes them stick faster.
Expect frequent questions on managing model dependencies, test design, snapshots and freshness, documentation and the catalog, and job operations.
Spin up a tiny project on a lightweight DWH (DuckDB or SQLite is fine) and add deliverables each week. For exam notes, the key is to write down the reasoning behind your decisions, not just the answers.
If you can use an existing DWH (Snowflake / Databricks), practice on a near-production setup while keeping a close eye on permissions and cost.
Layering is foundational on both the exam and in real work. stg handles normalization and light cleanup of raw sources, the intermediate layer handles joins, aggregation, and business logic, and marts focus on consumption (dimensions / facts).
Choose materializations by intent. view is lightweight and reflects changes instantly, table provides stable supply, incremental avoids full recomputation at scale, and ephemeral is for transient subqueries that should not add to the dependency graph.
Incremental models identify updates via a unique key and ingest only the changes. You're evaluated on whether your design also handles late-arriving data and pairs cleanly with data quality tests.
Minimal incremental model pattern (conceptual example)
{{ config(
materialized='incremental',
unique_key='order_id'
) }}
with src as (
select
order_id,
customer_id,
amount,
updated_at
from {{ source('app', 'orders') }}
)
select *
from src
{% if is_incremental() %}
-- Conceptual change-detection: pull incrementally by update timestamp.
where updated_at > (
select coalesce(max(updated_at), '1970-01-01') from {{ this }}
)
{% endif %}Tests are a data contract. Built around not_null, unique, relationships, and accepted_values, severity controls runtime behavior (warn vs. fail). Place them according to schema stability: syntactic checks at stg, business constraints at marts.
source and freshness monitor upstream health. If late arrivals are routine, tune the thresholds appropriately and standardize failure notifications and retries through your jobs.
Snapshots are the standard way to retain history. Decide the row identity key and the change-detection strategy (for example, detect column-value changes) and let the auto-managed history columns handle validity periods.
Questions ask how you decide, not what you build. Given the requirements (SLA, grain, history, quality, operations), the rule of thumb is to pick the option that satisfies them with the minimum machinery.
Common traps include grain mismatches, mixing up ref vs. source, misusing incremental (missing keys / full overwrites), and missing tests or misconfigured severity.
Analytics Engineer
問題 1
You want to retain historical values of customer attributes and manage validity periods whenever they change. Which dbt approach is most appropriate?
正解: A
Snapshots fit history-management (SCD Type 2) requirements. You specify an identity key and a change-detection strategy, and the validity columns are managed automatically. Incremental is primarily about update efficiency and does not automatically manage history columns. seed is for static data, and macros alone cannot provide durable history retention.
Should I learn with dbt Cloud or dbt Core?
Either path can get you certified. Cloud has built-in jobs, environment separation, and an integrated docs UI, which makes the operational picture easier to grasp. Core is local-first, but combined with Git/CI you can reproduce the same workflow. The exam emphasizes concepts common to both, such as modeling, tests, ref/source, snapshots, and selectors.
How heavily are metrics and newer features tested?
The exam leans on the stable fundamentals: models, tests, docs, snapshots, and operations. It avoids fine-grained version differences and instead tests design judgement. Keep up with new features in the official docs, but spend the bulk of your study time on the foundational design principles.
Which data warehouse should I assume while studying?
dbt is a vendor-neutral transformation layer, so focus on concepts that apply to any DWH. Practice locally on whatever is convenient — Snowflake, Databricks, or even DuckDB. Solid SQL fundamentals plus an awareness of adapter-specific behavior (for example, optimization differences in some materializations) is enough.
Practice with certification-focused question sets
無料で問題を解いてみるNicheeLab Editorial Team
NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.
dbt Models: SQL-Defined Transformation Units (2026)
Model fundamentals — SELECT-based definitions, naming, refs,...
dbt Analytics Engineering Exam: Complete Guide (2026)
Pass the AE Certification — scope, weighting, sample questio...
dbt Cloud vs dbt Core: Feature & Cost Comparison (2026)
Honest comparison of dbt Cloud vs. dbt Core — IDE, scheduler...
dbt Project Structure: models/seeds/macros Layout (2026)
Recommended dbt project layout — models, seeds, macros, snap...
dbt_project.yml Explained: Every Config (2026)
Every dbt_project.yml setting that matters — paths, vars, ma...