The Databricks Certified Data Analyst Associate (DAA) is the certification that measures practical skills in writing queries, building dashboards, and analyzing data with Databricks SQL. It centers on SQL knowledge, and no Python or Spark coding is required. In late 2025, AI/BI Genie was added to the exam scope, making it a 9-domain exam.
| Item | Details |
|---|---|
| Exam name | Databricks Certified Data Analyst Associate |
| Questions | 45 |
| Duration | 90 minutes (avg. 2 minutes per question) |
| Passing score | 70% (32+ correct answers) |
| Exam fee | $200 (excl. tax) |
| Languages | English / Japanese (selectable) |
| Prerequisites | None |
| Validity period | 2 years |
| Key tools | Databricks SQL, SQL Warehouse, Dashboards |
| Domain | Weight | Approx. questions | Key topics |
|---|---|---|---|
| Databricks SQL | 14% | 6-7 | SQL warehouse configuration, query editor |
| Data Management | 12% | 5-6 | Table operations, views, when to use CTEs |
| SQL Query | 16% | 7-8 | SELECT, JOIN, aggregation, subqueries |
| Data Visualization | 12% | 5-6 | Chart type selection, formatting |
| Dashboards | 12% | 5-6 | Dashboard creation, filters, sharing |
| Analytics Applications | 10% | 4-5 | Alerts, scheduled execution |
| AI/BI Genie | 8% | 3-4 | Genie Space configuration, natural-language queries |
| Data Access and Security | 8% | 3-4 | Table permissions, sharing settings |
| Lakehouse Concepts | 8% | 3-4 | Delta Lake basics, medallion architecture |
The SQL Query domain carries the largest weight at 16%. Most questions ask you to interpret the results of SQL execution, so you must understand JOINs, window functions, and CTEs. Databricks SQL follows at 14%, testing your knowledge of SQL warehouse configuration and management.
SQL Warehouse is the compute resource for Databricks SQL, and it comes in three types.
| Type | Characteristics | Use case |
|---|---|---|
| Serverless | Fastest startup, on the order of seconds. Databricks manages the infrastructure | Production dashboards and ad-hoc queries (recommended) |
| Pro | Moderate startup time. Ships with the Photon engine | Production environments where cost control matters |
| Classic | Takes several minutes to start. Limited feature set | Legacy compatibility only (not recommended for new use) |
AI/BI Genie is a natural-language interface for querying data. Ask a question like "What were the top 5 products by sales last month?" and Genie auto-generates the appropriate SQL and returns the results.
OPTIMIZE my_table ZORDER BY (column1) specifies the filter columnANALYZE TABLE my_table COMPUTE STATISTICS FOR ALL COLUMNS| Comparison item | Data Analyst Associate (DAA) | Data Engineer Associate (DEA) |
|---|---|---|
| Primary audience | BI analysts, data analysts | Data engineers |
| Compute used | SQL Warehouse | All-purpose / Job Cluster |
| Key tools | Databricks SQL, Dashboards, Genie | Notebooks, DLT, Workflows |
| Primary language | SQL (100%) | Python + SQL |
| Delta Lake scope | Basic concepts only (Time Travel, OPTIMIZE) | In depth (MERGE, CDF, Schema Evolution) |
| Unity Catalog scope | Table permissions, sharing settings | 3-level namespace, lineage, external locations |
| Dashboards | Covered (12%) | Not covered |
| ETL pipelines | Not covered | Covered (DLT, Auto Loader, Workflows) |
| Recommended study period | 3-4 weeks (if you have SQL basics) | 5-6 weeks |
Databricks
問題 1
You want to improve dashboard rendering speed in Databricks SQL. The dashboard contains 5 queries, each averaging 30 seconds of execution time. Which is the most effective improvement?
正解: B
To improve dashboard rendering speed, leveraging the Result Cache and physically optimizing the tables is the most effective first step. When the Result Cache is enabled, the second and subsequent executions of the same query return instantly from the cache. OPTIMIZE + Z-ORDER additionally places data for filtered columns physically close together, reducing the amount of scanning required. Option A (increasing cluster size) substantially increases cost while leaving scan volumes unchanged without query-level optimization, so cost effectiveness is poor. Option C does not address the root cause of slow rendering and sacrifices data freshness. Option D forces queries with different schemas into a single UNION ALL, which severely hurts readability and maintainability.
Should I take Data Analyst Associate or Data Engineer Associate first?
Choose based on your day-to-day work. If you spend most of your time writing SQL queries, building dashboards, and doing BI analysis, DAA is the right fit. If you focus on ETL pipelines, Delta Lake operations, and Workflows-based job management, DEA is the better choice. DAA can be tackled in 3-4 weeks if you already know SQL. DEA requires additional Python/PySpark and Delta Lake knowledge, so plan for 5-6 weeks. If you want both, the most efficient path is DAA first (assuming SQL basics) to get comfortable with the Databricks UI, then DEA.
How much is AI/BI Genie covered on the exam?
AI/BI Genie is a newer Databricks SQL feature and accounts for about 2-3 questions. You will be fine if you understand that it is a natural-language interface for querying data, that it runs through a Genie Space connected to a SQL warehouse, and that access permissions follow Unity Catalog table permissions. Questions about SQL auto-generation accuracy and its limits (such as complex JOINs) have also been reported.
Can I pass without hands-on Databricks SQL experience?
Yes, you can pass. The free Databricks Community Edition does not include SQL warehouses, but you can practice writing queries with SQL execution inside notebooks. For dashboards, alerts, and query history, memorizing the screenshots and step-by-step procedures in the official documentation is enough. If you sign up for the 14-day free trial, you can experience SQL warehouses and dashboards hands-on, which we recommend doing at least once before the exam.
Related Databricks Certification Articles
Practice with certification-focused question sets
無料で問題を解いてみるNicheeLab Editorial Team
NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.
Databricks Certifications: All 7 Exams, Difficulty & Study Plan (2026)
Complete guide to all 7 Databricks certifications — Data Eng...
Databricks Exam Difficulty Ranking: All 7 Certs Compared (2026)
Every Databricks certification ranked by difficulty, with st...
Databricks Study Guide: Fastest Pass Route & Time Estimates (2026)
How to pass Databricks certifications efficiently. Official ...
Databricks Data Engineer Associate: Complete Guide (2026)
Domain-by-domain breakdown of the Databricks Certified Data ...
Databricks Data Engineer Professional: Complete Guide (2026)
Tactics for the Databricks Certified Data Engineer Professio...