Once real-time processing becomes a baseline requirement, Kafka sits at the center of the data platform, and the value of engineers who can drive both design and operations rises sharply.
CCDAK (development) and CCAAK (operations) are among the few credentials that objectively signal those skills. This article ties exam preparation to on-the-job tasks in a way that translates directly into hiring outcomes, compensation, and career growth.
Batch-centric DWHs alone struggle to meet requirements such as personalization, inventory synchronization, fraud detection, and IoT telemetry. Kafka, with its unified messaging and log semantics, is adopted as the backbone of loosely coupled, event-driven designs. Hiring teams want engineers who understand the trade-offs between throughput and consistency and who can build reprocessing and ordering guarantees into the design from day one.
Enterprises typically run a mix of multiple clouds and on-premises. Kafka offers strong runtime compatibility and stable client APIs, making it easier to mitigate vendor lock-in. Pairing certifications with a portfolio leads to consistent evaluations even on migration and modernization projects.
A bird's-eye view of a streaming platform
Minimum foundation: creating a highly available topic
kafka-topics.sh --bootstrap-server broker-1:9092 \
--create --topic orders \
--partitions 6 --replication-factor 3 \
--config min.insync.replicas=2CCDAK covers producer/consumer APIs, data modeling (keys, partitions, ordering), delivery guarantees (at-least / at-most / exactly once), and the fundamentals of connectors and schema operations. The focus is on application-side design decisions and recovery strategies.
CCAAK centers on broker and topic configuration, security (SASL/SSL and ACLs), monitoring and operational standardization (alerts, capacity planning, rolling upgrades), and failure recovery and rebalance control. The emphasis is on operational decisions that protect production SLOs.
| Certification | Target Audience | Main Exam Domains | Example On-the-Job Tasks |
|---|---|---|---|
| CCDAK (Developer) | Application / data engineers | Producer/Consumer APIs, key design, partitions, delivery guarantees, schemas, connector basics, stream processing basics | Event schema design, ordering-guarantee requirements, idempotent/transactional producers, read_committed consumption, reprocessing design |
| CCAAK (Administrator) | SRE / platform / operations | Broker/topic configuration, security (SASL/SSL/ACL), monitoring and capacity planning, rolling operations, failure recovery, Connect operations | Cluster build and upgrades, standardizing ACL policies, monitoring throughput/latency SLOs, tuning rebalances and throttling, multi-AZ topologies |
Where the exam meets practice: minimum settings for delivery guarantees and access control
# Producer(Exactly-onceの前提)
enable.idempotence=true
acks=all
retries=2147483647
max.in.flight.requests.per.connection=5
# トランザクションを使う場合
delivery.timeout.ms=120000
transactional.id=orders-tx-001
# Consumer(コミット済のメッセージのみ)
isolation.level=read_committed
auto.offset.reset=earliest
# ACL例(管理者が適用)
# topic:orders の Read をアプリUser:app-ordersに許可
kafka-acls.sh --bootstrap-server broker-1:9092 \
--add --allow-principal User:app-orders \
--operation Read --topic orders --group orders-cgHiring decisions are driven by consistency of design judgment and reproducibility under incidents. Certifications work as shared vocabulary, but you gain real negotiating power when you can back them up with hands-on evidence: schema evolution, ordering and reprocessing, and SLO operations. Scale, SLOs, on-call expectations, and multi-region requirements feed directly into compensation, so it is important to articulate the difficulty and scope of each role clearly.
Case interviews typically cover isolating bottlenecks when latency rises (producer, broker, consumer, storage), avoiding schema incompatibilities, and designing dead-letter handling for connectors. Interview prep maps cleanly onto the exam domains.
A minimum set of diagnostic commands useful in interviews
# API互換の確認
kafka-broker-api-versions.sh --bootstrap-server broker-1:9092
# トピックの状態
kafka-topics.sh --bootstrap-server broker-1:9092 --describe --topic orders
# 消費ラグの概観(ツール併用も可)
kafka-consumer-groups.sh --bootstrap-server broker-1:9092 \
--describe --group orders-cgWeek 1 is about concepts and terminology alignment. Use the official Kafka documentation to articulate topics, partitions, replication, the basic producer/consumer APIs, and the precise definitions of delivery guarantees. Then map the exam scope against the Confluent exam guide.
Weeks 2-3 are hands-on. On a small cluster (3 brokers), toggle topic retention and compaction, and vary producer batch settings while measuring throughput and latency. Confirm schema evolution and read_committed consumption. On the operations side, work through ACLs, certificates, and procedural rolling restarts.
Week 4 is shoring up weak spots and running mock questions. Prepare failure scenarios (broker outages, slow networks, consumer fencing) and finalize your diagnostic playbook.
A baseline for performance measurement (observing the impact of batching and compression)
kafka-producer-perf-test.sh \
--producer.config producer.props \
--topic perf-test --num-records 500000 --throughput -1 \
--record-size 512
# producer.props の例
compression.type=snappy
batch.size=65536
linger.ms=20
acks=all
enable.idempotence=trueKey design is the linchpin of ordering guarantees and scaling. Fix the key at the unit where ordering matters, and choose partition counts that account for consumer parallelism and future scale. Frequent rebalances hurt throughput, so stabilize them by tuning session and max poll interval settings.
Pick delivery guarantees based on the requirement. If duplicates are acceptable, at-least-once is often enough. If coordination with external systems is required, combine idempotence with transactions to achieve exactly-once (deduplication within the topic plus commit consistency). In operations, distinguishing retention from compaction and preserving backward schema compatibility largely determines the cost of reprocessing.
Configuration snippets: compaction and transactions
# ログコンパクションを有効化
kafka-configs.sh --bootstrap-server broker-1:9092 \
--alter --entity-type topics --entity-name user-profile \
--add-config cleanup.policy=compact,min.compaction.lag.ms=60000
# Java Producerのプロパティ抜粋(EOS2)
props.put("enable.idempotence", "true");
props.put("acks", "all");
props.put("transactional.id", "inventory-tx-1");
props.put("max.in.flight.requests.per.connection", "5");Kafka is the foundation of data distribution. When upstream schema management, downstream DWH/lake, and infrastructure automation all fit together, you can deliver value end-to-end from requirements through operations. The combination of certifications you hold directly expands the range of projects and roles you can take on.
For example, the reliability work in streaming ETL (schema compatibility, observability, reprocessing design) pairs naturally with Snowflake and Databricks loading strategies, dbt transformation design, and Terraform/Vault for automation and secrets management.
CCDAK / CCAAK
問題 1
You are processing payment events that trigger side-effectful updates to an external inventory service. Duplicate application is strictly prohibited, and after a failure the system must produce consistent results once restarted. Which combination is the most appropriate Kafka-based design?
正解: A
To achieve effectively exactly-once semantics, idempotent producers with transactions (transactional.id) preserve commit-unit consistency, while consumers using read_committed read only committed records. Side effects in external systems should be protected on the application side with an idempotency key so retries do not apply twice. The other options break delivery guarantees or consistency.
Which should I take first, CCDAK or CCAAK?
If you work on application implementation and data modeling, CCDAK is the shortest path. If your role is closer to platform operations and SRE, CCAAK is the better starting point. If you straddle both, lock in design fundamentals with CCDAK first, then cover SLO operations and security with CCAAK.
Should I focus on ZooKeeper or KRaft?
Core concepts (topics, replication, delivery guarantees, security, monitoring) are the same for both. Align your exam prep with the version assumed in the official guide. In production, the industry is shifting to KRaft, but during the transition period it is safer to understand the operational assumptions of both.
Is the certification still valuable if I mostly use managed Kafka (Confluent Cloud)?
Yes. Core concepts such as client design, schema operations, delivery guarantees, and ACLs apply equally, and the certification signals that you can make portable design decisions. On the operations side, understanding the shared responsibility boundary for managed services while also being able to articulate monitoring, capacity planning, and security design decisions is a strong combination.
Practice with certification-focused question sets
無料で問題を解いてみるNicheeLab Editorial Team
NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.
Kafka Topics & Partitions: Distribution Fundamentals (2026)
How Kafka topics and partitions enable scale — ordering guar...
CCDAK Exam Guide: Confluent Certified Developer (2026)
Complete prep for the CCDAK exam — Producer/Consumer API, St...
CCAAK Exam Guide: Confluent Certified Administrator (2026)
Pass the CCAAK exam — cluster management, partitions, securi...
Kafka Replicas & ISR: Fault Tolerance Explained (2026)
Replica placement, in-sync replicas (ISR), leader election. ...
Kafka Offsets: Commit Modes & Consumer Position (2026)
Offset semantics — auto vs. manual commit, __consumer_offset...