Snowflake Architecture: Three-Layer Design Explained (2026)

Snowflake uses a multi-cluster shared-data architecture that differs from both traditional shared-disk and shared-nothing designs. Storage, compute, and cloud services are fully decoupled, and each layer scales independently. This article walks through the role of each layer, micro-partitions, the cache hierarchy, and the Cloud Services billing rule — the exam-relevant essentials plus what matters in production.

Three-Layer Architecture Overview

Snowflake's architecture is built from three layers. Each layer scales and fails over independently, freeing users from infrastructure management so they can focus on analytics.


┌─────────────────────────────────────────────┐
│         Cloud Services Layer                │
│  Auth / Access Control / Query Optimization │
│  Metadata Management / Transactions         │
├─────────────────────────────────────────────┤
│         Compute Layer (Virtual Warehouses)  │
│  WH-1 (XS)  │  WH-2 (L)  │  WH-3 (XL)    │
│  Independent clusters, isolated from each   │
├─────────────────────────────────────────────┤
│         Storage Layer                       │
│  Cloud Object Storage (S3 / Azure Blob / GCS)│
│  Micro-partitions (columnar, compressed)    │
└─────────────────────────────────────────────┘

Storage Layer

Data is stored on cloud-provider object storage (AWS S3, Azure Blob Storage, Google Cloud Storage) in a columnar format. Users never interact with storage directly — Snowflake handles all data management automatically.

How Micro-Partitions Work

Table data is automatically split into units called micro-partitions.

Property	Detail
Size	50-500 MB compressed (uncompressed equivalent is several hundred MB)
Format	Columnar storage format
Management	Snowflake handles splitting and reorganization automatically (no user action required)
Metadata	Per-partition min/max, NULL count, and distinct count metadata for every column
Pruning	WHERE clauses are matched against metadata so irrelevant partitions are skipped
Immutability	Once created, partitions are immutable (INSERT/UPDATE/DELETE create new partitions)

Micro-partition immutability is what enables Time Travel (point-in-time queries) and Fail-safe (disaster recovery). When you run UPDATE, the affected partition is recreated as a new version, and the previous version is retained for the Time Travel window.

Compute Layer

Virtual Warehouses handle query execution. Each warehouse is an independent compute cluster — workloads in one warehouse don't impact others.

Size	Credits/Hour	Typical Use Case
X-Small	1	Development, testing, light queries
Small	2	Small dashboards
Medium	4	Medium ETL and BI workloads
Large	8	Large batch processing
X-Large	16	Full scans on TB-scale tables
2XL〜6XL	32〜512	Very large data processing

Warehouses support multi-cluster scaling, automatically adding clusters as concurrent query count grows. Auto-suspend stops idle warehouses, and auto-resume starts them again on the next query.

Cloud Services Layer

The 'brain' of Snowflake. Users never see this layer directly, but it handles these critical responsibilities:

Authentication & access control: User authentication, RBAC, and network policy enforcement
Metadata management: Table definitions and micro-partition statistics
Query optimization: Query parsing, optimization, and pruning decisions
Transaction management: Guarantees ACID transactions
Security: End-to-end AES-256 encryption with automatic key rotation

Cloud Services Billing (the 10% adjustment)

Item	Value
Billing threshold	10% of daily warehouse credit consumption
What's billed	Only the portion exceeding 10%
How to check	ACCOUNT_USAGE.METERING_HISTORY view
Common over-threshold cases	Heavy SHOW/DESCRIBE usage, Snowpipe notification processing, high-volume COPY INTO

Example: if warehouses consume 200 credits in a day, the Cloud Services threshold is 20 credits. If Cloud Services consumed 25 credits, only 25 - 20 = 5 credits are billed on top.

The Three-Cache Hierarchy

Snowflake has three caches that improve query performance and reduce cost.

Cache type	Location	Lifetime	Hit conditions	Cost
Result Cache	Cloud Services Layer	24 hours (extended on reuse)	Same query, same role, no data change	No warehouse used (zero credits)
Metadata Cache	Cloud Services Layer	Always-on	Aggregation queries like COUNT/MIN/MAX	No warehouse used (zero credits)
Warehouse Cache (Local Disk Cache)	Warehouse SSD	While warehouse is running	Same warehouse accessing same table	Included in warehouse cost

Result Cache is the most cost-effective — it returns results without starting a warehouse. It shines for repeated identical queries like scheduled BI dashboard refreshes. But Result Cache is invalidated as soon as DML runs on the underlying table.

End-to-End Query Processing

User submits SQL
Cloud Services Layer parses and optimizes the query
If Result Cache has a matching result, it returns immediately without a warehouse
If an aggregation query can be answered from Metadata Cache, it returns without a warehouse
Otherwise the query runs on the specified warehouse
Micro-partition pruning reads only the relevant data
Data already in Warehouse Cache skips storage access
Results are returned to the user and stored in Result Cache

Snowflake Editions and Architecture Features

Feature	Standard	Enterprise	Business Critical	VPS
Multi-cluster warehouses	-	Yes	Yes	Yes
Time Travel (up to 90 days)	1 day only	Up to 90 days	Up to 90 days	Up to 90 days
Materialized views	-	Yes	Yes	Yes
Search Optimization Service	-	Yes	Yes	Yes
Tri-Secret Secure	-	-	Yes	Yes
AWS PrivateLink / Azure Private Link	-	-	Yes	Yes
Dedicated metadata store	-	-	-	Yes

Exam Focus Areas

On the SnowPro Core exam, the Architecture domain is about 25% of the test — the single biggest area. These are the points you should expect to be tested on:

Role of each layer and which processing happens where
Concrete benefits of decoupled storage and compute
Micro-partition properties (size, immutability, columnar, automatic management)
Differences across the three caches (location, warehouse start required or not, invalidation conditions)
The Cloud Services Layer 10% billing rule
Feature differences across editions (multi-cluster, Time Travel window, Tri-Secret Secure)

Check Your Understanding

SnowPro Core

問題 1

Which statement about Snowflake's Result Cache is correct?

Result Cache is stored on the warehouse's local disk and is cleared when the warehouse stops
Result Cache lasts 24 hours and returns results without a warehouse when the same query is run by the same role and the data has not changed
Result Cache is only available on Enterprise Edition or higher
There is a 50 MB upper limit on data that can be stored in Result Cache

正解: B

Result Cache lives in the Cloud Services Layer for 24 hours. It hits when the query text and role match and no DML has changed the underlying tables, returning results without starting a warehouse. Option A actually describes Warehouse Cache (Local Disk Cache).

Frequently Asked Questions

What are the benefits of separating storage and compute in Snowflake's three-layer architecture?

Storage and compute scale independently, so you can grow only storage as data volume rises, or scale warehouses up and down based on query load. You don't carry idle capacity the way a traditional on-prem DWH forces you to, and cost efficiency improves dramatically. A second major benefit: multiple warehouses can access the same data concurrently without lock contention.

How is Cloud Services Layer billing calculated?

Cloud Services credits are billed only for usage that exceeds 10% of daily warehouse credit consumption (the 10% adjustment rule). For example, if warehouses consume 100 credits and Cloud Services consume 15, only 15 - 10 = 5 credits are billed. The design effectively makes core services like authentication, metadata management, and query optimization free.

How do micro-partitions relate to clustering keys?

Snowflake automatically splits data into 50-500 MB compressed micro-partitions and stores per-partition min/max metadata. When you set a clustering key, partitions are reorganized (reclustered) by the specified columns, which improves pruning efficiency. Reclustering consumes credits, so the typical recommendation is to consider clustering keys only when tables are larger than 1 TB and frequently filter by a specific column.

Check what you learned with practice questions

Practice with certification-focused question sets

Try free practice questions

Author

NicheeLab Editorial Team

NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.

Snowflake Architecture: Three-Layer Design Explained