Vault Storage Backends: Integrated Storage vs Consul vs Other (2026)

Vault availability and failure behavior depend heavily on the Storage Backend. Especially from an Ops perspective, the key is to understand the consistency model and select one that fits your organization's operational standards.

Based on stable behavior in the official documentation, this article explains realistic comparisons and design points for Integrated Storage (Raft) and Consul, with both practice and certification preparation in mind.

Role of Storage Backends and Foundational Assumptions

Vault stores data in the Storage Backend in encrypted form. The targets include secrets, tokens, leases, credentials, and metadata. Storage selection directly affects durability, consistency, availability, and operational cost.

Production use requires a backend that satisfies high availability and strong consistency. The current recommendations are primarily Integrated Storage (Raft) or Consul. Some legacy backends are deprecated or have reduced support, so it is safer to avoid them in new builds.

Vault itself encrypts data, and the Storage side holds the ciphertext
HA locks and leader election secure a consistent write path
Backup methods differ by backend (Raft snapshots, Consul snapshots, etc.)
Behavior during network partitions depends on the consistency model. Clarify quorum requirements at the design stage

Overview of Options and Recommended Scenarios

Integrated Storage (Raft) provides strong consistency and simple operations via Raft embedded in the Vault process. It suits cases where you want to reduce external dependencies and keep the cluster self-contained.

Consul has a KV store as part of its mature service mesh / service discovery offering, with a long track record as Vault storage. If you already have a reliable Consul operational foundation and your team has the know-how, it remains a strong choice.

For greenfield builds that minimize external dependencies, choose Integrated Storage
If you have an existing, stable Consul foundation, choose Consul
From a long-term operations and support perspective, it is safer to avoid adopting legacy backends for new builds

Backend	Consistency	Availability / Failure Behavior	Operational Cost
Integrated Storage (Raft)	Strong consistency (single leader via Raft, quorum commit)	Node failures continue with quorum maintained. The minority halts during network partitions	Few external dependencies and simple. Disk/IOPS design is important
Consul	Strongly consistent writes. Read consistency is configurable (Vault assumes strongly consistent behavior)	Depends on the health of the Consul cluster. Multi-DC design and WAN federation raise design complexity	Requires Consul operational knowledge. Good fit when you already have an existing foundation
Legacy backend group (e.g., etcd/DynamoDB)	Implementation and operational assumptions vary	Behavior and future viability differ based on support status	Strongly dependent on vendor and operations

Translating the Consistency Model into Design

Raft performs log replication using a single-leader model, committing with majority agreement. This keeps writes consistently strongly consistent. Reads are also strongly consistent, based on the leader or committed indexes.

Consul also uses Raft internally, and writes from Vault are strongly consistent. While some reads allow a consistency/latency tradeoff to be configured, Vault generally assumes a strongly consistent path to avoid data races. In design, make explicit the number of nodes needed to form quorum, behavior during network partitions, and read consistency.

Quorum for an N-node configuration is the majority (floor(N/2)+1)
During network partitions, writes are impossible in the minority (the cost of strong consistency)
Do not stretch a single cluster across high-latency regions (destabilizes quorum)
Improving read performance requires holistic design of node count, CPU, IOPS, and network

Consistency and Quorum in a Vault Cluster (Raft)

               ┌────────────────┐
               │   Client Write │
               └───────┬────────┘
                       │
                  +────▼────+
                  │ Leader  │  Node A
                  +────┬────+
           AppendEntries│
     ┌──────────────────┼──────────────────┐
 +───▼────+         +───▼────+         +───▼────+
 │Follower│         │Follower│         │Follower│
 │ Node B │         │ Node C │         │ Node D │
 +────────+         +────────+         +────────+
     │                   │                  │
     └────── ack ────────┴────── ack ───────┘
             Quorum reached → commit

Key Points of Availability and Operational Design

Availability is not just a matter of increasing node count; placement must consider quorum and failure domains (AZ/rack). Storage IO stability is particularly important because it can trigger latency spikes and leader turnover.

Prepare backups to suit your backend. Integrated Storage relies on Vault's Raft snapshots, and Consul on Consul's snapshot feature as the basics. Rehearse restoration procedures periodically and verify your recovery time objective and data consistency.

Three or more nodes in an odd-count configuration (e.g., 3, 5). Distribute across AZs with independent power and networks
Prioritize stability for Disk/IOPS. Write latency directly drives cluster-wide throughput
Monitoring should include Raft/Consul health, leader status, replication lag, and snapshot success/failure
Perform planned outages and upgrades one node at a time, in an order that preserves quorum
Decide upfront whether to use Enterprise features for DR and cross-region (based on requirements and budget)

Configuration Examples and Build Checklist

A representative server.hcl example. In real operations, configure with TLS, auto-unseal, audit logs, resource limits, and more. Adjust addresses and paths to match your environment.

After configuration, validate settings before startup. After startup, verify cluster health, leader status, and whether snapshots can be taken.

Before startup: storage path permissions, disk capacity, network reachability, TLS certificates
After startup: leader election, API/Cluster addresses, snapshot take/restore tests
Failure drills: simulate node outages, network partitions, and high disk latency, then verify SLOs

Example Vault server.hcl (Integrated Storage and Consul)

# Integrated Storage (Raft) example
listener "tcp" {
  address       = "0.0.0.0:8200"
  tls_disable   = 0
  tls_cert_file = "/etc/vault/tls/tls.crt"
  tls_key_file  = "/etc/vault/tls/tls.key"
}

storage "raft" {
  path    = "/opt/vault/data"
  node_id = "vault-1"
  retry_join {
    leader_api_addr = "https://10.0.0.1:8200"
  }
  retry_join {
    leader_api_addr = "https://10.0.0.2:8200"
  }
}

api_addr     = "https://vault-1.example.local:8200"
cluster_addr = "https://10.0.0.11:8201"

# Consul example (when using an existing stable Consul)
# Configure ACL/TLS/snapshot settings appropriately on the Consul side
# Match address (e.g., via local agent) to your operational standards
#
# storage "consul" {
#   address = "127.0.0.1:8500"
#   path    = "vault/"
#   scheme  = "https"
#   token   = "<consul-acl-token>"
# }

Ops Exam Prep Points and Real-World Pitfalls

Ops design and operations questions tend to ask how you concretized the tradeoff between consistency and availability. Prepare to explain consistently the rationale for storage selection, quorum design, backup and restore procedures, and failure behavior.

Both the exam and real-world practice check whether you understand not to repurpose dev backends or single-node configurations for production, and that write halts during network partitions are correct behavior.

Adopt a strongly consistent backend (Integrated Storage or Consul) in production
State explicitly: odd node count, majority quorum, failure domain distribution
Verify backups periodically using backend-native means (Raft snapshots / Consul snapshots)
file and inmem are not recommended for production; they do not satisfy HA
It is by design that the minority cannot write during a network partition (data protection)

Check with a Practice Question

Ops

問題 1

You are building a new 3-node Vault cluster in a single region. You want to minimize external dependencies and prioritize strong consistency, simple operations, and self-healing during failures (automatic rejoining while quorum is maintained). Which is the most appropriate storage choice and design?

Adopt Integrated Storage (Raft) with 3 nodes for an odd-count quorum. Configure stable disk/IOPS and Raft rejoin settings
Skip Consul and share a file backend on NFS, mounting the same path from all nodes
Use the dev-purpose inmem backend even in production and skip backups
Provide just one Consul node and connect a 3-node Vault, leaving quorum to the Vault side

正解: A

The requirements are reduced external dependencies, strong consistency, and simple operations. Integrated Storage (Raft) is the best fit. Design quorum with an odd node count, and configure stable storage IO and rejoin settings. NFS-shared file does not satisfy HA and is unsuitable; inmem is not recommended for production; a single Consul node does not meet availability requirements.

Frequently Asked Questions

Should I choose Integrated Storage or Consul?

For greenfield deployments where you want to minimize external dependencies and run Vault self-contained, Integrated Storage is the first choice. If you already have a stable Consul foundation that is the organizational standard, Consul is also appropriate. Both provide strong consistency and are production-ready.

How should I perform backups?

Integrated Storage uses the vault operator raft snapshot mechanism to take and verify snapshots. Consul uses its own snapshot feature. In both cases, build periodic execution and restore rehearsals into your operations, and verify your recovery time objective and data consistency.

Writes failed during a network partition. Is this a malfunction?

This is correct behavior that preserves strong consistency. The side that loses quorum (the minority) cannot become leader and rejects writes. It is important to design a topology that prioritizes quorum formation (odd node count, AZ distribution) and to define operational procedures for partition scenarios in advance.

Check what you learned with practice questions

Practice with certification-focused question sets

無料で問題を解いてみる

Author

NicheeLab Editorial Team

NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.

Vault Storage Backends Overview: Operational Decisions Driven by Choice and Consistency