In production Vault deployments, reads typically dominate the workload. This is exactly where Enterprise's Performance Standby shines: standby nodes handle read APIs themselves, dramatically reducing the load on the active node.
This article walks through production-grade design points and the comparison, health-check, and forwarding gotchas that frequently appear on Ops-oriented certification exams.
In Vault HA, one node is active and the rest are standby. Standby nodes normally just forward requests to the active node, but in Enterprise the Performance Standby feature lets standby nodes serve most reads themselves.
Secret lookups and token reads/updates (updates that do not touch storage) get distributed, which prevents the active node from becoming a CPU/IO hot spot. Writes — secret creation/updates, mount enable/disable, policy changes, and so on — continue to be processed centrally on the active node.
Performance Standby serves eligible read endpoints on the local node. Ineligible requests, or those that cannot be completed locally for consistency reasons, are forwarded to the active node via request forwarding. With cluster_addr and TLS configured correctly, clients can simply target a single VIP without worrying about redirects.
Consistency is maintained by sharing the same storage and invalidating / synchronizing as needed. Depending on timing — for example a read immediately after a write — Vault may safely forward the request to the active node. This behavior prioritizes a consistent view of the data.
Performance Standby request flow (single cluster)
There are a lot of similar-sounding terms, so let's clarify how to choose from an Ops perspective. For read-scale inside a single cluster, use Performance Standby. To optimize cross-region latency and throughput across multiple clusters, use Performance Replication Secondary (a separate cluster). Standard standbys are basically just forwarders.
DR Replication exists for recovery purposes and is not intended for routine read distribution. The focus of this article is read-scale within a cluster (Performance Standby).
| Aspect | Standard Standby | Performance Standby (Ent) | Performance Replication Secondary (Ent) |
|---|---|---|---|
| Scope | Single cluster (standby) | Single cluster (standby + local reads) | Separate cluster (per region/site) |
| Read handling | Mostly forwarded to active | Eligible reads served by standby | Served on secondary (replicated data) |
| Write handling | Performed on active | Performed on active (forwarded) | Performed on primary (replicated) |
| Latency / bandwidth goal | Reduce redirects / forwards | Reduce active-node load and latency | Faster reads via geographic proximity |
| Primary use case | Small to medium HA | Single cluster with very read-heavy traffic | Multi-region read optimization |
The most important thing is to get the foundation for request forwarding right. Align cluster_addr and TLS on every node so that mutual reachability and certificate validation work between nodes. Without this, standbys cannot forward, and clients get dragged into redirects.
Operationally, the easiest design is a single VIP fronting all nodes (active + standby). The standard health check is /sys/health with standbyok=true and perfstandbyok=true, so both standby and performance-standby nodes are treated as healthy. Writes that land on a standby are forwarded internally, so client-side path splitting is not required.
Example: minimal config using Integrated Storage (excerpt)
# /etc/vault.d/server.hcl
api_addr = "https://vault.example.com:8200"
cluster_addr = "https://10.0.0.1:8201"
disable_mlock = true
storage "raft" {
path = "/opt/vault/data"
node_id = "vault-1"
}
listener "tcp" {
address = "0.0.0.0:8200"
tls_cert_file = "/etc/vault/tls/server.crt"
tls_key_file = "/etc/vault/tls/server.key"
}
# EnterpriseではPerformance Standbyは既定で有効
# 無効化したい場合のみ:
# disable_performance_standby = trueFor monitoring, track node role (active vs. performance standby) and the ratio of locally-served vs. forwarded requests. Capturing per-endpoint response-time distributions and forward ratios via telemetry (e.g. Prometheus) makes bottleneck analysis and effect measurement much easier.
Run load tests at realistic ratios such as 9 reads to 1 write, and measure latency / throughput through the actual LB-wired topology. For active-node failover tests, also measure the transient latency spike during warm-up (cache invalidation and re-convergence) after a standby is promoted.
Remember Performance Standby with a single phrase: 'read-scale within a single cluster.' Knowing where it sits relative to adjacent features and the basics of LB, health checks, and forwarding translates directly into exam points.
Ops
問題 1
In Vault Enterprise, read traffic has surged and the active node's CPU is maxed out. You want to raise read throughput with the smallest possible architectural change. Which option is most appropriate?
正解: A
Read-scale within a single cluster is the domain of Performance Standby. Distributing across all nodes via the LB and treating standbys as healthy lets eligible reads be served locally on each standby. DR Replication is for recovery, not read distribution. The other options either do not help throughput or just lock in the bottleneck.
Is Performance Standby available in Vault OSS?
No. Performance Standby is a Vault Enterprise feature. OSS still supports standby request forwarding, but standby nodes cannot serve reads locally.
Which storage backends support it?
HA-capable storage (Consul or Raft / Integrated Storage). In either case, correct cluster_addr settings and TLS reachability between nodes are critical.
What if I temporarily want everything handled by the active node?
As a stop-gap, enable disable_performance_standby in the server config or switch the LB to target only the active node. For steady-state operation, re-enable Performance Standby afterwards.
Practice with certification-focused question sets
無料で問題を解いてみるNicheeLab Editorial Team
NicheeLab editorial team focused on data engineering and cloud certification learning. Content is structured around practical study needs and official exam domains.
Vault Core Concepts: Sealed/Unsealed, Auth, Secrets (2026)
Vault fundamentals — sealed/unsealed state, auth methods, se...
Vault Operations Professional (VOP-003): Complete Guide (2026)
Pass the Vault Operations Professional exam — enterprise pat...
Vault Path-Based Routing: API URL Structure (2026)
How Vault's path-based routing works — mount points, sub-pat...
Vault Tokens: Auth Token Mechanics (2026)
Token fundamentals — service vs. batch tokens, accessor, ren...
Vault Token Types: Service, Batch, Periodic (2026)
Service vs. batch tokens compared — performance, ACL behavio...