Claude Code on Google Vertex AI

Learn about configuring Claude Code through Google Vertex AI, including setup, IAM configuration, and troubleshooting.

export const ContactSalesCard = ({surface}) => { const utm = content => utm_source=claude_code&utm_medium=docs&utm_content=${surface}_${content}; const iconArrowRight = (size = 13) => <svg width={size} height={size} viewBox="0 0 24 24" fill="none" stroke="currentColor" strokeWidth="2.5" strokeLinecap="round" strokeLinejoin="round" aria-hidden="true"> ; const STYLES = .cc-cs { --cs-slate: #141413; --cs-clay: #d97757; --cs-clay-deep: #c6613f; --cs-gray-000: #ffffff; --cs-gray-700: #3d3d3a; --cs-border-default: rgba(31, 30, 29, 0.15); font-family: inherit; } .dark .cc-cs { --cs-slate: #f0eee6; --cs-gray-000: #262624; --cs-gray-700: #bfbdb4; --cs-border-default: rgba(240, 238, 230, 0.14); } .cc-cs-card { display: flex; align-items: center; justify-content: space-between; gap: 16px; padding: 14px 16px; margin: 0; background: var(--cs-gray-000); border: 0.5px solid var(--cs-border-default); border-radius: 8px; flex-wrap: wrap; } .cc-cs-text { font-size: 13px; color: var(--cs-gray-700); line-height: 1.5; flex: 1; min-width: 240px; } .cc-cs-text strong { font-weight: 550; color: var(--cs-slate); } .cc-cs-actions { display: flex; align-items: center; gap: 8px; flex-shrink: 0; } .cc-cs-btn-clay { display: inline-flex; align-items: center; gap: 8px; background: var(--cs-clay-deep); color: #fff; border: none; border-radius: 8px; padding: 8px 14px; font-size: 13px; font-weight: 500; transition: background-color 0.15s; white-space: nowrap; } .cc-cs-btn-clay:hover { background: var(--cs-clay); } .cc-cs-btn-ghost { display: inline-flex; align-items: center; gap: 8px; background: transparent; color: var(--cs-gray-700); border: 0.5px solid var(--cs-border-default); border-radius: 8px; padding: 8px 14px; font-size: 13px; font-weight: 500; } .cc-cs-btn-ghost:hover { background: rgba(0, 0, 0, 0.04); } .dark .cc-cs-btn-ghost:hover { background: rgba(255, 255, 255, 0.04); } @media (max-width: 720px) { .cc-cs-actions { width: 100%; } }; return <div className="cc-cs not-prose"> <div className="cc-cs-card"> <div className="cc-cs-text"> Deploying Claude Code across your organization? Talk to sales about enterprise plans, SSO, and centralized billing.

Model type	Default value
Primary model	`claude-sonnet-4-5@20250929`
Small/fast model	`claude-haiku-4-5@20251001`

google-vertex-ai.md +7 −2

68 <a href={`https://claude.com/pricing?${utm('view_plans')}#plans-business`} className="cc-cs-btn-ghost">68 <a href={`https://claude.com/pricing?${utm('view_plans')}#plans-business`} className="cc-cs-btn-ghost">

69 View plans69 View plans

70 </a>70 </a>

~~71 <a href={`https://www.anthropic.com/contact-sales?${utm('contact_sales')}`} className="cc-cs-btn-clay">~~71 <a href={`https://claude.com/contact-sales?${utm('contact_sales')}`} className="cc-cs-btn-clay">

72 Contact sales {iconArrowRight()}72 Contact sales {iconArrowRight()}

73 </a>73 </a>

74 </div>74 </div>

283# Optional: Disable prompt caching if needed283# Optional: Disable prompt caching if needed

284export DISABLE_PROMPT_CACHING=1284export DISABLE_PROMPT_CACHING=1

285 285

286# Optional: Request 1-hour prompt cache TTL instead of the 5-minute default

287export ENABLE_PROMPT_CACHING_1H=1

288

286# When CLOUD_ML_REGION=global, override region for models that don't support global endpoints289# When CLOUD_ML_REGION=global, override region for models that don't support global endpoints

287export VERTEX_REGION_CLAUDE_HAIKU_4_5=us-east5290export VERTEX_REGION_CLAUDE_HAIKU_4_5=us-east5

288export VERTEX_REGION_CLAUDE_4_6_SONNET=europe-west1291export VERTEX_REGION_CLAUDE_4_6_SONNET=europe-west1

290 293

291Most model versions have a corresponding `VERTEX_REGION_CLAUDE_*` variable. See the [Environment variables reference](/en/env-vars) for the full list. Check [Vertex Model Garden](https://console.cloud.google.com/vertex-ai/model-garden) to determine which models support global endpoints versus regional only.294Most model versions have a corresponding `VERTEX_REGION_CLAUDE_*` variable. See the [Environment variables reference](/en/env-vars) for the full list. Check [Vertex Model Garden](https://console.cloud.google.com/vertex-ai/model-garden) to determine which models support global endpoints versus regional only.

292 295

293[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) is automatically supported when you specify the `cache_control` ephemeral flag. To disable it, set `DISABLE_PROMPT_CACHING=1`. For heightened rate limits, contact Google Cloud support. When using Vertex AI, the `/login` and `/logout` commands are disabled since authentication is handled through Google Cloud credentials.296[Prompt caching](https://platform.claude.com/docs/en/build-with-claude/prompt-caching) is enabled automatically. To disable it, set `DISABLE_PROMPT_CACHING=1`. To request a 1-hour cache TTL instead of the 5-minute default, set `ENABLE_PROMPT_CACHING_1H=1`; cache writes with a 1-hour TTL are billed at a higher rate. For heightened rate limits, contact Google Cloud support. When using Vertex AI, the `/login` and `/logout` commands are disabled since authentication is handled through Google Cloud credentials.

297

298[MCP tool search](/en/mcp#scale-with-mcp-tool-search) is disabled by default on Vertex AI because the endpoint does not accept the required beta header. All MCP tool definitions load upfront instead. To opt in, set `ENABLE_TOOL_SEARCH=true`.

294 299

295### 5. Pin model versions300### 5. Pin model versions

296 301

google-vertex-ai.md 2026-04-23 18:19 UTC to 2026-04-24 18:11 UTC

Claude Code on Google Vertex AI

Prerequisites

Enable Claude models in your GCP project

Start Claude Code and choose Vertex AI

Follow the wizard prompts

Region configuration

Set up manually

1. Enable Vertex AI API

2. Request model access

3. Configure GCP credentials

4. Configure Claude Code

5. Pin model versions

Startup model checks

IAM configuration

1M token context window

Troubleshooting

Additional resources

google-vertex-ai.md +7 −2

google-vertex-ai.md 2026-04-23 18:19 UTC to 2026-04-24 18:11 UTC

Claude Code on Google Vertex AI

Prerequisites

Sign in with Vertex AI

Enable Claude models in your GCP project

Start Claude Code and choose Vertex AI

Follow the wizard prompts

Region configuration

Set up manually

1. Enable Vertex AI API

2. Request model access

3. Configure GCP credentials

4. Configure Claude Code

5. Pin model versions

Startup model checks

IAM configuration

1M token context window

Troubleshooting

Additional resources