Industry Insights
Part 1: Stop Shipping MCP Servers Naked, Why OAuth 2.1 Is Non-Negotiable
February 26, 2026
·
6-Minute Read
The Model Context Protocol (MCP) has gone from a niche developer experiment to the backbone of enterprise AI agent workflows in record time. MCP servers now broker access to CRMs, databases, cloud infrastructure, CI/CD pipelines, and internal tools, acting as the universal translation layer between LLMs and the systems that run your business.
Unfortunately, the vast majority of MCP servers deployed today are running completely unauthenticated. No identity layer, no authorization boundary, and no audit trail. Many rely on local STDIO transports with long-lived API keys stuffed into .env files or passed as command-line arguments - a pattern that was barely acceptable for prototyping and is categorically unacceptable for production.
If your MCP server is callable from an AI agent or a remote workflow, and there's no authorization layer in front of it, that's not an oversight. That's an open door.
The Problem with Local MCP and Long-Lived Keys
Let's be specific about what we're moving away from, and why.
The traditional local MCP setup looks something like this: a server running via STDIO on the same machine as the client, configured with environment variables holding API keys, service account tokens, or database credentials. The configuration file ships with secrets baked in. There's no token expiry, no scope boundaries, no consent flow, and no way to know who or what is actually calling the server.
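As a concrete illustration, this is the shape of a typical local MCP client configuration with a long-lived credential baked in. The server name and key are invented for the example, but the pattern will look familiar:

```json
{
  "mcpServers": {
    "crm-tools": {
      "command": "npx",
      "args": ["-y", "example-crm-mcp-server"],
      "env": {
        "CRM_API_KEY": "sk_live_FAKE_EXAMPLE_KEY"
      }
    }
  }
}
```

Every problem described below traces back to that one `env` block: a static secret, checked into a file, with no expiry and no identity attached to it.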
This model has several fundamental problems:
- Secrets sprawl and lateral movement. Long-lived API keys stored in config files, environment variables, or .json configs are trivially extractable. Once compromised, whether through a leaked repo, a compromised developer machine, or a supply chain attack, they provide persistent, unrestricted access. There's no rotation, no revocation without redeployment, and no blast radius containment.
- Zero identity attribution. When every request uses the same static credential, you can't distinguish between a legitimate agent workflow and a compromised tool exfiltrating data. Your logs - if they exist at all - show one identity making every call. Incident investigation becomes guesswork.
- No consent, no scopes, no boundaries. Static keys are all-or-nothing. There's no mechanism for users to review what permissions an agent is requesting, no way to grant read access to one resource without granting write access to everything, and no principle of least privilege enforced at the protocol level.
- No visibility. STDIO-based local servers don't natively produce the kind of structured, correlated audit logs that security teams need. Events aren't linked across sessions. There's no centralized view of what tools were invoked, by whom, with what parameters, and what data was returned.
This is the model we need to leave behind.
Remote MCP Servers with OAuth 2.1: The Production Architecture
The March 2025 MCP specification revision formally mandated OAuth 2.1 as the authorization mechanism for remote MCP servers. The June 2025 revision went further, establishing a clear architectural separation: MCP servers act as OAuth 2.1 Resource Servers only, validating tokens issued by external, dedicated Authorization Servers.
This isn't just a protocol upgrade; it's a fundamental shift in how trust is established in AI agent workflows.
How It Works
The architecture separates concerns cleanly:
- The Authorization Server is your identity gatekeeper. It authenticates users, issues short-lived access tokens with specific scopes, and manages consent flows. Think Auth0, Keycloak, Scalekit, or your enterprise IdP.
- The MCP Server is the resource server. It validates incoming tokens, enforces scope-based access control, and serves tool responses. It never manages user logins or issues tokens itself.
- The MCP Client (your AI agent, IDE, or orchestration layer) initiates the OAuth flow, obtains tokens, and presents them with each request.
This maps directly to how the rest of the industry secures APIs, and for good reason. It leverages decades of battle-tested OAuth infrastructure rather than inventing new, untested patterns.
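To make the resource-server role concrete, here is a minimal sketch of the validation step an MCP server performs on each incoming request. It uses an HS256 shared secret purely for brevity; production servers validate RS256/ES256 tokens against the authorization server's published JWKS, and all names (secret, audience, claims) are illustrative:

```python
import base64
import hashlib
import hmac
import json
import time

SECRET = b"demo-shared-secret"  # placeholder; real servers fetch keys from the AS


def b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()


def b64url_decode(s: str) -> bytes:
    return base64.urlsafe_b64decode(s + "=" * (-len(s) % 4))


def mint_token(claims: dict) -> str:
    """Stand-in for the Authorization Server: issues a signed JWT."""
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    payload = b64url(json.dumps(claims).encode())
    sig = hmac.new(SECRET, f"{header}.{payload}".encode(), hashlib.sha256).digest()
    return f"{header}.{payload}.{b64url(sig)}"


def validate_token(token: str, required_scope: str, audience: str) -> dict:
    """What the MCP server (resource server) does: verify signature,
    audience, expiry, and scope - and nothing identity-related beyond that."""
    header, payload, sig = token.split(".")
    expected = hmac.new(SECRET, f"{header}.{payload}".encode(), hashlib.sha256).digest()
    if not hmac.compare_digest(b64url(expected), sig):
        raise PermissionError("bad signature")
    claims = json.loads(b64url_decode(payload))
    if claims.get("aud") != audience:
        raise PermissionError("wrong audience")
    if claims.get("exp", 0) < time.time():
        raise PermissionError("token expired")
    if required_scope not in claims.get("scope", "").split():
        raise PermissionError("insufficient scope")
    return claims
```

Note what the server does not do: it never sees a password, never runs a login flow, and never stores a long-lived secret on behalf of a user. Token issuance stays with the Authorization Server.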
Dynamic Client Registration (DCR): No More Hardcoded Credentials
One of the most impactful features of the MCP authorization spec is its support for Dynamic Client Registration (RFC 7591). In an agentic world, clients can't know all possible MCP servers in advance, and manual registration creates friction that blocks adoption.
DCR allows AI agents and tools to securely and automatically register themselves as OAuth clients at runtime. No hardcoded client IDs. No shared secrets passed through Slack channels. No manual registration forms.
This is critical for enterprise environments where dozens of agents may need to interact with dozens of MCP servers, each managed by different teams. DCR makes that possible without creating a credential management nightmare.
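A DCR registration request is just a JSON document POSTed to the authorization server's registration endpoint. The field names below come from RFC 7591; the client name and redirect URI are invented for illustration:

```python
def build_registration_request(client_name: str, redirect_uri: str) -> dict:
    """Build an RFC 7591 dynamic registration payload. The client POSTs
    this to the AS's registration_endpoint (discovered from the server's
    metadata) and receives an issued client_id in the response."""
    return {
        "client_name": client_name,
        "redirect_uris": [redirect_uri],
        "grant_types": ["authorization_code", "refresh_token"],
        "response_types": ["code"],
        # Public client with no stored secret; PKCE protects the code exchange.
        "token_endpoint_auth_method": "none",
    }
```

The key design point is the last field: because the agent never holds a client secret, there is nothing to leak through config files or chat channels in the first place.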
PKCE: Mandatory, Not Optional
OAuth 2.1 mandates Proof Key for Code Exchange (PKCE) for all clients, both public and confidential. This prevents authorization code interception attacks, a class of vulnerability that has plagued OAuth deployments for years. In the MCP context, where clients may be running in diverse and potentially hostile environments, PKCE isn't a nice-to-have; it's the baseline.
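PKCE is simple enough to sketch end to end with the standard library. The client generates a random verifier and sends only its SHA-256 challenge with the authorization request; at token exchange, the authorization server recomputes the challenge from the presented verifier (the S256 method from RFC 7636):

```python
import base64
import hashlib
import secrets


def make_pkce_pair() -> tuple[str, str]:
    """Client side: generate a code_verifier and its S256 code_challenge."""
    # 32 random bytes -> 43-character base64url verifier (RFC 7636 allows 43-128)
    verifier = base64.urlsafe_b64encode(secrets.token_bytes(32)).rstrip(b"=").decode()
    # code_challenge = BASE64URL(SHA256(verifier)), unpadded
    challenge = base64.urlsafe_b64encode(
        hashlib.sha256(verifier.encode("ascii")).digest()
    ).rstrip(b"=").decode()
    return verifier, challenge


def verify_pkce(verifier: str, challenge: str) -> bool:
    """Authorization server side, at token exchange: recompute and compare."""
    recomputed = base64.urlsafe_b64encode(
        hashlib.sha256(verifier.encode("ascii")).digest()
    ).rstrip(b"=").decode()
    return secrets.compare_digest(recomputed, challenge)
```

An attacker who intercepts the authorization code still cannot redeem it, because the verifier never travels with the authorization request and the challenge is a one-way hash.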
Short-Lived, Scoped Tokens: The End of All-or-Nothing Access
With OAuth 2.1, tokens are short-lived and scoped. An agent requesting access to a deployment tool gets a token with deploy:read, not admin:*. When that token expires in minutes (not never), the blast radius of a compromised credential shrinks dramatically.
The MCP security best practices are explicit on this: avoid wildcard scopes (*, all, full-access). Start with a minimal initial scope set. Use incremental elevation via targeted WWW-Authenticate challenges when privileged operations are first attempted. Log every elevation event with correlation IDs.
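The scope gate and elevation challenge described above can be sketched as a small check the server runs before executing a tool call. The response shape is illustrative; the `insufficient_scope` error and `WWW-Authenticate` header come from RFC 6750:

```python
def check_scope(token_scopes: set[str], required: str) -> dict:
    """Gate a tool invocation on the scopes carried by the presented token.
    On failure, return a challenge naming exactly the missing scope so the
    client can re-run the OAuth flow with an incremental elevation request."""
    if required in token_scopes:
        return {"status": 200}
    challenge = f'Bearer error="insufficient_scope", scope="{required}"'
    return {"status": 403, "headers": {"WWW-Authenticate": challenge}}
```

Because the challenge names a single concrete scope rather than `*` or `full-access`, the client can request exactly the elevation it needs, and the elevation itself becomes a discrete, loggable event.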
What's Next
OAuth 2.1 with DCR, PKCE, and scoped tokens gives you identity, consent, least privilege, and revocability — the foundational properties that every production API has required for years. Combined with rigorous server reputation management, this is the minimum viable security posture for MCP in production.
But it's not the finish line.
In Part 2, we'll go deeper: how OIDC federation and workload identity can eliminate secrets from the equation entirely, how recent advances in claims-based access control (like AWS STS provider-specific claim validation) enable truly fine-grained least privilege on downstream resources, and how to build monitoring that leverages the rich identity context these protocols provide.
