Advanced MCP: Enterprise-Grade AI Integration with Rust

The Problem We're Solving

Every day, knowledge workers in large organizations face the same frustrating pattern:

Ask ChatGPT or Copilot a question about their business data
Realize the AI doesn't have access to their systems
Open their database, CRM, or internal tools
Copy data, paste it into the AI conversation
Hope nothing sensitive gets leaked
Repeat dozens of times per day

This copy-paste workflow is:

Inefficient: Hours lost to context switching
Inconsistent: Different employees get different results
Insecure: Sensitive data ends up in AI training sets
Error-prone: Manual data transfer introduces mistakes

The Solution: Model Context Protocol

The Model Context Protocol (MCP) is an open standard that allows AI assistants to securely connect to your enterprise systems. Instead of copy-paste, your AI can:

Query your databases directly (with proper authorization)
Access your internal APIs and services
Read documentation and knowledge bases
Execute approved business workflows

All while maintaining enterprise security standards.

Why This Course?

There are plenty of tutorials showing how to build a "hello world" MCP server. This course is different.

We focus on enterprise requirements:

Hobbyist Tutorial	This Course
Works on localhost	Deploys to cloud
No authentication	OAuth with enterprise IdPs
No testing	Automated test suites
No monitoring	Full observability
Single developer	Team development
Proof of concept	Production-ready

Why Rust?

When your MCP server handles sensitive enterprise data, you need:

Memory safety: No buffer overflows or use-after-free bugs
Performance: Microsecond response times, minimal cloud costs
Reliability: If it compiles, it probably works correctly
Type safety: Catch errors at compile time, not in production

Rust provides all of this, and the PMCP SDK makes it accessible even to developers new to Rust.

What You'll Build

By the end of this course, you'll have built:

A database MCP server that safely exposes SQL queries to AI
Deployed to three cloud platforms with full CI/CD
OAuth-protected endpoints integrated with your identity provider
Comprehensive test suites that run locally and in production
Observable infrastructure with logging, metrics, and alerting

More importantly, you'll understand the design principles that separate enterprise-grade MCP servers from toy examples.

Course Structure

Part I: Foundations

Start with the basics, but production-ready from day one. Build your first MCP server and understand the architecture.

Part II: Thoughtful Design

Learn why most MCP servers fail: too many confusing tools. Master the art of cohesive API design.

Part III: Cloud Deployment

Deploy to AWS Lambda, Cloudflare Workers, and Google Cloud Run. Connect real MCP clients.

Part IV: Testing

Generate tests from schemas, run them locally, then against production. Integrate with CI/CD.

Part V: Enterprise Security

Add OAuth authentication with Cognito, Auth0, and Entra ID. Implement proper token validation.

Part VI: AI-Assisted Development

Use Claude Code and other AI assistants to accelerate development of business logic.

Part VII: Observability

Add middleware for logging and metrics. Use pmcp.run for simplified monitoring.

Part VIII: Advanced Patterns

Compose multiple servers, build UIs, and architect for high availability.

Prerequisites

Before starting this course, you should have:

Basic Rust knowledge (or willingness to learn)
Access to a cloud account (AWS, GCP, or Cloudflare)
An MCP client (Claude Desktop, VS Code, or similar)
Familiarity with REST APIs and JSON

See the Prerequisites chapter for detailed setup instructions.

Let's Begin

Enterprise AI integration is no longer optional. Your competitors are already connecting their AI assistants to their data.

The question isn't whether to build MCP servers—it's whether to build them right.

Let's build them right.

Continue to Prerequisites →

Prerequisites

Welcome! This course is designed to be accessible to enterprise developers coming from any background. Whether you're a Java architect, C# backend developer, or Python data engineer, you'll find familiar concepts here—just expressed in Rust's syntax.

Our Learning Philosophy: Read, Don't Write

You need to know how to read Rust code, not how to write it.

This course provides extensive code examples that you'll read to understand concepts. When it comes to writing code, you'll use AI coding assistants (Claude Code, Cursor, Copilot) to do the heavy lifting. Your job is to:

Understand what the code is doing
Instruct the AI what you want to build
Review the generated code
Run the compiler to catch any issues

The Rust compiler becomes your safety net—if it compiles, it almost certainly works correctly. This is why Rust is uniquely suited for AI-assisted development.

Why This Approach Works

Rust has an exceptional compiler that provides clear, actionable error messages. Combined with AI assistants that can read and fix these errors, you get a powerful feedback loop:

You describe what you want
    ↓
AI generates Rust code
    ↓
Compiler catches issues (if any)
    ↓
AI fixes issues automatically
    ↓
Working, production-ready code

We cover this in depth in Part VI: AI-Assisted Development, where you'll learn how to effectively collaborate with AI assistants to build MCP servers.

Rust Concepts You'll Encounter

Don't worry if these aren't familiar yet—you'll learn them through the code examples.

Familiar Concepts (Coming from Java/C#)

Java/C#	Rust	Example
`class`	`struct`	`struct User { name: String }`
`interface`	`trait`	`trait Tool { fn call(&self); }`
`try/catch`	`Result<T, E>`	`Ok(value)` or `Err(error)`
`nullable`	`Option<T>`	`Some(value)` or `None`
`async/await`	`async/await`	Same concept, same keywords!
Generics `<T>`	Generics `<T>`	Same syntax!

Rust-Specific Concepts

You'll see these in code examples. AI assistants handle them well:

Ownership & borrowing - Rust's way of managing memory without garbage collection. The compiler ensures you use references safely. You'll see & and &mut in function signatures.
The ? operator - A clean way to propagate errors. When you see result?, it means "return the error if there is one, otherwise continue."
Pattern matching - Like a powerful switch statement. You'll see match and if let used to handle Result and Option values.
Macros - Code that generates code. You'll see #[derive(...)] annotations that automatically implement common functionality.

What You Don't Need to Master

These advanced topics are handled by AI assistants and the PMCP SDK:

Lifetime annotations ('a, 'static)
Unsafe Rust
Advanced trait bounds
Macro writing
Memory layout optimization

Technical Prerequisites

Required Tools

# You'll set these up in Chapter 2
rust (latest stable)    # Programming language
cargo-pmcp              # MCP development toolkit

Helpful Background

HTTP and APIs (you probably already know this):

HTTP methods (GET, POST)
JSON format
REST API concepts

Command Line (basic comfort):

Running commands
Environment variables

Cloud Platforms (For Deployment Chapters)

Parts III-V cover deployment. Familiarity with one is helpful:

AWS - Lambda, API Gateway
Cloudflare - Workers
Google Cloud - Cloud Run

Don't worry if cloud is new—we guide you step by step.

Environment Setup

Chapter 2 includes an interactive setup exercise that guides you through:

Installing Rust
Installing cargo-pmcp
Configuring your MCP client (Claude Desktop, VS Code, etc.)

Go to Environment Setup Exercise →

A Note for Enterprise Developers

If you're coming from enterprise Java or C#, you'll find that:

Rust's type system is similar to what you know, with some additions for safety
The package manager (Cargo) is more ergonomic than Maven or NuGet
Error handling uses explicit types instead of exceptions—cleaner once you're used to it
No null pointer exceptions ever—Rust simply doesn't have null

The strictness that might seem unusual at first is exactly what makes Rust reliable for enterprise systems. And with AI assistants handling the syntax, you can focus on the architecture and business logic you're already expert in.

Ready to Start?

You're ready if you can:

Read code and understand its intent
Describe what you want to build in plain English
Run commands in a terminal
Accept that AI will write most of your code

That's it. The compiler and AI handle the rest.

Continue to Part I: Foundations →

The Enterprise Case for MCP

"We're spending millions on AI tools, but our employees still copy-paste data between applications." — Every CIO, 2024-2025

The Disconnect

Large organizations have invested heavily in AI:

ChatGPT Enterprise licenses
GitHub Copilot for developers
Microsoft Copilot for Office
Custom AI assistants and chatbots

Yet the productivity gains remain elusive. Why?

The AI can't access your data.

Your enterprise knowledge lives in:

SQL databases and data warehouses
CRM systems (Salesforce, HubSpot)
Internal wikis and documentation
Custom APIs and microservices
File shares and document stores

None of these are directly accessible to your AI tools.

The Copy-Paste Tax

Watch any knowledge worker use ChatGPT for work:

1. Open ChatGPT
2. Ask about Q3 sales figures
3. ChatGPT says "I don't have access to your data"
4. Open Salesforce
5. Run a report
6. Copy the data
7. Paste into ChatGPT
8. Ask follow-up question
9. Realize you need more context
10. Open database tool
11. Run SQL query
12. Copy results
13. Paste into ChatGPT
14. Repeat 20 times per day

This pattern costs enterprises:

Hidden Cost	Impact
Time	30-60 minutes per employee per day
Consistency	Different employees get different results
Security	Sensitive data pasted into AI systems
Accuracy	Manual copying introduces errors
Audit trail	No record of what data was shared

At a 10,000-person company, the copy-paste tax is millions of dollars per year.

The MCP Solution

The Model Context Protocol enables secure, direct connections between AI assistants and enterprise systems:

┌─────────────────┐     ┌─────────────────┐     ┌─────────────────┐
│  AI Assistant   │     │                 │     │                 │
│   (ChatGPT,     │────▶│   MCP Server    │────▶│  Enterprise     │
│    Claude,      │     │   (Your Code)   │     │  Systems        │
│   Copilot)      │◀────│                 │◀────│  (DB, API, etc) │
│                 │     │                 │     │                 │
└─────────────────┘     └─────────────────┘     └─────────────────┘

Instead of copy-paste:

1. Open ChatGPT with MCP connections
2. Ask "What were our Q3 sales figures by region?"
3. ChatGPT calls your MCP server
4. MCP server queries Salesforce (with your permissions)
5. Returns structured data
6. ChatGPT analyzes and responds
7. Ask follow-up—MCP handles it automatically

What MCP Provides

We will dive deeper into the design of MCP server is lesson 4. Here is a quick overview:

Tools

Functions the AI can call:

query_sales(region, quarter)
create_ticket(customer, issue)
generate_report(type, date_range)

Resources

Documentation the AI can read:

salesforce://accounts/tiers
jira://issues/severity-and-escalation
s3://reports/quarterly/{year}

Prompts

Workflow templates for common tasks:

"Customer health check" (combines multiple data sources)
"Weekly standup summary" (aggregates JIRA, Git, Slack)
"Compliance audit prep" (gathers required documentation)

Enterprise Requirements

Building a "hello world" MCP server is easy. Building one for enterprise is not.

Enterprise MCP servers must be:

Business Focused

Easy to connect by non technical people (no local installation)
Connected to the organization data fabric
Domain specific (different per department)

Secure

OAuth 2.0 authentication (no API keys)
Integration with enterprise identity providers (Cognito, Okta, Entra)
Audit logging for compliance
Input validation to prevent injection

Reliable

99.9%+ uptime
Graceful degradation
Retry logic and circuit breakers
Proper error handling

Observable

Structured logging
Metrics and dashboards
Alerting on failures
Performance tracking

Maintainable

Type-safe implementation
Comprehensive tests
CI/CD pipelines
Documentation

Scalable

Handle concurrent users
Cost-effective at scale and can scale to zero
Global availability options

Why Most Tutorials Fail

Search for "MCP tutorial" and you'll find:

# A typical tutorial example
from mcp import Server

server = Server()

@server.tool()
def hello(name: str) -> str:
    return f"Hello, {name}!"

server.run()

This runs on localhost. It has no authentication. No error handling. No tests. No deployment story.

Try deploying this to production for 10,000 employees.

You'll quickly discover:

How do users authenticate?
How can it connect securely to data systems
Where does this run?
How do we update it?
What happens when it fails?
How do we know it's working?
Who's responsible for it?

This course answers all these questions.

The PMCP Approach

The PMCP SDK and cargo-pmcp toolkit provide:

Challenge	PMCP Solution
Authentication	Built-in OAuth with identity providers
Deployment	One-command deploy to Lambda, Workers, Cloud Run
Testing	Schema-driven test generation
Observability	Middleware for logging and metrics
Type Safety	Rust's compile-time guarantees
Validation	Automatic input/output schema validation

You focus on business logic. PMCP handles the infrastructure.

What You'll Learn

By the end of this section, you'll understand:

Why do we need MCP in the age of LLMs (statistic models vs. symbolic computation)
Why MCP over alternatives (custom integrations, RAG, etc.)
Why Rust for enterprise (safety, performance, reliability)
How to build production-ready servers from day one

Let's start with why MCP beats the alternatives.

Continue to The AI Integration Problem →

The AI Integration Problem

The Fundamental Disconnect

Large Language Models are remarkable at reasoning, summarizing, and generating content. But they have a critical limitation: they can only work with what's in their context window.

Your enterprise data lives in:

Relational databases (PostgreSQL, MySQL, SQL Server)
Data warehouses (Snowflake, BigQuery, Redshift)
SaaS platforms (Salesforce, HubSpot, Workday)
Internal APIs and microservices
Document stores and file systems
Real-time event streams

None of this is visible to an LLM by default.

Statistical Models vs. Symbolic Computation

To understand why this matters, we need to distinguish between what LLMs do well and what they don't.

What LLMs Excel At

LLMs are statistical models trained on vast amounts of text. They excel at:

Pattern recognition: Understanding intent from natural language
Synthesis: Combining information into coherent narratives
Translation: Converting between formats, languages, and styles
Reasoning: Following logical chains (with limitations)

When you ask "What were our Q3 sales?", the LLM perfectly understands your intent.

What LLMs Cannot Do

LLMs cannot perform symbolic computation—precise operations on structured data:

Query a database
Call an API with exact parameters
Perform arithmetic on large numbers
Access real-time information
Execute business logic

When the LLM understands you want Q3 sales, it has no way to fetch that data.

The AI Capability Spectrum

The diagram below illustrates the full spectrum of AI tasks, from probabilistic pattern recognition (where LLMs excel natively) to deterministic symbolic computation (where external tools are essential).

The MCP Spectrum - Extending LLM Intelligence with External Tools & Data

On the left side, tasks like creative writing, sentiment analysis, and language translation are native LLM strengths—probabilistic pattern matching on training data. Moving toward the center, tasks like code generation and data analysis benefit from MCP augmentation but can partially work with LLM reasoning alone.

On the right side, tasks become impossible without external tools: database queries require actual database connections, real-time data needs live APIs, and exact math demands calculators. These deterministic tasks are where MCP servers become essential.

The key insight: Enterprise value increasingly lives on the right side of this spectrum. While LLMs excel at creative and probabilistic tasks, business operations require precision, real-time data, and system integration—exactly what MCP provides.

The Integration Gap

This creates a fundamental gap:

┌─────────────────────────────────────────────────────────────────┐
│                                                                 │
│   Human Intent          LLM Understanding         Actual Data   │
│   ─────────────         ─────────────────         ───────────   │
│                                                                 │
│   "What were our   ───▶  Understands the    ───▶  ??? No way    │
│    Q3 sales by           question perfectly       to access     │
│    region?"                                       Salesforce    │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

The human has to bridge this gap manually—the copy-paste tax we discussed.

Why This Problem Is Getting Worse

Data Volume Is Exploding

Enterprise data doubles every 2-3 years. The gap between "what AI could analyze" and "what AI can access" widens continuously.

AI Expectations Are Rising

After seeing demos of AI assistants that seem capable of anything, users expect the same from enterprise tools. The reality disappoints.

Security Requirements Are Tightening

Simply pasting data into AI tools violates:

Data residency requirements (GDPR, CCPA)
Industry regulations (HIPAA, SOC2, PCI-DSS)
Internal security policies
Audit and compliance requirements

The manual workaround isn't just inefficient—it's increasingly illegal.

Multi-System Workflows Are Common

Real business questions rarely involve a single system:

"Which customers with open support tickets have contracts expiring this quarter?"

This requires:

Query the ticketing system (Zendesk/Jira)
Query the CRM (Salesforce)
Query the contract database
Join and analyze the results

No amount of copy-paste makes this efficient.

The Cost of Manual Integration

Let's quantify the problem for a typical enterprise:

Direct Costs

Activity	Time per Instance	Frequency	Annual Cost (at $75/hr)
Copy-paste data into AI	5 minutes	10x/day/employee	$15,625/employee
Re-run queries for context	10 minutes	5x/day/employee	$15,625/employee
Fix errors from manual transfer	15 minutes	2x/day/employee	$9,375/employee
Total per employee			$40,625/year

For a 1,000-person knowledge workforce: $40 million annually.

Indirect Costs

Inconsistent answers: Different employees get different results for the same question
Stale data: By the time it's pasted, it may be outdated
Security incidents: Sensitive data exposed through AI chat logs
Compliance violations: Audit failures, potential fines
Missed opportunities: Questions not asked because the process is too painful

What's Needed: A Bridge

The solution requires a programmatic bridge between:

Natural language understanding (what the LLM does)
Precise data operations (what enterprise systems do)

This bridge must be:

Requirement	Why
Secure	Enterprise data requires authentication, authorization, audit
Structured	AI needs to know what operations are available and how to call them
Reliable	Business processes can't depend on flaky integrations
Discoverable	AI should find relevant tools without human guidance
Composable	Complex workflows require multiple operations

This is exactly what the Model Context Protocol provides.

Preview: How MCP Solves This

MCP creates a standard interface between AI assistants and external systems:

┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│             │     │             │     │             │
│     AI      │────▶│    MCP      │────▶│  Enterprise │
│  Assistant  │     │   Server    │     │   System    │
│             │◀────│             │◀────│             │
│  (Claude,   │     │  (Your      │     │  (Database, │
│   Copilot)  │     │   Code)     │     │   API, etc) │
│             │     │             │     │             │
└─────────────┘     └─────────────┘     └─────────────┘
     │                    │                    │
     │   "Get Q3 sales"   │                    │
     │───────────────────▶│                    │
     │                    │  SELECT sum(...)   │
     │                    │───────────────────▶│
     │                    │                    │
     │                    │◀───────────────────│
     │   Structured data  │   Query results    │
     │◀───────────────────│                    │

The AI assistant:

Discovers available tools from the MCP server
Decides which tool to call based on the user's question
Calls the tool with appropriate parameters
Receives structured results
Synthesizes a response for the user

The human never touches raw data. The AI never accesses systems directly. The MCP server mediates every interaction with full security and audit capability.

Enterprise Authentication Flow

In enterprise deployments, security is paramount. MCP supports OAuth 2.0 authentication, enabling the AI assistant to act on behalf of the authenticated user:

┌─────────────┐     ┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│             │     │             │     │             │     │             │
│    User     │     │     AI      │     │    MCP      │     │  Enterprise │
│             │     │  Assistant  │     │   Server    │     │   System    │
│             │     │             │     │             │     │             │
└──────┬──────┘     └──────┬──────┘     └──────┬──────┘     └──────┬──────┘
       │                   │                   │                   │
       │  1. Authenticate  │                   │                   │
       │   (OAuth/SSO)     │                   │                   │
       │──────────────────▶│                   │                   │
       │                   │                   │                   │
       │  2. Access Token  │                   │                   │
       │◀──────────────────│                   │                   │
       │                   │                   │                   │
       │  3. "Get Q3 sales"│                   │                   │
       │──────────────────▶│                   │                   │
       │                   │                   │                   │
       │                   │  4. Tool call +   │                   │
       │                   │     Access Token  │                   │
       │                   │──────────────────▶│                   │
       │                   │                   │                   │
       │                   │                   │  5. Validate      │
       │                   │                   │     token &       │
       │                   │                   │     check perms   │
       │                   │                   │                   │
       │                   │                   │  6. Query with    │
       │                   │                   │     user context  │
       │                   │                   │──────────────────▶│
       │                   │                   │                   │
       │                   │                   │◀──────────────────│
       │                   │                   │  7. Results       │
       │                   │  8. Structured    │     (filtered by  │
       │                   │     response      │      user perms)  │
       │                   │◀──────────────────│                   │
       │                   │                   │                   │
       │  9. AI-generated  │                   │                   │
       │     answer        │                   │                   │
       │◀──────────────────│                   │                   │

This flow ensures:

Security Property	How It's Achieved
Identity verification	User authenticates via corporate IdP (Cognito, Okta, Entra ID)
Delegated access	AI acts with user's permissions, not elevated privileges
Data filtering	Enterprise system returns only data the user can see
Audit trail	Every request is logged with user identity and timestamp
Token expiration	Short-lived tokens limit exposure window
Scope limitation	Tokens specify exactly which operations are permitted

The user sees a seamless AI experience. Behind the scenes, every interaction is authenticated, authorized, and auditable—meeting the strictest enterprise compliance requirements.

But MCP isn't the only approach to AI integration. In the next section, we'll compare it to alternatives and explain why MCP is the right choice for enterprise.

Continue to Why MCP Over Alternatives →

Why MCP Over Alternatives

Before MCP, organizations tried several approaches to connect AI with enterprise data. Each has significant drawbacks that MCP addresses.

Alternative 1: Fine-Tuning LLMs

"What's the best way to personalize AI to understand my business data?" "Fine-tune the model on our data."

This was the conventional wisdom—and it's wrong for most use cases.

Why Fine-Tuning Made Sense (Historically)

Early LLMs performed poorly on domain-specific language. Terms have different meanings in different contexts:

Term	General Meaning	Domain-Specific Meaning
Consideration	Thoughtful attention	Something of value exchanged that makes a contract legally binding (Legal)
Discharge	To release or let go	Release of a patient from hospital care, or fluid emitted from the body (Medical)
Margin	The edge of something	Difference between cost and selling price, or collateral for trading (Financial)

Fine-tuning taught models this specialized vocabulary.

Why Fine-Tuning Is No Longer The Answer

1. Foundation models have caught up

Modern LLMs (GPT-5, Claude Sonnet/Opus 4.7, Gemini 3) are trained extensively on healthcare, financial, and legal domains. The vocabulary problem is largely solved.

2. Fine-tuning doesn't give access to your data

Even a fine-tuned model can't answer "What were our Q3 sales?" It learned patterns from training data—it didn't learn to query your Salesforce instance. Fine-tuning teaches language, not data access.

3. Models change faster than you can fine-tune

By the time you've fine-tuned GPT-4, GPT-5 is out. Your investment is frozen in an outdated base model. With MCP, you switch foundation models without changing your integration code.

4. Fine-tuning requires rare expertise

Fine-tuning requires experienced ML engineers and data scientists. MCP servers are standard software engineering—skills every organization already has.

5. Data leakage risks

Fine-tuning on sensitive data risks that data appearing in model outputs. A secret project name might suddenly surface in responses. MCP servers query data at runtime with proper access controls—nothing is baked into the model.

6. No audit trail

When a fine-tuned model produces an answer, you can't trace where it came from. MCP calls are fully logged: which tool, which parameters, which user, when.

The following diagram summarizes the fundamental architectural difference between the two approaches:

AI System Customization: MCP Servers vs Fine-Tuning

With MCP servers (left), the AI queries live data through tool calls, preserving security and traceability. With fine-tuning (right), data is baked into the model during training—immediately becoming stale and impossible to trace.

The Verdict on Fine-Tuning

Fine-tuning still has niche applications—specialized vocabulary in narrow domains where foundation models underperform. But for connecting AI to enterprise data? It's the wrong tool entirely.

Alternative 2: Retrieval-Augmented Generation (RAG)

RAG improves on fine-tuning by retrieving relevant documents at query time rather than baking knowledge into the model.

How RAG Works

┌──────────────┐     ┌──────────────┐     ┌──────────────┐
│              │     │              │     │              │
│  User Query  │────▶│   Vector     │────▶│  Retrieve    │
│              │     │   Search     │     │  Documents   │
│              │     │              │     │              │
└──────────────┘     └──────────────┘     └──────────────┘
                                                 │
                                                 ▼
┌──────────────┐     ┌──────────────┐     ┌──────────────┐
│              │     │              │     │              │
│   Response   │◀────│     LLM      │◀────│  Augmented   │
│              │     │              │     │   Prompt     │
│              │     │              │     │              │
└──────────────┘     └──────────────┘     └──────────────┘

Where RAG Falls Short

1. Documents aren't data

RAG retrieves text chunks. It can't execute SELECT SUM(revenue) FROM sales WHERE quarter='Q3'. Enterprise questions often require computation, not document retrieval.

2. Semantic search isn't always the right retrieval

"What were our Q3 sales by region?" doesn't need semantically similar documents. It needs a specific database query. RAG retrieves based on meaning; business queries often need exact matches.

3. No actions, only reading

RAG can read documents. It can't create a ticket, send an email, or update a record. MCP supports both read operations (Resources) and write operations (Tools).

4. Context window limits

RAG stuffs retrieved documents into the prompt. With limited context windows, you can only include so much. MCP returns structured data—compact and precise.

5. Stale embeddings

Vector databases need re-indexing when source documents change. MCP queries live data every time.

When RAG Makes Sense

RAG excels for knowledge bases, documentation search, and Q&A over static document collections. It complements MCP—use RAG for unstructured knowledge, MCP for structured data and actions.

Alternative 3: Hand-Written Agent Code

Many teams build custom agents with API calls embedded directly in agent code:

# The "hand-written agent" anti-pattern
class SalesAgent:
    def __init__(self):
        self.salesforce_client = SalesforceAPI(...)
        self.jira_client = JiraAPI(...)
        self.slack_client = SlackAPI(...)
    
    def handle_query(self, user_query: str):
        # LLM decides what to do
        intent = self.llm.classify(user_query)
        
        if intent == "sales_query":
            # Hard-coded API integration
            data = self.salesforce_client.query(...)
            return self.llm.summarize(data)
        
        elif intent == "create_ticket":
            # Another hard-coded integration
            self.jira_client.create_issue(...)
        
        # ... dozens more elif branches

This approach seems pragmatic but creates significant problems at scale.

Problems with Hand-Written Agents

1. Tight coupling

The agent code is tightly bound to specific APIs. Changing from Salesforce to HubSpot requires rewriting the agent, not just swapping a connector.

2. No discoverability

The LLM can only use tools the developer anticipated. MCP servers advertise their capabilities—the LLM discovers available tools dynamically.

3. No reusability

Every team builds their own Salesforce integration. With MCP, one server serves all AI applications in the organization.

4. Authentication nightmare

Each integration handles auth differently. OAuth flows, API keys, and token refresh logic scattered throughout agent code. MCP centralizes authentication at the server level.

5. No standard testing

How do you test that the agent correctly calls the Jira API? With MCP, standard tools (MCP Inspector, mcp-tester) validate any server.

6. Vendor lock-in

An agent built for ChatGPT's function calling won't work with Claude. MCP is an open standard—build once, connect to any compliant client.

7. Scaling challenges

Hand-written agents run in a single process. MCP servers can be deployed independently—scale the Salesforce server without touching the Jira server.

The Maintenance Burden

Consider maintaining 20 API integrations across 5 different AI applications:

Approach	Integration Points	Maintenance Burden
Hand-written agents	20 × 5 = 100	Every app maintains every integration
MCP servers	20 + 5 = 25	Each server maintained once, shared by all apps

As integrations and applications grow, MCP's advantage compounds.

MCP: The Right Abstraction

MCP succeeds because it provides the right level of abstraction:

Challenge	Fine-Tuning	RAG	Hand-Written	MCP
Access live data	No	Partial	Yes	Yes
Perform actions	No	No	Yes	Yes
Audit trail	No	Partial	Manual	Built-in
Model flexibility	No	Yes	No	Yes
Reusable across apps	No	Partial	No	Yes
Standard protocol	No	No	No	Yes
Enterprise auth	N/A	Custom	Custom	OAuth 2.0
Engineering skills	ML/Data Science	ML/Engineering	Engineering	Engineering

MCP Complements, Not Replaces

MCP doesn't eliminate other approaches—it provides the integration layer:

Fine-tuned models can be MCP clients, calling MCP servers for data
RAG systems can be exposed as MCP Resources for document retrieval
Existing APIs can be wrapped in MCP servers for standardized access

MCP is the universal adapter that connects AI to everything else.

The MCP Ecosystem

The Model Context Protocol, is an open protocol, published by Anthropic in late 2024, has been adopted across the industry:

Anthropic: Claude Desktop, Claude Code, Claude mobile apps
OpenAI: ChatGPT desktop applications
Google: Gemini integrations
Microsoft: GitHub Copilot, VS Code extensions
Cursor, Windsurf, Zed: IDE integrations

Building an MCP server means building once for all these platforms.

Who Builds MCP Servers?

Platform vendors build servers for their products (Google Workspace, GitHub, Slack)
Enterprises build servers for internal systems (custom databases, proprietary APIs)
You will build servers that connect AI to your organization's unique data

Knowledge Check

Test your understanding of AI integration approaches and why MCP is the right choice for enterprise:

MCP is the right protocol. But why implement it in Rust? In the next section, we explore why Rust is the ideal language for enterprise MCP servers.

Continue to Why Rust for Enterprise →

Why Rust for Enterprise MCP Servers

As enterprises begin building internal MCP servers, the choice of programming language becomes strategic. The default instinct is often to use whatever language the team already knows—Java, C#, Python, or TypeScript. However, for systems that expose sensitive business capabilities to AI agents, language choice has direct implications for security, performance, maintainability, and long-term cost.

The Language Decision Matrix

When evaluating languages for enterprise MCP servers, consider these dimensions:

Security & Memory Safety: Protection against buffer overflows, use-after-free, data races
Performance & Efficiency: Latency, throughput, resource consumption
Deployment & Ops Simplicity: Binary size, startup time, dependency management
Maintainability & Long-Term Cost: Refactoring safety, code clarity over time
Ecosystem & Enterprise Readiness: Libraries, frameworks, corporate adoption
Concurrency Model: Handling parallel requests safely
Tooling & Dev Assistance: IDE support, AI coding assistance effectiveness
Reliability & Correctness: Compile-time guarantees, runtime predictability

The following radar chart compares Rust, Python, TypeScript, and Java/C# across these enterprise requirements:

Enterprise MCP Server Language Comparison

Rust dominates in security, performance, reliability, and deployment simplicity—the dimensions that matter most for infrastructure that bridges AI and enterprise systems.

1. Security by Construction

The majority of cybersecurity vulnerabilities in modern systems—buffer overflows, memory corruption, data races, use-after-free bugs—are prevented entirely by Rust's compiler and ownership model.

For CIOs and CISOs, this translates to concrete benefits:

Security Benefit	Business Impact
No buffer overflows	Eliminates entire vulnerability class
No data races	Safe concurrent access to shared state
No null pointer exceptions	Predictable behavior, fewer crashes
No use-after-free	Memory safety without garbage collection

When MCP servers act as the bridge between AI agents and internal systems, reducing risk is not optional. Rust enforces safety at compile time—before code ever runs inside your infrastructure.

The CVE Perspective

Microsoft and Google have independently reported that ~70% of their security vulnerabilities are memory safety issues. Rust eliminates this entire category by design.

#![allow(unused)]
fn main() {
// This won't compile - Rust prevents data races at compile time
fn dangerous_concurrent_access() {
    let mut data = vec![1, 2, 3];
    
    std::thread::spawn(|| {
        data.push(4);  // Error: cannot borrow `data` as mutable
    });
    
    println!("{:?}", data);
}
}

2. Performance That Impacts Business Value

MCP servers often sit on critical paths:

Answering low-latency requests from LLMs
Serving real-time enterprise data
Running high-volume automation workflows

Rust's performance matches C/C++ but with far stronger safety guarantees:

Metric	Rust	Python	TypeScript	Java
Cold start (Lambda)	~10ms	~300ms	~150ms	~500ms
Memory footprint	10-20MB	50-100MB	40-80MB	100-200MB
Requests/sec (typical)	50,000+	1,000-5,000	5,000-15,000	10,000-30,000

Approximate figures for typical MCP server workloads

Why Performance Matters for MCP

High performance enables:

Faster responses → Better user adoption, lower frustration
More responsive autonomous workflows → AI agents don't wait
Lower cloud spend → Fewer CPU cycles for the same work
Better scalability → Handle traffic spikes gracefully

In an AI-native enterprise, performance isn't a nice-to-have—it's a force multiplier.

Serverless Cost Implications

On AWS Lambda, you pay for GB-seconds. A Rust function that completes in 10ms costs 1/30th of a Python function that takes 300ms—for identical functionality.

At scale, this difference compounds:

Monthly Invocations	Python Cost	Rust Cost	Annual Savings
1 million	$50	$2	$576
100 million	$5,000	$167	$57,996
1 billion	$50,000	$1,667	$579,996

3. A Language Built for AI-Assisted Development

A surprising benefit of Rust in the age of LLMs: it works exceptionally well with AI coding assistants.

Why? Rust's compiler gives exact, helpful error messages and enforces correctness at the type system level. This allows AI tools like Claude, ChatGPT, and Copilot to:

Generate high-quality code with fewer logical errors
Fix mistakes rapidly using compiler feedback
Maintain consistent patterns across teams

The "Read, Don't Write" Paradigm

"You don't need to learn how to write Rust. You need to learn how to read Rust."

The AI writes the code. Developers validate it. The compiler catches mistakes before they reach production.

This dramatically increases productivity for teams adopting MCP—especially teams new to Rust:

#![allow(unused)]
fn main() {
// AI-generated MCP tool implementation
#[tool(
    name = "query_sales",
    description = "Query sales data by region and quarter"
)]
async fn query_sales(
    #[arg(description = "Sales region (NA, EMEA, APAC)")] 
    region: String,
    #[arg(description = "Quarter (Q1, Q2, Q3, Q4)")] 
    quarter: String,
) -> Result<SalesReport, ToolError> {
    // AI generates the implementation
    // Compiler ensures it's correct
    // Developer reviews and approves
}
}

The developer's job shifts from writing boilerplate to reviewing business logic.

4. Predictable, Maintainable, Long-Lived Services

Enterprise MCP servers will remain in production for years, serving mission-critical workflows. Rust provides long-term stability through:

No Garbage Collector

Rust has no GC, which means:

Predictable latency → No GC pauses during requests
Consistent performance → Same speed at 1 req/sec or 10,000 req/sec
Lower memory usage → No GC overhead

Strong, Opinionated Ecosystem

Tool	Purpose	Quality
Cargo	Build and dependency management	Best-in-class
rustfmt	Code formatting	Eliminates style debates
Clippy	Linting and suggestions	Catches subtle bugs
rust-analyzer	IDE support	Excellent completions and refactoring

Refactoring Safety

Rust's type system makes large refactors safe:

#![allow(unused)]
fn main() {
// Change a function signature
fn process_order(order: Order) -> Result<Receipt, OrderError>
// to
fn process_order(order: Order, user: &User) -> Result<Receipt, OrderError>

// The compiler identifies EVERY call site that needs updating
// Nothing slips through to production
}

In dynamic languages, this refactor could introduce silent bugs. In Rust, the compiler ensures completeness.

5. Deployment Simplicity

Rust compiles to a single static binary with no runtime dependencies:

# Build for production
cargo build --release

# Result: one file, ~5-15MB, ready to deploy
ls -la target/release/my-mcp-server
# -rwxr-xr-x 1 user user 8.2M my-mcp-server

Compare this to:

Python: Requires Python runtime, virtualenv, pip dependencies
TypeScript: Requires Node.js runtime, node_modules
Java: Requires JVM, possibly application server

Container Images

Language	Typical Image Size	Rust Equivalent
Python	400-800MB	20-50MB
Node.js	200-400MB	20-50MB
Java	300-600MB	20-50MB

Smaller images mean faster deployments, lower storage costs, and reduced attack surface.

The PMCP Advantage

The PMCP SDK builds on Rust's strengths to provide enterprise-ready MCP development:

Challenge	PMCP Solution
Learning curve	`cargo-pmcp` generates idiomatic code
Boilerplate	Derive macros handle JSON-RPC, schemas
Testing	Built-in test utilities and mocking
Deployment	One-command deploy to Lambda, Workers, Cloud Run
Observability	Middleware for logging, metrics, tracing

You get Rust's benefits without fighting the language.

When Rust Might Not Be Right

To be fair, Rust isn't always the best choice:

Rapid prototyping: Python/TypeScript iterate faster for throwaway code
Team expertise: If your team is deeply invested in another language
Existing infrastructure: If you have mature deployment pipelines for other languages
Simple, low-stakes servers: A weekend project doesn't need Rust's guarantees

However, for enterprise MCP servers—systems that will run for years, handle sensitive data, and bridge AI with critical infrastructure—Rust's upfront investment pays dividends.

Summary: Why Rust for MCP

Requirement	Why Rust Delivers
Security	Memory safety prevents 70% of vulnerability classes
Performance	C-level speed, 10-30x faster than Python
Reliability	No GC pauses, predictable latency
Maintainability	Compiler-enforced refactoring safety
Deployment	Single binary, tiny containers
AI-Assisted Dev	Compiler feedback enables AI coding
Long-term Cost	Lower cloud bills, fewer incidents, easier maintenance

Your internal MCP services become assets, not liabilities.

Now that you understand why MCP and why Rust, let's build your first production-ready MCP server.

Continue to Your First Production Server →

Your First Production Server

Prerequisites: Make sure you've completed the Development Environment Setup before continuing. You'll need Rust, cargo-pmcp, and Claude Code installed.

Let's build your first MCP server. We'll get it running and connected to Claude in under 5 minutes—then we'll explore how it works.

Quick Start: From Zero to Working Server

Step 1: Create the Workspace

cargo pmcp new my-mcp-servers
cd my-mcp-servers

This creates a workspace structure for building MCP servers.

Step 2: Add a Calculator Server

cargo pmcp add server calculator --template calculator

This generates a complete, working MCP server with example tools.

Step 3: Build and Run

cargo pmcp dev calculator

You should see:

INFO Starting MCP server "calculator" v1.0.0
INFO Listening on http://0.0.0.0:3000

Your server is running.

Step 4: Connect to Claude Code

In a new terminal, add the server to Claude Code:

claude mcp add calculator -t http http://0.0.0.0:3000

That's it—Claude Code now knows about your server.

Step 5: Try It!

Start Claude Code and ask:

"What is 1234 + 5678?"

Claude will call your add tool and respond with the result. You just built an MCP server!

Try a few more:

"Calculate 100 divided by 7"
"What's 15 times 23?"
"Divide 10 by 0" (watch the error handling)

What Just Happened?

In those 5 steps, you created a production-ready MCP server that:

Feature	What It Does
Type-safe inputs	Invalid inputs are rejected automatically
Structured outputs	Results include both values and descriptions
Error handling	Division by zero returns a proper error, not a crash
JSON Schema	Claude knows exactly what parameters each tool accepts
HTTP transport	Ready for cloud deployment

This isn't a toy example—it's the same foundation you'll use for enterprise servers.

Testing with MCP Inspector

Before connecting to Claude, you can test your server interactively using MCP Inspector:

npx @modelcontextprotocol/inspector http://localhost:3000/mcp

This opens a web UI where you can:

Browse available tools and their schemas
Call tools with test inputs
See the raw JSON-RPC messages

Try the divide tool with divisor: 0 to see how errors are handled.

Project Structure

Let's look at what cargo pmcp generated:

my-mcp-servers/
├── Cargo.toml              # Workspace manifest
├── pmcp.toml               # PMCP configuration
├── server-common/          # Shared HTTP bootstrap code
│   ├── Cargo.toml
│   └── src/lib.rs
└── servers/
    └── calculator/         # Your calculator server
        ├── Cargo.toml
        └── src/
            ├── main.rs     # Entry point
            └── tools/
                ├── mod.rs
                └── calculator.rs

Why a workspace? As you build more servers, they'll share the server-common code for HTTP handling, authentication, and other infrastructure. This keeps each server focused on business logic.

Your Turn: Build Your First Server

You've seen the calculator server in action. Now build your own MCP server from scratch.

Chapter 2 Exercises - Start with Exercise 1: Your First MCP Server

Next Steps

Now that you have a working server, the following sections will cover:

Building and Running - Understanding the workspace structure
The Calculator Server - Deep dive into the generated code
Understanding the Code - Rust patterns and PMCP conventions
Testing with MCP Inspector - Advanced debugging techniques

Continue to Building and Running →

Development Environment Setup

Before building your first MCP server, let's set up your development environment. You'll need three things:

Rust - The programming language
cargo-pmcp - The PMCP development toolkit
An MCP client - To test and use your servers

Installing Rust

If you don't have Rust installed, run:

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

Follow the prompts and select the default installation.

macOS users: You may need to install Xcode command line tools first: xcode-select --install

After installation, restart your terminal and verify:

rustc --version
# Should output: rustc 1.82.0 or later

cargo --version
# Should output: cargo 1.82.0 or later

Installing cargo-pmcp

Install the PMCP development toolkit:

cargo install cargo-pmcp

This provides several commands you'll use throughout this course:

Command	Purpose
`cargo pmcp new`	Create a new MCP workspace
`cargo pmcp add`	Add servers and tools to your workspace
`cargo pmcp dev`	Run a server in development mode
`cargo pmcp test`	Run MCP-specific tests
`cargo pmcp deploy`	Deploy to cloud platforms

Verify installation:

cargo pmcp --version

Choosing an MCP Client

MCP servers need a client to connect to. Several developer-friendly MCP clients are available:

Client	Best For	MCP Support
Claude Code	Terminal-based development, CLI workflows	Excellent
Cursor	AI-assisted coding in VS Code fork	Good
Gemini Code Assist	Google Cloud integrated development	Good
Cline	VS Code extension for AI coding	Good
Kiro	AWS-focused agentic IDE	Good
Codex CLI	OpenAI's terminal assistant	Basic

For this course, we recommend Claude Code. It has excellent MCP support, works entirely in the terminal, and makes it easy to add and manage MCP servers.

Installing Claude Code

macOS and Linux

curl -fsSL https://claude.ai/install.sh | bash

Windows

irm https://claude.ai/install.ps1 | iex

After installation, verify it works:

claude --version

First Run

The first time you run Claude Code, you'll need to authenticate:

claude

Follow the prompts to log in with your Anthropic account.

Adding MCP Servers to Claude Code

Once your MCP server is running, you can add it to Claude Code with a single command:

claude mcp add <server-name> -t http <server-url>

For example:

claude mcp add calculator -t http http://localhost:3000

You can list your configured servers:

claude mcp list

And remove servers you no longer need:

claude mcp remove calculator

MCP Inspector (Optional)

MCP Inspector is a debugging tool that lets you interact with MCP servers directly, without going through an AI client. It's useful for testing and troubleshooting.

No installation needed—just run with npx:

npx @modelcontextprotocol/inspector http://localhost:3000/mcp

This opens a web UI where you can browse tools, call them with test inputs, and see the raw JSON-RPC messages.

Configuring Your IDE

For writing Rust code, configure your preferred IDE:

VS Code

Install these extensions:

rust-analyzer - Rust language support (essential)
Even Better TOML - TOML syntax highlighting
CodeLLDB - Debugging support

Cursor

Cursor includes rust-analyzer support. Enable it in settings and you're ready to go.

RustRover

JetBrains RustRover works out of the box with Rust projects—no additional configuration needed.

Zed

Zed has built-in Rust support with excellent performance.

Enterprise Considerations

In enterprise environments, you may need to:

Configure cargo to use an internal registry or mirror
Set up proxy settings for cargo and rustup
Use a corporate certificate authority

Consult your IT department's Rust setup guide for organization-specific instructions.

Verify Your Setup

Let's confirm everything is working:

# Check Rust
rustc --version && cargo --version

# Check cargo-pmcp
cargo pmcp --version

# Check Claude Code
claude --version

If all three commands succeed, you're ready to build your first MCP server!

Knowledge Check

Test your understanding of the setup process:

Continue to Your First Production Server →

Building and Running

Now that you've seen the quick start, let's understand what cargo pmcp created and how to work with it effectively.

The Workspace Structure

When you ran cargo pmcp new my-mcp-servers, it created a Cargo workspace:

my-mcp-servers/
├── Cargo.toml              # Workspace manifest
├── pmcp.toml               # PMCP configuration
├── server-common/          # Shared infrastructure code
│   ├── Cargo.toml
│   └── src/
│       └── lib.rs
└── servers/                # Your MCP servers live here
    └── calculator/
        ├── Cargo.toml
        └── src/
            ├── main.rs
            └── tools/
                ├── mod.rs
                └── calculator.rs

Why a Workspace?

A Cargo workspace lets you manage multiple related packages together. For MCP development, this provides:

Benefit	How It Helps
Shared dependencies	All servers use the same versions of pmcp, serde, etc.
Common code	`server-common` is shared across all servers
Single build	`cargo build` compiles everything together
Consistent tooling	One `cargo fmt`, one `cargo clippy` for all

As you build more MCP servers, they all go in the servers/ directory and share the common infrastructure.

The Workspace Manifest

The root Cargo.toml defines the workspace:

[workspace]
resolver = "2"
members = [
    "server-common",
    "servers/*",
]

[workspace.dependencies]
pmcp = "1.8"
tokio = { version = "1", features = ["full"] }
serde = { version = "1", features = ["derive"] }
serde_json = "1"
schemars = "0.8"
tracing = "0.1"
tracing-subscriber = { version = "0.3", features = ["env-filter"] }
anyhow = "1"
async-trait = "0.1"

Key points:

members includes server-common and all packages under servers/
[workspace.dependencies] defines shared dependency versions
Individual packages inherit these with dependency.workspace = true

The PMCP Configuration

The pmcp.toml file configures cargo-pmcp behavior:

[workspace]
name = "my-mcp-servers"
default_server = "calculator"

[servers.calculator]
package = "calculator"
port = 3000

[deploy]
default_target = "lambda"

This tells cargo pmcp dev which server to run by default and on which port.

Server-Common: Shared Infrastructure

The server-common crate provides HTTP server bootstrap code that all your MCP servers share:

#![allow(unused)]
fn main() {
// server-common/src/lib.rs
use pmcp::server::streamable_http_server::{
    StreamableHttpServer, 
    StreamableHttpServerConfig
};
use pmcp::Server;
use std::net::SocketAddr;
use std::sync::Arc;
use tokio::sync::Mutex;

/// Start an HTTP server for the given MCP server
pub async fn serve_http(
    server: Server,
    addr: SocketAddr,
) -> Result<(), Box<dyn std::error::Error>> {
    let server = Arc::new(Mutex::new(server));
    
    let config = StreamableHttpServerConfig {
        session_id_generator: None,   // Stateless mode
        enable_json_response: true,
        event_store: None,
        on_session_initialized: None,
        on_session_closed: None,
        http_middleware: None,
    };
    
    let http_server = StreamableHttpServer::with_config(addr, server, config);
    let (bound_addr, handle) = http_server.start().await?;
    
    tracing::info!("MCP server listening on http://{}/mcp", bound_addr);
    
    handle.await?;
    Ok(())
}
}

By centralizing this code, you:

Update HTTP handling once, all servers benefit
Keep server code focused on business logic
Ensure consistent configuration across servers

Running Your Server

Development Mode

Use cargo pmcp dev for local development:

# Run the default server (from pmcp.toml)
cargo pmcp dev

# Run a specific server
cargo pmcp dev calculator

# Run on a different port
cargo pmcp dev calculator --port 8080

Development mode includes:

Hot reloading (rebuilds on file changes)
Verbose logging
Pretty-printed output

Production Build

For production, build a release binary:

cargo build --release --package calculator

The binary is at target/release/calculator (~5-15MB, no runtime dependencies).

Run it directly:

./target/release/calculator

Or with environment configuration:

RUST_LOG=info PORT=3000 ./target/release/calculator

Adding More Servers

Add a new server to your workspace:

cargo pmcp add server inventory --template basic

This creates servers/inventory/ with the standard structure. Your workspace now has:

servers/
├── calculator/
└── inventory/

Both servers share server-common and workspace dependencies.

Available Templates

cargo pmcp add server supports several templates:

Template	Description
`basic`	Minimal server with one example tool
`calculator`	Math operations with typed inputs/outputs
`database`	Database query patterns with connection pooling
`crud`	Create/Read/Update/Delete operations
`authenticated`	OAuth-protected server template

Use --template to specify:

cargo pmcp add server users --template crud
cargo pmcp add server reports --template database

Building All Servers

Build everything in the workspace:

# Debug build
cargo build

# Release build (optimized)
cargo build --release

# Check without building (faster)
cargo check

Testing

Run tests across the workspace:

# All tests
cargo test

# Tests for a specific server
cargo test --package calculator

# With output
cargo test -- --nocapture

Code Quality

The workspace supports standard Rust quality tools:

# Format all code
cargo fmt

# Lint all code
cargo clippy

# Both (recommended before commits)
cargo fmt && cargo clippy

Summary

Command	Purpose
`cargo pmcp new <name>`	Create a new workspace
`cargo pmcp add server <name>`	Add a server to the workspace
`cargo pmcp dev [server]`	Run in development mode
`cargo build --release`	Build for production
`cargo test`	Run all tests
`cargo fmt && cargo clippy`	Code quality checks

Next, let's look inside the calculator server to understand how tools are defined.

Continue to The Calculator Server →

The Calculator Server

Let's examine the calculator server in detail. This simple example demonstrates all the patterns you'll use in production MCP servers.

Server Entry Point

The main.rs file is the server's entry point:

// servers/calculator/src/main.rs
use pmcp::prelude::*;
use server_common::serve_http;
use std::net::{Ipv4Addr, SocketAddr};

mod tools;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Initialize structured logging
    tracing_subscriber::fmt()
        .with_env_filter("info")
        .init();

    // Build the MCP server
    let server = Server::builder()
        .name("calculator")
        .version("1.0.0")
        .capabilities(ServerCapabilities::tools_only())
        .tool("add", tools::AddTool)
        .tool("subtract", tools::SubtractTool)
        .tool("multiply", tools::MultiplyTool)
        .tool("divide", tools::DivideTool)
        .build()?;

    // Start HTTP server
    let addr = SocketAddr::new(Ipv4Addr::UNSPECIFIED.into(), 3000);
    tracing::info!("Starting calculator server");
    
    serve_http(server, addr).await
}

Key elements:

Line	Purpose
`use pmcp::prelude::*`	Imports common types (Server, ServerCapabilities, etc.)
`mod tools`	Includes the tools module
`#[tokio::main]`	Enables async main function
`Server::builder()`	Fluent API for server configuration
`.tool("name", Handler)`	Registers each tool
`serve_http(server, addr)`	Starts the HTTP transport

Tool Module Structure

Tools are organized in the tools/ directory:

src/tools/
├── mod.rs          # Module exports
└── calculator.rs   # Tool implementations

The mod.rs file exports the tool handlers:

#![allow(unused)]
fn main() {
// src/tools/mod.rs
mod calculator;

pub use calculator::{AddTool, SubtractTool, MultiplyTool, DivideTool};
}

Anatomy of a Tool

Let's examine the AddTool in detail:

#![allow(unused)]
fn main() {
// src/tools/calculator.rs
use async_trait::async_trait;
use pmcp::{ToolHandler, RequestHandlerExtra, Error};
use schemars::JsonSchema;
use serde::{Deserialize, Serialize};
use serde_json::{json, Value};

/// Input arguments for the add operation
#[derive(Debug, Deserialize, JsonSchema)]
pub struct AddArgs {
    /// First number to add
    pub a: f64,
    /// Second number to add  
    pub b: f64,
}

/// Result of the add operation
#[derive(Debug, Serialize, JsonSchema)]
pub struct AddResult {
    /// The sum of a and b
    pub result: f64,
    /// Human-readable expression
    pub expression: String,
}

/// Tool that adds two numbers
pub struct AddTool;

#[async_trait]
impl ToolHandler for AddTool {
    async fn handle(
        &self, 
        args: Value, 
        _extra: RequestHandlerExtra
    ) -> Result<Value, Error> {
        // Parse and validate arguments
        let input: AddArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        // Perform the calculation
        let sum = input.a + input.b;
        
        // Return structured result
        let result = AddResult {
            result: sum,
            expression: format!("{} + {} = {}", input.a, input.b, sum),
        };
        
        Ok(serde_json::to_value(result)?)
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(AddArgs);
        Some(pmcp::types::ToolInfo::new(
            "add",
            Some("Add two numbers together".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}
}

Breaking It Down

1. Input Type with Schema

#![allow(unused)]
fn main() {
#[derive(Debug, Deserialize, JsonSchema)]
pub struct AddArgs {
    /// First number to add
    pub a: f64,
    /// Second number to add  
    pub b: f64,
}
}

Deserialize - Parses JSON into this struct
JsonSchema - Generates JSON Schema for validation
Doc comments (///) become field descriptions in the schema

The generated schema tells Claude exactly what parameters the tool accepts:

{
  "type": "object",
  "properties": {
    "a": { "type": "number", "description": "First number to add" },
    "b": { "type": "number", "description": "Second number to add" }
  },
  "required": ["a", "b"]
}

2. Output Type

#![allow(unused)]
fn main() {
#[derive(Debug, Serialize, JsonSchema)]
pub struct AddResult {
    pub result: f64,
    pub expression: String,
}
}

Serialize - Converts the struct to JSON
Structured output helps Claude understand and use the result

3. The Handler

#![allow(unused)]
fn main() {
#[async_trait]
impl ToolHandler for AddTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        // Implementation
    }
}
}

async - All handlers are async for consistency
args: Value - Raw JSON input from the client
_extra: RequestHandlerExtra - Additional context (we'll use this later)
Returns Result<Value, Error> - JSON value or error

4. Metadata for Discovery

#![allow(unused)]
fn main() {
fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
    let schema = schemars::schema_for!(AddArgs);
    Some(pmcp::types::ToolInfo::new(
        "add",
        Some("Add two numbers together".to_string()),
        serde_json::to_value(&schema).unwrap_or_default(),
    ))
}
}

This tells MCP clients:

Tool name: "add"
Description: "Add two numbers together"
Input schema: Generated from AddArgs

Error Handling: The Divide Tool

The divide tool shows proper error handling:

#![allow(unused)]
fn main() {
#[derive(Debug, Deserialize, JsonSchema)]
pub struct DivideArgs {
    /// The dividend (number to be divided)
    pub dividend: f64,
    /// The divisor (number to divide by)
    pub divisor: f64,
}

pub struct DivideTool;

#[async_trait]
impl ToolHandler for DivideTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: DivideArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        // Validate: prevent division by zero
        if input.divisor == 0.0 {
            return Err(Error::validation("Cannot divide by zero"));
        }
        
        let quotient = input.dividend / input.divisor;
        
        Ok(json!({
            "result": quotient,
            "expression": format!("{} ÷ {} = {}", input.dividend, input.divisor, quotient)
        }))
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(DivideArgs);
        Some(pmcp::types::ToolInfo::new(
            "divide",
            Some("Divide two numbers. Returns an error if divisor is zero.".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}
}

Error Types

PMCP provides error types that map to MCP error codes:

Error Type	When to Use	MCP Code
`Error::validation(msg)`	Invalid input from client	-32602
`Error::internal(msg)`	Server-side failures	-32603
`Error::not_found(msg)`	Resource doesn't exist	-32001
`Error::permission_denied(msg)`	Authorization failure	-32002

When Claude sees a validation error, it understands the request was malformed and can try again with corrected input.

The Complete Calculator Module

Here's the full calculator.rs with all four operations:

#![allow(unused)]
fn main() {
use async_trait::async_trait;
use pmcp::{Error, RequestHandlerExtra, ToolHandler};
use schemars::JsonSchema;
use serde::{Deserialize, Serialize};
use serde_json::{json, Value};

// === Shared Types ===

#[derive(Debug, Serialize, JsonSchema)]
pub struct CalculationResult {
    pub result: f64,
    pub expression: String,
}

// === Add Tool ===

#[derive(Debug, Deserialize, JsonSchema)]
pub struct AddArgs {
    /// First number
    pub a: f64,
    /// Second number
    pub b: f64,
}

pub struct AddTool;

#[async_trait]
impl ToolHandler for AddTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: AddArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        let result = input.a + input.b;
        Ok(serde_json::to_value(CalculationResult {
            result,
            expression: format!("{} + {} = {}", input.a, input.b, result),
        })?)
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(AddArgs);
        Some(pmcp::types::ToolInfo::new(
            "add",
            Some("Add two numbers".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}

// === Subtract Tool ===

#[derive(Debug, Deserialize, JsonSchema)]
pub struct SubtractArgs {
    /// Number to subtract from
    pub a: f64,
    /// Number to subtract
    pub b: f64,
}

pub struct SubtractTool;

#[async_trait]
impl ToolHandler for SubtractTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: SubtractArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        let result = input.a - input.b;
        Ok(serde_json::to_value(CalculationResult {
            result,
            expression: format!("{} - {} = {}", input.a, input.b, result),
        })?)
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(SubtractArgs);
        Some(pmcp::types::ToolInfo::new(
            "subtract",
            Some("Subtract two numbers".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}

// === Multiply Tool ===

#[derive(Debug, Deserialize, JsonSchema)]
pub struct MultiplyArgs {
    /// First factor
    pub a: f64,
    /// Second factor
    pub b: f64,
}

pub struct MultiplyTool;

#[async_trait]
impl ToolHandler for MultiplyTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: MultiplyArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        let result = input.a * input.b;
        Ok(serde_json::to_value(CalculationResult {
            result,
            expression: format!("{} × {} = {}", input.a, input.b, result),
        })?)
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(MultiplyArgs);
        Some(pmcp::types::ToolInfo::new(
            "multiply",
            Some("Multiply two numbers".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}

// === Divide Tool ===

#[derive(Debug, Deserialize, JsonSchema)]
pub struct DivideArgs {
    /// The dividend
    pub dividend: f64,
    /// The divisor (cannot be zero)
    pub divisor: f64,
}

pub struct DivideTool;

#[async_trait]
impl ToolHandler for DivideTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: DivideArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
        
        if input.divisor == 0.0 {
            return Err(Error::validation("Cannot divide by zero"));
        }
        
        let result = input.dividend / input.divisor;
        Ok(serde_json::to_value(CalculationResult {
            result,
            expression: format!("{} ÷ {} = {}", input.dividend, input.divisor, result),
        })?)
    }
    
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(DivideArgs);
        Some(pmcp::types::ToolInfo::new(
            "divide",
            Some("Divide two numbers (divisor cannot be zero)".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}
}

What Claude Sees

When Claude connects to your server, it receives the tool list:

{
  "tools": [
    {
      "name": "add",
      "description": "Add two numbers",
      "inputSchema": {
        "type": "object",
        "properties": {
          "a": { "type": "number", "description": "First number" },
          "b": { "type": "number", "description": "Second number" }
        },
        "required": ["a", "b"]
      }
    },
    {
      "name": "divide",
      "description": "Divide two numbers (divisor cannot be zero)",
      "inputSchema": {
        "type": "object",
        "properties": {
          "dividend": { "type": "number", "description": "The dividend" },
          "divisor": { "type": "number", "description": "The divisor (cannot be zero)" }
        },
        "required": ["dividend", "divisor"]
      }
    }
  ]
}

Claude uses this information to:

Understand what tools are available
Know what arguments each tool requires
Generate valid tool calls automatically

Hands-On Exercise

Ready to build your own calculator? Head to the exercises page:

Chapter 2 Exercises - Build a calculator MCP server with proper error handling (Exercise 2)

Next, let's dive deeper into the patterns and conventions used in this code.

Continue to Understanding the Generated Code →

Understanding the Generated Code

Now that you've seen the calculator server, let's understand the patterns and conventions that make PMCP code production-ready.

The Prelude Pattern

Most PMCP code starts with:

#![allow(unused)]
fn main() {
use pmcp::prelude::*;
}

This imports commonly used types:

Type	Purpose
`Server`	The MCP server instance
`ServerBuilder`	Fluent API for building servers
`ServerCapabilities`	Declares what the server supports
`ToolHandler`	Trait for implementing tools
`RequestHandlerExtra`	Additional context for handlers
`Error`	PMCP error types

You can also import types explicitly:

#![allow(unused)]
fn main() {
use pmcp::{Server, ServerBuilder, ServerCapabilities, ToolHandler, Error};
}

Server Builder Pattern

The ServerBuilder uses the builder pattern for flexible configuration:

#![allow(unused)]
fn main() {
let server = Server::builder()
    .name("my-server")           // Required: server name
    .version("1.0.0")            // Required: semantic version
    .capabilities(caps)          // Required: what the server supports
    .tool("tool_name", handler)  // Add tools
    .resource("uri", provider)   // Add resources
    .prompt("name", template)    // Add prompts
    .build()?;                   // Finalize and validate
}

Server Capabilities

Capabilities tell clients what your server supports:

#![allow(unused)]
fn main() {
// Only tools
let caps = ServerCapabilities::tools_only();

// Only resources
let caps = ServerCapabilities::resources_only();

// Tools and resources
let caps = ServerCapabilities {
    tools: Some(pmcp::types::ToolCapabilities::default()),
    resources: Some(pmcp::types::ResourceCapabilities::default()),
    ..Default::default()
};

// Everything
let caps = ServerCapabilities::all();
}

Declaring capabilities correctly helps clients understand your server's features.

The ToolHandler Trait

Every tool implements ToolHandler:

#![allow(unused)]
fn main() {
#[async_trait]
pub trait ToolHandler: Send + Sync {
    /// Handle a tool invocation
    async fn handle(
        &self,
        args: Value,
        extra: RequestHandlerExtra,
    ) -> Result<Value, Error>;
    
    /// Return tool metadata (name, description, schema)
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        None  // Default: no metadata
    }
}
}

Why `async_trait`?

Rust doesn't natively support async functions in traits (yet). The #[async_trait] macro bridges this gap:

#![allow(unused)]
fn main() {
use async_trait::async_trait;

#[async_trait]
impl ToolHandler for MyTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        // Can use .await here
        let data = fetch_data().await?;
        Ok(json!({ "data": data }))
    }
}
}

The `RequestHandlerExtra` Parameter

The extra parameter provides context about the request:

#![allow(unused)]
fn main() {
async fn handle(&self, args: Value, extra: RequestHandlerExtra) -> Result<Value, Error> {
    // Access request metadata
    if let Some(meta) = &extra.meta {
        tracing::info!("Request ID: {:?}", meta.progress_token);
    }
    
    // ... handle request
}
}

We'll use this more in later chapters for authentication and progress reporting.

Type-Safe Arguments with Serde

The pattern for parsing arguments:

#![allow(unused)]
fn main() {
#[derive(Debug, Deserialize, JsonSchema)]
pub struct MyToolArgs {
    pub required_field: String,
    
    #[serde(default)]
    pub optional_field: Option<i32>,
    
    #[serde(default = "default_limit")]
    pub limit: u32,
}

fn default_limit() -> u32 { 10 }
}

Serde Attributes

Attribute	Effect
`#[serde(default)]`	Use `Default::default()` if missing
`#[serde(default = "fn")]`	Use custom default function
`#[serde(rename = "name")]`	Use different JSON field name
`#[serde(skip)]`	Don't serialize/deserialize
`#[serde(flatten)]`	Inline nested struct fields

Parsing Pattern

Always parse with proper error handling:

#![allow(unused)]
fn main() {
let input: MyToolArgs = serde_json::from_value(args)
    .map_err(|e| Error::validation(format!("Invalid arguments: {}", e)))?;
}

This converts parsing errors into MCP validation errors that clients understand.

JSON Schema Generation

The JsonSchema derive generates schemas automatically:

#![allow(unused)]
fn main() {
use schemars::JsonSchema;

#[derive(JsonSchema)]
pub struct SearchArgs {
    /// The search query string
    pub query: String,
    
    /// Maximum results to return (1-100)
    #[schemars(range(min = 1, max = 100))]
    pub limit: u32,
    
    /// Filter by status
    pub status: Option<Status>,
}

#[derive(JsonSchema)]
pub enum Status {
    Active,
    Inactive,
    Pending,
}
}

Generated schema:

{
  "type": "object",
  "properties": {
    "query": {
      "type": "string",
      "description": "The search query string"
    },
    "limit": {
      "type": "integer",
      "minimum": 1,
      "maximum": 100,
      "description": "Maximum results to return (1-100)"
    },
    "status": {
      "type": "string",
      "enum": ["Active", "Inactive", "Pending"],
      "description": "Filter by status"
    }
  },
  "required": ["query", "limit"]
}

Schema Attributes

Attribute	Effect
`/// comment`	Becomes `description`
`#[schemars(range(min, max))]`	Adds numeric bounds
`#[schemars(length(min, max))]`	Adds string length bounds
`#[schemars(regex(pattern))]`	Adds pattern validation

Error Handling Patterns

Validation Errors (Client's Fault)

#![allow(unused)]
fn main() {
// Missing required field
if input.query.is_empty() {
    return Err(Error::validation("Query cannot be empty"));
}

// Invalid value
if input.limit > 100 {
    return Err(Error::validation("Limit cannot exceed 100"));
}

// Invalid format
if !input.email.contains('@') {
    return Err(Error::validation("Invalid email format"));
}
}

Internal Errors (Server's Fault)

#![allow(unused)]
fn main() {
// Database failure
let result = db.query(&sql).await
    .map_err(|e| Error::internal(format!("Database error: {}", e)))?;

// External service failure
let response = client.get(url).await
    .map_err(|e| Error::internal(format!("API error: {}", e)))?;
}

Resource Errors

#![allow(unused)]
fn main() {
// Not found
let user = db.find_user(id).await?
    .ok_or_else(|| Error::not_found(format!("User {} not found", id)))?;

// Permission denied
if !user.can_access(resource) {
    return Err(Error::permission_denied("Access denied"));
}
}

Structured Logging with Tracing

PMCP uses the tracing crate for structured logging:

#![allow(unused)]
fn main() {
use tracing::{info, warn, error, debug, instrument};

#[instrument(skip(self, extra))]
async fn handle(&self, args: Value, extra: RequestHandlerExtra) -> Result<Value, Error> {
    info!(tool = "my_tool", "Processing request");
    
    let input: MyArgs = serde_json::from_value(args)?;
    debug!(query = %input.query, "Parsed arguments");
    
    match do_work(&input).await {
        Ok(result) => {
            info!(result_count = result.len(), "Request completed");
            Ok(serde_json::to_value(result)?)
        }
        Err(e) => {
            error!(error = %e, "Request failed");
            Err(Error::internal(e.to_string()))
        }
    }
}
}

Log Levels

Level	Use For
`error!`	Failures that need attention
`warn!`	Unexpected but handled situations
`info!`	Normal operational messages
`debug!`	Detailed debugging info
`trace!`	Very verbose debugging

The `#[instrument]` Macro

Automatically creates a span with function arguments:

#![allow(unused)]
fn main() {
#[instrument(skip(db), fields(user_id = %user_id))]
async fn get_user(db: &Database, user_id: i64) -> Result<User, Error> {
    // Logs: get_user{user_id=123}
    db.find(user_id).await
}
}

Async Patterns

Sequential Operations

#![allow(unused)]
fn main() {
let user = db.get_user(user_id).await?;
let orders = db.get_orders(user_id).await?;
let total = calculate_total(&orders);
}

Parallel Operations

#![allow(unused)]
fn main() {
use tokio::try_join;

let (user, orders, preferences) = try_join!(
    db.get_user(user_id),
    db.get_orders(user_id),
    db.get_preferences(user_id),
)?;
}

Timeout Handling

#![allow(unused)]
fn main() {
use tokio::time::{timeout, Duration};

let result = timeout(Duration::from_secs(5), slow_operation())
    .await
    .map_err(|_| Error::internal("Operation timed out"))??;
}

Testing Tools

Unit Testing a Handler

#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    use super::*;
    
    #[tokio::test]
    async fn test_add_tool() {
        let tool = AddTool;
        let args = json!({ "a": 10.0, "b": 5.0 });
        let extra = RequestHandlerExtra::default();
        
        let result = tool.handle(args, extra).await.unwrap();
        
        assert_eq!(result["result"], 15.0);
        assert_eq!(result["expression"], "10 + 5 = 15");
    }
    
    #[tokio::test]
    async fn test_divide_by_zero() {
        let tool = DivideTool;
        let args = json!({ "dividend": 10.0, "divisor": 0.0 });
        let extra = RequestHandlerExtra::default();
        
        let result = tool.handle(args, extra).await;
        
        assert!(result.is_err());
        let err = result.unwrap_err();
        assert!(err.to_string().contains("divide by zero"));
    }
}
}

Testing Schema Generation

#![allow(unused)]
fn main() {
#[test]
fn test_args_schema() {
    let schema = schemars::schema_for!(AddArgs);
    let json = serde_json::to_value(&schema).unwrap();
    
    assert!(json["properties"]["a"].is_object());
    assert!(json["properties"]["b"].is_object());
    assert!(json["required"].as_array().unwrap().contains(&json!("a")));
}
}

Summary: The PMCP Pattern

Every PMCP tool follows this pattern:

Define input types with Deserialize and JsonSchema
Define output types with Serialize and JsonSchema
Implement ToolHandler with proper error handling
Provide metadata for client discovery
Register with ServerBuilder
Test thoroughly

#![allow(unused)]
fn main() {
// 1. Input type
#[derive(Debug, Deserialize, JsonSchema)]
pub struct MyToolArgs { /* ... */ }

// 2. Output type  
#[derive(Debug, Serialize, JsonSchema)]
pub struct MyToolResult { /* ... */ }

// 3. Handler implementation
pub struct MyTool;

#[async_trait]
impl ToolHandler for MyTool {
    async fn handle(&self, args: Value, _extra: RequestHandlerExtra) -> Result<Value, Error> {
        let input: MyToolArgs = serde_json::from_value(args)
            .map_err(|e| Error::validation(e.to_string()))?;
        
        // Business logic here
        
        Ok(serde_json::to_value(result)?)
    }
    
    // 4. Metadata
    fn metadata(&self) -> Option<pmcp::types::ToolInfo> {
        let schema = schemars::schema_for!(MyToolArgs);
        Some(pmcp::types::ToolInfo::new(
            "my_tool",
            Some("Description here".to_string()),
            serde_json::to_value(&schema).unwrap_or_default(),
        ))
    }
}

// 5. Registration
let server = Server::builder()
    .tool("my_tool", MyTool)
    .build()?;
}

Hands-On Exercise: Code Review

Now that you understand the patterns, practice your code review skills with a hands-on exercise. Code review is critical when working with AI-generated code.

Chapter 2 Exercises - Complete Exercise 3: Code Review Basics to practice identifying bugs, security issues, and anti-patterns in MCP server code.

Next, let's learn how to debug and test your server with MCP Inspector.

Continue to Testing with MCP Inspector →

Running with MCP Inspector

Chapter 2 Exercises

These hands-on exercises will solidify your understanding of MCP server development with the PMCP SDK.

Exercises

Your First MCP Server ⭐ Beginner (20 min)
- Create an MCP server with a simple "greet" tool
- Learn the builder pattern and typed inputs
The Calculator Tool ⭐ Beginner (25 min)
- Build a calculator with multiple operations
- Implement proper error handling for edge cases
Code Review Challenge ⭐ Beginner (20 min)
- Review code for bugs, security issues, and anti-patterns
- Practice systematic code review techniques

Next Steps

After completing these exercises, continue to:

Testing with MCP Inspector - Debug and test your servers
Database MCP Servers - Connect to real data sources

Exercise: Environment Setup

ch02-00-environment-setup

⭐ beginner ⏱️ 15 min

Before building your first MCP server, let's ensure your development environment is properly configured. This setup exercise will verify all required tools are installed and working.

🎯 Learning Objectives

💡 Hints

Hint 1: Installing Rust

If Rust is not installed, run:

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source ~/.cargo/env

Verify with rustc --version - should be 1.82.0 or later.

Hint 2: Installing cargo-pmcp

Install the PMCP development toolkit:

cargo install cargo-pmcp

If installation fails, first update Rust: rustup update stable

Hint 3: MCP Inspector

The MCP Inspector is a web-based tool for testing MCP servers:

npx @modelcontextprotocol/inspector

No installation needed - it runs via npx.

Hint 4: Setting up Claude Desktop

Download Claude Desktop from claude.ai.

Configure MCP servers in ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows).

⚠️ Try the exercise first!Show Solution

# Complete verification script
echo "=== Rust Toolchain ==="
rustc --version && cargo --version
echo -e "\n=== cargo-pmcp ==="
cargo pmcp --version
echo -e "\n=== Node.js (for MCP Inspector) ==="
node --version && npx --version
echo -e "\n=== Environment Ready! ==="

Explanation

Expected output:

🧪 Tests

View Test Code

# Test 1: Rust is installed rustc --version | grep -q "rustc 1\." && echo "PASS: Rust installed" || echo "FAIL: Rust not found" Test 2: Cargo is available cargo --version | grep -q "cargo 1." && echo "PASS: Cargo installed" || echo "FAIL: Cargo not found" Test 3: cargo-pmcp is installed cargo pmcp --version 2>/dev/null && echo "PASS: cargo-pmcp installed" || echo "FAIL: cargo-pmcp not found - run: cargo install cargo-pmcp" Test 4: Node.js is available (for MCP Inspector)

node --version | grep -q "v" && echo "PASS: Node.js installed" || echo "WARN: Node.js not found - needed for MCP Inspector"

🤔 Reflection

Did you encounter any installation issues? Note them for troubleshooting.
Which MCP client will you use? (Claude Desktop, Cursor, VS Code + Continue)
Are you planning to deploy to cloud? If so, ensure you have the relevant CLI installed (aws, wrangler, or gcloud).

Exercise: Your First MCP Server

ch02-01-hello-mcp

⭐ beginner ⏱️ 20 min

Every journey starts with a first step. In this exercise, you'll create your first MCP server - one that responds to a simple "greet" tool.

This might seem simple, but you're learning the foundation that every production MCP server builds upon. By the end, you'll understand:

How MCP servers are structured
How tools receive and process input
How to return results to clients

🎯 Learning Objectives

Thinking

How MCP servers are structured (builder pattern)
The relationship between server, tools, and responses
Why typed inputs matter for AI interactions

Doing

Create an MCP server using Server::builder()
Define a tool with typed input parameters
Return a properly formatted response

💬 Discussion

What do you think an MCP server does? How is it different from a REST API?
Why might we want to define input types (schemas) for our tools?
When Claude or another AI calls a tool, what information does it need?

src/main.rs

💡 Hints

Hint 1: Start with the builder

Start with the server builder:

#![allow(unused)]
fn main() {
let server = Server::builder()
    .name("hello-mcp")
    .version("1.0.0")
    // ...continue building
}

Hint 2: Configure capabilities

You need to configure capabilities and add a tool:

#![allow(unused)]
fn main() {
.capabilities(ServerCapabilities {
    tools: Some(ToolCapabilities::default()),
    ..Default::default()
})
.tool("greet", TypedTool::new(...))
}

Hint 3: Complete structure

The complete structure looks like:

#![allow(unused)]
fn main() {
let server = Server::builder()
    .name("hello-mcp")
    .version("1.0.0")
    .capabilities(ServerCapabilities {
        tools: Some(ToolCapabilities::default()),
        ..Default::default()
    })
    .tool("greet", TypedTool::new("greet", |input: GreetInput| {
        Box::pin(async move {
            // Your greeting logic here
            let greeting = if input.formal.unwrap_or(false) {
                format!("Good day, {}.", input.name)
            } else {
                format!("Hello, {}!", input.name)
            };
            Ok(serde_json::json!({ "message": greeting }))
        })
    }))
    .build()?;
}

⚠️ Try the exercise first!Show Solution

use pmcp::{Server, ServerCapabilities, ToolCapabilities};
use pmcp::server::TypedTool;
use serde::Deserialize;
use schemars::JsonSchema;
use anyhow::Result;
#[derive(Deserialize, JsonSchema)]
struct GreetInput {
/// The name of the person to greet
name: String,
/// Whether to use a formal greeting style
formal: Option<bool>,
}
#[tokio::main]
async fn main() -> Result<()> {
let server = Server::builder()
.name("hello-mcp")
.version("1.0.0")
.capabilities(ServerCapabilities {
tools: Some(ToolCapabilities::default()),
..Default::default()
})
.tool("greet", TypedTool::new("greet", |input: GreetInput| {
Box::pin(async move {
let greeting = if input.formal.unwrap_or(false) {
format!("Good day, {}.", input.name)
} else {
format!("Hello, {}!", input.name)
};
Ok(serde_json::json!({ "message": greeting }))
})
}))
.build()?;
// In a real server, you&#x27;d run this with a transport
// For now, we just verify it builds
println!(&quot;Server &#x27;{}&#x27; v{} ready!&quot;, server.name(), server.version());

Ok(())

}

Explanation

Let's break down what this code does:

1. Input Definition (GreetInput)

#[derive(Deserialize)] - Allows parsing JSON input from clients
#[derive(JsonSchema)] - Generates a schema that tells AI what inputs are valid
Option<bool> - Makes the formal field optional

2. Server Builder Pattern

Server::builder() - Starts building a server configuration
.name() / .version() - Metadata that identifies your server
.capabilities() - Declares what the server can do (tools, resources, etc.)
.tool() - Registers a tool that clients can call

3. TypedTool

Wraps your handler function with type information
Automatically deserializes JSON input to your struct
The closure receives typed input and returns a JSON result

4. Async Handler

Box::pin(async move { ... }) - Creates an async future
Returns Result<Value> - Either a JSON response or an error

Why This Pattern?

Type safety catches errors at compile time
Schemas help AI understand how to call your tools
The builder pattern makes configuration clear and extensible

🧪 Tests

Run these tests locally with:

cargo test

View Test Code

#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    use super::*;
#[test]
fn test_informal_greeting() {
    let input = GreetInput {
        name: &quot;Alice&quot;.to_string(),
        formal: None,
    };
    let result = create_greeting(&amp;input);
    assert!(result.contains(&quot;Hello&quot;));
    assert!(result.contains(&quot;Alice&quot;));
}

#[test]
fn test_formal_greeting() {
    let input = GreetInput {
        name: &quot;Dr. Smith&quot;.to_string(),
        formal: Some(true),
    };
    let result = create_greeting(&amp;input);
    assert!(result.contains(&quot;Good day&quot;));
    assert!(result.contains(&quot;Dr. Smith&quot;));
}

#[test]
fn test_explicit_informal() {
    let input = GreetInput {
        name: &quot;Bob&quot;.to_string(),
        formal: Some(false),
    };
    let result = create_greeting(&amp;input);
    assert!(result.contains(&quot;Hello&quot;));
}

fn create_greeting(input: &amp;GreetInput) -&gt; String {
    if input.formal.unwrap_or(false) {
        format!(&quot;Good day, {}.&quot;, input.name)
    } else {
        format!(&quot;Hello, {}!&quot;, input.name)
    }
}
}

}

🤔 Reflection

Why do we use a struct with derive macros instead of just parsing JSON manually?
What happens if a client sends an input that doesn't match the schema?
How might you extend this server to greet in different languages?
What would change if you wanted to add a second tool to this server?

Exercise: Building a Calculator Tool

ch02-02-calculator

⭐ beginner ⏱️ 25 min

Now that you've created your first MCP server, let's build something more useful: a calculator. But this isn't just about math - it's about learning how to handle different operations, validate inputs, and return meaningful errors.

Think about it: when an AI asks your calculator to divide by zero, what should happen? When someone passes "abc" instead of a number, how do you respond helpfully?

Production MCP servers must handle edge cases gracefully. This exercise teaches you how.

🎯 Learning Objectives

Thinking

Why input validation matters for AI interactions
How to design tools that handle edge cases
The difference between expected errors and bugs

Doing

Create a tool that handles multiple operations
Implement input validation with helpful error messages
Use Rust's Result type for error handling

💬 Discussion

If you were an AI trying to use a calculator, what operations would you expect?
What should happen if someone tries to divide by zero?
How can error messages help an AI correct its request?
Should a calculator tool accept 'two plus three' or just '2 + 3'?

src/main.rs

💡 Hints

Hint 1: Start with the match

Use pattern matching to handle each operation:

#![allow(unused)]
fn main() {
fn calculate(input: &CalculateInput) -> Result<CalculateResult> {
    let (result, op_symbol) = match input.operation {
        Operation::Add => (input.a + input.b, "+"),
        // Add other operations...
    };
// Build the result
}

}

Hint 2: Handle division safely

Check for division by zero before computing:

#![allow(unused)]
fn main() {
Operation::Divide => {
    if input.b == 0.0 {
        return Err(anyhow!("Cannot divide by zero"));
    }
    (input.a / input.b, "/")
}
}

Hint 3: Complete calculate function

#![allow(unused)]
fn main() {
fn calculate(input: &CalculateInput) -> Result<CalculateResult> {
    let (result, op_symbol) = match input.operation {
        Operation::Add => (input.a + input.b, "+"),
        Operation::Subtract => (input.a - input.b, "-"),
        Operation::Multiply => (input.a * input.b, "*"),
        Operation::Divide => {
            if input.b == 0.0 {
                return Err(anyhow!("Cannot divide by zero"));
            }
            (input.a / input.b, "/")
        }
    };
if result.is_nan() || result.is_infinite() {
    return Err(anyhow!("Invalid result"));
}

Ok(CalculateResult {
    result,
    expression: format!("{} {} {} = {}", input.a, op_symbol, input.b, result),
})
}

}

⚠️ Try the exercise first!Show Solution

#![allow(unused)]
fn main() {
use pmcp::{Server, ServerCapabilities, ToolCapabilities};
use pmcp::server::TypedTool;
use serde::{Deserialize, Serialize};
use schemars::JsonSchema;
use anyhow::{Result, anyhow};
#[derive(Deserialize, JsonSchema)]
#[serde(rename_all = "lowercase")]
enum Operation {
Add,
Subtract,
Multiply,
Divide,
}
#[derive(Deserialize, JsonSchema)]
struct CalculateInput {
a: f64,
b: f64,
operation: Operation,
}
#[derive(Serialize)]
struct CalculateResult {
result: f64,
expression: String,
}
fn calculate(input: &CalculateInput) -> Result<CalculateResult> {
let (result, op_symbol) = match input.operation {
Operation::Add => (input.a + input.b, "+"),
Operation::Subtract => (input.a - input.b, "-"),
Operation::Multiply => (input.a * input.b, "*"),
Operation::Divide => {
if input.b == 0.0 {
return Err(anyhow!(
"Cannot divide by zero. Please provide a non-zero divisor."
));
}
(input.a / input.b, "/")
}
};
if result.is_nan() || result.is_infinite() {
    return Err(anyhow!(
        &quot;Calculation produced an invalid result (NaN or Infinity)&quot;
    ));
}

Ok(CalculateResult {
    result,
    expression: format!(&quot;{} {} {} = {}&quot;, input.a, op_symbol, input.b, result),
})
}

}
#[tokio::main]
async fn main() -> Result<()> {
let server = Server::builder()
.name("calculator")
.version("1.0.0")
.capabilities(ServerCapabilities {
tools: Some(ToolCapabilities::default()),
..Default::default()
})
.tool("calculate", TypedTool::new("calculate", |input: CalculateInput| {
Box::pin(async move {
match calculate(&input) {
Ok(result) => Ok(serde_json::to_value(result)?),
Err(e) => Ok(serde_json::json!({
"error": e.to_string(),
"suggestion": "Check your inputs and try again"
})),
}
})
}))
.build()?;
println!(&quot;Calculator server ready!&quot;);
Ok(())

}

Explanation

This solution demonstrates several important patterns:

1. Enum for Operations Using an enum instead of a string for operations:

Compile-time validation of operation types
Pattern matching ensures all cases are handled
#[serde(rename_all = "lowercase")] allows JSON like "add" instead of "Add"

2. Separation of Concerns The calculate() function is separate from the tool handler:

Easier to test (pure function, no async)
Cleaner error handling
Reusable logic

3. Defensive Error Handling

Check for division by zero BEFORE computing
Check for NaN/Infinity AFTER computing
Return helpful error messages that guide the AI

4. Human-Readable Output

The expression field shows the full calculation
Helps debugging and transparency
AI can show this to users

5. Error Response Pattern Instead of returning a tool error (which might retry), we return a structured error response. This lets the AI understand what went wrong and explain it to the user.

🧪 Tests

Run these tests locally with:

cargo test

View Test Code

#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    use super::*;
#[test]
fn test_addition() {
    let input = CalculateInput {
        a: 5.0,
        b: 3.0,
        operation: Operation::Add,
    };
    let result = calculate(&amp;input).unwrap();
    assert_eq!(result.result, 8.0);
}

#[test]
fn test_division_by_zero() {
    let input = CalculateInput {
        a: 10.0,
        b: 0.0,
        operation: Operation::Divide,
    };
    assert!(calculate(&amp;input).is_err());
}

#[test]
fn test_expression_format() {
    let input = CalculateInput {
        a: 10.0,
        b: 5.0,
        operation: Operation::Multiply,
    };
    let result = calculate(&amp;input).unwrap();
    assert!(result.expression.contains(&quot;10 * 5 = 50&quot;));
}
}

}

🤔 Reflection

Why do we check for division by zero before computing, not after?
What's the advantage of returning a structured error vs failing the tool call?
How would you add a 'power' operation to this calculator?
What might go wrong with floating-point math that integers wouldn't have?

Exercise: Code Review Basics

ch02-03-code-review

⭐ beginner ⏱️ 20 min

You've been asked to review a colleague's MCP server code before it goes to production. The server is supposed to process user messages and return responses, but something isn't quite right.

This exercise develops a crucial skill: code review. When working with AI assistants, you'll often need to review generated code for issues. Even when you write code yourself, a critical eye catches bugs before users do.

Your task: Find at least 5 issues in this code, categorize them by severity, and suggest fixes.

🎯 Learning Objectives

Thinking

How to systematically review code for issues
Distinguishing bugs from style issues from security concerns
Why error handling patterns matter

Doing

Identify bugs, security issues, and anti-patterns
Categorize issues by severity
Propose concrete fixes

💬 Discussion

What's your usual approach when reviewing code?
What categories of issues should you look for?
How do you prioritize fixes?

💡 Hints

Hint 1: Where to look

Focus on these areas:

How is the mutex being used?
What happens with all those .unwrap() calls?
Does the server actually run?
What gets logged?

Hint 2: Critical issues

The most critical issues:

The mutex lock usage has a problem with mutable access
The server is built but never started with a transport
Multiple .unwrap() calls can panic

Hint 3: Full list

Issues to find:

Critical: Mutex borrow issue - needs mut for *count += 1
High: .lock().unwrap() panics if mutex poisoned
High: Server never starts (no transport)
High: Multiple .unwrap() calls can panic
Medium: Global mutable state hurts testing/scaling
Medium: Raw user input logged (security)
Low: .len() > 0 should be !.is_empty()
Low: Version "0.1" should be "0.1.0" for semver
Low: main() should return Result

⚠️ Try the exercise first!Show Solution

#![allow(unused)]
fn main() {
let count = MESSAGE_COUNT.lock().unwrap();
*count += 1;  // Error: count is not mutable!
}

Explanation

Fix: let mut count = MESSAGE_COUNT.lock().unwrap();

2. High - Panic on Poisoned Mutex Fix: Handle PoisonError or use lock().unwrap_or_else(|e| e.into_inner())

3. High - Server Never Starts Fix: Add server.run_stdio().await?; or HTTP transport

4. High - Unwrap on Serialization Fix: Use ? operator: Ok(serde_json::to_value(response)?)

5. Medium - Global Mutable State Fix: Use per-request or per-connection state, or Arc<Mutex<>> passed to handlers

6. Medium - Logging User Input Fix: Use structured logging (tracing), sanitize/truncate input

7. Low - Non-idiomatic Empty Check Fix: if !input.message.is_empty()

8. Low - Semver Version Format Fix: .version("0.1.0")

9. Low - main() Return Type Fix: async fn main() -> Result<(), Box<dyn std::error::Error>>

🤔 Reflection

What's your process for reviewing unfamiliar code?
How do you prioritize which issues to fix first?
How would you give feedback to the author without being discouraging?
What tools could help catch some of these issues automatically?

Database MCP Servers

Database access is the killer app for enterprise MCP. When employees can ask Claude "What were our top-selling products last quarter?" and get an instant, accurate answer from live data—that's transformative.

This chapter shows you how to build production-ready database MCP servers that are secure, performant, and enterprise-ready.

What You'll Learn

Section	Topics
The Enterprise Data Access Problem	Why database access is MCP's killer app, the friction it eliminates
Building db-explorer	Step-by-step server creation, query tools, schema introspection
SQL Safety and Injection Prevention	Security patterns, parameterized queries, allowlisting
Resource-Based Data Patterns	When to use resources vs tools, structured access patterns
Handling Large Results	Pagination, streaming, cursor-based navigation

Quick Preview

By the end of this chapter, you'll build a database server that lets Claude:

User: "Show me our top 10 customers by revenue"

Claude: I'll query the sales database for you.

[Calls list_tables tool]
[Calls query tool with: SELECT customer_name, SUM(order_total) as revenue 
 FROM orders GROUP BY customer_id ORDER BY revenue DESC LIMIT 10]

Here are your top 10 customers by revenue:
1. Acme Corp - $1,234,567
2. GlobalTech - $987,654
...

The Architecture

┌─────────────────────────────────────────────────────────┐
│                     Claude / AI Client                   │
└─────────────────────────┬───────────────────────────────┘
                          │ MCP Protocol
                          ▼
┌─────────────────────────────────────────────────────────┐
│                   Database MCP Server                    │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐     │
│  │ list_tables │  │    query    │  │  Resources  │     │
│  │    Tool     │  │    Tool     │  │ (optional)  │     │
│  └──────┬──────┘  └──────┬──────┘  └──────┬──────┘     │
│         │                │                │             │
│         └────────────────┼────────────────┘             │
│                          ▼                              │
│              ┌───────────────────────┐                  │
│              │   Connection Pool     │                  │
│              │   (sqlx + Arc)        │                  │
│              └───────────┬───────────┘                  │
└──────────────────────────┼──────────────────────────────┘
                           │
                           ▼
                    ┌──────────────┐
                    │   Database   │
                    │  (SQLite,    │
                    │  PostgreSQL, │
                    │  MySQL)      │
                    └──────────────┘

Prerequisites

Before starting this chapter, you should have:

Completed Chapter 2: Your First Production Server
Basic familiarity with SQL
A sample database to work with (we'll provide one)

Sample Database

We'll use the Chinook database—a sample database representing a digital media store with customers, invoices, tracks, and artists.

# Download the sample database
curl -L -o chinook.db https://github.com/lerocha/chinook-database/raw/master/ChinookDatabase/DataSources/Chinook_Sqlite.sqlite

Chapter Sections

1. The Enterprise Data Access Problem

Understand why database access is MCP's killer app for enterprises:

The current friction in getting data to AI
How MCP eliminates the copy-paste workflow
Security considerations for enterprise data

2. Building db-explorer

Build a complete database MCP server step-by-step:

Creating the server with cargo pmcp
Implementing list_tables and query tools
Testing with MCP Inspector and Claude

3. SQL Safety and Injection Prevention

Master security patterns for database access:

SQL injection attacks and prevention
Parameterized queries with sqlx
Allowlisting vs blocklisting approaches
Defense in depth strategies

4. Resource-Based Data Patterns

Learn when to use MCP resources instead of SQL tools:

Resources for structured, predictable access
Tools for flexible, ad-hoc queries
Hybrid approaches for different use cases

5. Handling Large Results

Handle enterprise-scale data volumes:

Why OFFSET pagination fails at scale
Cursor-based pagination patterns
Streaming for very large results
Memory-safe result handling

Hands-On Exercises

After completing the lessons, practice with these exercises:

Chapter 3 Exercises

Exercise 1: Database Query Basics - Build list_tables and execute_query tools
Exercise 2: SQL Injection Review - Find and fix security vulnerabilities
Exercise 3: Pagination Patterns - Implement cursor-based pagination

Security Checklist

Before deploying any database MCP server to production:

Only SELECT queries allowed (no mutations)
Parameterized queries for all user input
Row limits enforced on all queries
Sensitive columns filtered (SSN, passwords, PII)
Connection pooling configured
Query timeout set
Audit logging enabled
Authentication required (OAuth in production)

Knowledge Check

Test your understanding after completing the chapter:

Start with The Enterprise Data Access Problem →

The Enterprise Data Access Problem

Every enterprise has data trapped in databases. Customer information in CRM systems. Financial data in ERP systems. Analytics in data warehouses. Operational metrics in PostgreSQL or MySQL.

This data is incredibly valuable—but getting it into an AI conversation is surprisingly painful.

The Current Workflow

When an employee wants to use AI to analyze company data, here's what typically happens:

┌─────────────────────────────────────────────────────────────┐
│                    The Data Access Gauntlet                 │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  1. REQUEST ACCESS                                          │
│     └─→ Submit IT ticket                                    │
│         └─→ Wait for approval (days/weeks)                  │
│             └─→ Get credentials                             │
│                                                             │
│  2. LEARN THE TOOLS                                         │
│     └─→ Figure out which database has the data              │
│         └─→ Learn SQL or the reporting tool                 │
│             └─→ Understand the schema                       │
│                                                             │
│  3. EXTRACT THE DATA                                        │
│     └─→ Write the query                                     │
│         └─→ Export to CSV                                   │
│             └─→ Maybe clean it up in Excel                  │
│                                                             │
│  4. USE WITH AI                                             │
│     └─→ Copy-paste into ChatGPT                             │
│         └─→ Hope it's not too large                         │
│             └─→ Repeat for every new question               │
│                                                             │
└─────────────────────────────────────────────────────────────┘

This workflow has serious problems:

Problem	Impact
Slow	Days or weeks to get access, minutes per query
Error-prone	Manual copy-paste introduces mistakes
Limited	Large datasets don't fit in chat contexts
Stale	Exported data is immediately out of date
Insecure	Data copied to external AI services
Inefficient	Every question requires the full workflow

The MCP Solution

With a database MCP server, the workflow becomes:

┌─────────────────────────────────────────────────────────────┐
│                    MCP Database Access                      │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  User: "What were our top products last quarter?"           │
│                                                             │
│  Claude: [Calls list_tables to understand schema]           │
│          [Calls query with appropriate SQL]                 │
│          "Here are your top 10 products by revenue..."      │
│                                                             │
│  Time: ~2 seconds                                           │
│                                                             │
└─────────────────────────────────────────────────────────────┘

The key differences:

Aspect	Before MCP	With MCP
Access time	Days/weeks	Instant (pre-authorized)
Data freshness	Stale exports	Live queries
Query complexity	User writes SQL	AI writes SQL
Data size	Limited by copy-paste	Paginated, unlimited
Security	Data leaves enterprise	Stays within boundary
Repeatability	Manual each time	Automatic

Why This Matters for Enterprises

1. Democratized Data Access

Not everyone knows SQL. With an MCP server, a salesperson can ask:

"Show me which customers haven't ordered in 90 days but were active last year"

Claude translates this to SQL, queries the database, and presents the results—no SQL knowledge required.

2. Real-Time Insights

Traditional BI dashboards show pre-defined reports. With MCP, users can ask ad-hoc questions:

"Compare this month's sales to the same period last year, broken down by region"

The AI understands the question, writes the query, and explains the results in context.

3. Secure by Design

The MCP server acts as a security boundary:

┌────────────────────────────────────────────────────────┐
│                  Enterprise Network                    │
│                                                        │
│  ┌─────────────┐      ┌─────────────────────────────┐  │
│  │  Database   │◄────►│  Database MCP Server        │  │
│  │  (Private)  │      │  - SELECT only              │  │
│  └─────────────┘      │  - Row limits               │  │
│                       │  - Column filtering         │  │
│                       │  - Audit logging            │  │
│                       │  - OAuth authentication     │  │
│                       └──────────────┬──────────────┘  │
│                                      │                 │
└──────────────────────────────────────┼─────────────────┘
                                       │ HTTPS + OAuth
                                       ▼
                              ┌─────────────────┐
                              │  Claude / AI    │
                              │  (Authorized)   │
                              └─────────────────┘

Data never leaves your network as raw exports. The MCP server:

Enforces read-only access
Limits result sizes
Filters sensitive columns
Logs all queries for audit
Requires authentication

4. Composable Intelligence

A database MCP server can work alongside other servers:

User: "Draft an email to customers who haven't ordered recently, 
       offering them our current promotion"

Claude: 
  1. [Calls database server] → Gets inactive customer list
  2. [Calls promotions server] → Gets current offer details  
  3. [Calls email server] → Drafts personalized emails

The database becomes one component in larger AI-powered workflows.

Common Enterprise Use Cases

Sales & CRM

"Who are my top 10 accounts by revenue?"
"Which deals are stalled in the pipeline?"
"Show me customer churn trends"

Finance & Operations

"What's our current inventory status?"
"Show me outstanding invoices over 60 days"
"Compare expenses by department"

HR & People

"What's our headcount by location?"
"Show me open positions and time-to-fill"
"Analyze training completion rates"

Product & Analytics

"What features are most used?"
"Show me user retention by cohort"
"Compare performance across regions"

Security Considerations

Database access requires careful security design:

What the MCP Server Should Enforce

Read-only access - No INSERT, UPDATE, DELETE, DROP
Query validation - Block dangerous SQL patterns
Result limits - Prevent memory exhaustion
Column filtering - Hide sensitive fields (SSN, passwords)
Row-level security - Users only see authorized data
Rate limiting - Prevent abuse
Audit logging - Track all queries

What the Database Should Enforce

Minimal privileges - MCP server user has SELECT only
Network isolation - Database not exposed to internet
Connection limits - Bounded connection pool
Query timeouts - Kill long-running queries

What the Infrastructure Should Enforce

Authentication - OAuth/OIDC for all access
Encryption - TLS for all connections
Monitoring - Alert on anomalies
Backup - Regular database backups

The Business Case

Metric	Traditional Approach	With MCP
Time to first insight	Hours to days	Seconds
Queries per day (per user)	2-5	20-50
SQL knowledge required	Yes	No
Data freshness	Hours/days old	Real-time
Security risk	High (data exports)	Low (controlled access)
IT ticket volume	High	Low

For a 1,000-person organization where 200 people regularly need data:

Before: 200 people × 3 queries/day × 10 min/query = 100 hours/day wasted
After: 200 people × 30 queries/day × 5 sec/query = 8 hours/day saved

That's 92 hours per day returned to productive work.

Getting Started

In the next section, we'll build a database MCP server from scratch. You'll learn:

How to create the server with cargo pmcp
Implementing list_tables and query tools
Connecting to SQLite (and other databases)
Testing with MCP Inspector and Claude

The patterns you learn will apply to any database—SQLite, PostgreSQL, MySQL, or cloud databases like AWS RDS or Google Cloud SQL.

Continue to Building db-explorer →

Building db-explorer

Let's build a database MCP server. Like Chapter 2, we'll start by getting a working server running in under 5 minutes—then we'll explore how it works.

Try It First: Database Server in 5 Minutes

Step 1: Add the Server

From your existing workspace (or create a new one with cargo pmcp new):

cargo pmcp add server db-explorer --template db-explorer

Step 2: Get a Sample Database

We'll use the Chinook database—a sample music store with customers, invoices, artists, and tracks:

curl -L -o chinook.db https://github.com/lerocha/chinook-database/raw/master/ChinookDatabase/DataSources/Chinook_Sqlite.sqlite

Step 3: Run the Server

DATABASE_URL=sqlite:./chinook.db cargo pmcp dev db-explorer

You should see:

INFO db_explorer: Starting db-explorer server
INFO db_explorer: Database: sqlite:./chinook.db
INFO db_explorer: Connected to database
INFO server_common: Listening on http://0.0.0.0:3000

Step 4: Connect to Claude Code

In a new terminal:

claude mcp add db-explorer -t http http://localhost:3000

Step 5: Explore the Database!

Start Claude Code and try these prompts:

"What tables are in the database?"

Claude will call list_tables and show you the schema:

The database contains 11 tables:
- albums (347 rows) - AlbumId, Title, ArtistId
- artists (275 rows) - ArtistId, Name
- customers (59 rows) - CustomerId, FirstName, LastName, Email...
- employees (8 rows) - EmployeeId, LastName, FirstName...
- genres (25 rows) - GenreId, Name
- invoices (412 rows) - InvoiceId, CustomerId, InvoiceDate...
- invoice_items (2240 rows) - ...
- media_types (5 rows) - ...
- playlists (18 rows) - ...
- playlist_track (8715 rows) - ...
- tracks (3503 rows) - ...

"Which country has the most customers?"

Claude writes SQL and queries the database:

SELECT Country, COUNT(*) as customer_count 
FROM customers 
GROUP BY Country 
ORDER BY customer_count DESC 
LIMIT 5

"Show me the top 5 selling artists by total revenue"

Claude handles the complex join:

SELECT ar.Name, SUM(ii.UnitPrice * ii.Quantity) as Revenue
FROM artists ar
JOIN albums al ON ar.ArtistId = al.ArtistId
JOIN tracks t ON al.AlbumId = t.AlbumId
JOIN invoice_items ii ON t.TrackId = ii.TrackId
GROUP BY ar.ArtistId
ORDER BY Revenue DESC
LIMIT 5

"What genres are most popular by number of tracks sold?"

"Find customers who haven't made a purchase in the last year"

"What's the average invoice total by country?"

What Just Happened?

You gave Claude direct access to a database. It can:

Discover the schema - Understand what data is available
Write SQL - Translate natural language to queries
Execute safely - Only SELECT queries are allowed
Present results - Format data for human understanding

This is the power of database MCP servers.

Test with MCP Inspector

Before connecting to Claude, you can test your server interactively:

npx @modelcontextprotocol/inspector http://localhost:3000/mcp

This opens a web UI where you can:

Action	How
Browse tools	See `list_tables` and `query` with their schemas
Call list_tables	Click the tool, then "Execute" (no parameters needed)
Run a query	Enter `{"query": "SELECT * FROM artists LIMIT 5"}`
See raw JSON	View the exact MCP protocol messages

Try these queries in the inspector:

{"query": "SELECT * FROM customers LIMIT 5"}

{"query": "SELECT Country, COUNT(*) as count FROM customers GROUP BY Country"}

{"query": "SELECT * FROM artists WHERE Name LIKE '%Rock%'"}

How It Works

Now that you've seen it in action, let's understand the code. The db-explorer template creates this structure:

servers/db-explorer/
├── Cargo.toml
└── src/
    ├── main.rs           # Entry point, server setup
    ├── database.rs       # Connection pool management
    └── tools/
        ├── mod.rs        # Tool exports
        ├── list_tables.rs # Schema introspection
        └── query.rs      # SQL execution

The Database Connection

#![allow(unused)]
fn main() {
// src/database.rs
use sqlx::{Pool, Sqlite, sqlite::SqlitePoolOptions};
use std::sync::Arc;

pub type DbPool = Arc<Pool<Sqlite>>;

pub async fn create_pool(database_url: &str) -> Result<DbPool> {
    let pool = SqlitePoolOptions::new()
        .max_connections(5)
        .connect(database_url)
        .await?;

    Ok(Arc::new(pool))
}
}

Key points:

Arc<Pool<Sqlite>> - Shared connection pool, thread-safe
max_connections(5) - Limits concurrent database connections
Pool is shared between all tool handlers

The list_tables Tool

#![allow(unused)]
fn main() {
// src/tools/list_tables.rs (simplified)

#[derive(Debug, Serialize, JsonSchema)]
pub struct TableInfo {
    pub name: String,
    pub columns: Vec<ColumnInfo>,
    pub row_count: i64,
}

async fn list_tables_impl(pool: &DbPool) -> Result<Vec<TableInfo>> {
    // Get table names from SQLite's system catalog
    let tables: Vec<(String,)> = sqlx::query_as(
        "SELECT name FROM sqlite_master 
         WHERE type = 'table' 
         AND name NOT LIKE 'sqlite_%'"
    )
    .fetch_all(pool.as_ref())
    .await?;

    // For each table, get columns and row count
    let mut result = Vec::new();
    for (table_name,) in tables {
        let columns = get_columns(pool, &table_name).await?;
        let row_count = get_row_count(pool, &table_name).await?;
        
        result.push(TableInfo { name: table_name, columns, row_count });
    }

    Ok(result)
}
}

This tool:

Queries SQLite's sqlite_master for table names
Uses PRAGMA table_info() to get column details
Counts rows in each table
Returns structured data Claude can understand

The query Tool

#![allow(unused)]
fn main() {
// src/tools/query.rs (simplified)

#[derive(Debug, Deserialize, JsonSchema)]
pub struct QueryInput {
    /// SQL query to execute (SELECT only)
    pub query: String,
    
    /// Maximum rows to return
    #[serde(default = "default_limit")]
    pub limit: i32,
}

async fn query_impl(pool: &DbPool, input: QueryInput) -> Result<QueryOutput> {
    // Security: Only allow SELECT
    if !input.query.trim().to_uppercase().starts_with("SELECT") {
        return Err(anyhow!("Only SELECT queries are allowed"));
    }

    // Security: Block dangerous keywords
    let blocked = ["INSERT", "UPDATE", "DELETE", "DROP"];
    for keyword in blocked {
        if input.query.to_uppercase().contains(keyword) {
            return Err(anyhow!("{} is not allowed", keyword));
        }
    }

    // Add LIMIT if not present
    let limited_query = if !input.query.to_uppercase().contains("LIMIT") {
        format!("{} LIMIT {}", input.query, input.limit + 1)
    } else {
        input.query.clone()
    };

    // Execute and return results
    let rows = sqlx::query(&limited_query)
        .fetch_all(pool.as_ref())
        .await?;

    Ok(format_results(rows, input.limit))
}
}

Security measures:

SELECT only - Rejects INSERT, UPDATE, DELETE
Keyword blocking - Extra protection against injection
Automatic LIMIT - Prevents memory exhaustion
Truncation detection - Tells Claude if more rows exist

The Main Entry Point

// src/main.rs
#[tokio::main]
async fn main() -> Result<()> {
    // Get database URL from environment
    let database_url = std::env::var("DATABASE_URL")
        .unwrap_or_else(|_| "sqlite:./chinook.db".to_string());

    // Create connection pool
    let pool = create_pool(&database_url).await?;

    // Build MCP server with both tools
    let server = ServerBuilder::new("db-explorer", "1.0.0")
        .capabilities(ServerCapabilities {
            tools: Some(ToolCapabilities::default()),
            ..Default::default()
        })
        .tool(ListTables::new(pool.clone()).into_tool())
        .tool(Query::new(pool.clone()).into_tool())
        .build()?;

    // Start HTTP server
    server_common::create_http_server(server)
        .serve("0.0.0.0:3000")
        .await
}

Building from Scratch

Want to build it yourself instead of using the template? Here's the complete process:

1. Create Minimal Server

cargo pmcp add server my-db-server --template minimal

2. Add Dependencies

Edit servers/my-db-server/Cargo.toml:

[dependencies]
pmcp = { path = "../../pmcp" }
server-common = { path = "../../server-common" }
tokio = { version = "1", features = ["full"] }
sqlx = { version = "0.7", features = ["runtime-tokio", "sqlite"] }
serde = { version = "1", features = ["derive"] }
serde_json = "1"
schemars = "0.8"
anyhow = "1"
tracing = "0.1"
tracing-subscriber = "0.3"

3. Create the Files

Create the file structure shown above, implementing:

database.rs - Connection pool
tools/list_tables.rs - Schema discovery
tools/query.rs - SQL execution
tools/mod.rs - Module exports
main.rs - Server setup

The complete code for each file is in the Chapter 3 Exercises.

What We Built

Component	Purpose
`DbPool`	Shared, pooled database connections
`list_tables`	Schema discovery for Claude
`query`	Flexible SQL execution with safety checks
Connection pooling	Efficient resource usage
Query validation	Basic SQL injection protection
Result limiting	Memory safety

Limitations of This Basic Server

This server works, but has security limitations:

Issue	Risk	Solution
String-based validation	Can be bypassed	Proper parsing
No parameterized queries	SQL injection	Use `.bind()`
No authentication	Anyone can query	Add OAuth
No audit logging	No accountability	Log all queries
No column filtering	May expose PII	Allowlist columns

The next sections address these:

SQL Safety - Proper parameterized queries, defense in depth
Resources - Structured access patterns
Pagination - Handling large result sets

Production Security Note

The examples in Part 1 focus on MCP fundamentals and omit authentication for simplicity. In production deployments, you should:

Require OAuth authentication for all MCP requests

Pass access tokens through to backend data systems as the source of truth for permissions

Let the database enforce row-level security based on the authenticated user

See Part 5: Security for complete OAuth integration patterns with AWS Cognito, Auth0, and Microsoft Entra ID.

Continue to SQL Safety and Injection Prevention →

SQL Safety and Injection Prevention

SQL injection is consistently in the OWASP Top 10 vulnerabilities. When you build a database MCP server, you're creating an interface between AI-generated queries and your production data. Security isn't optional—it's essential.

Understanding SQL Injection

SQL injection occurs when untrusted input is concatenated into SQL queries:

#![allow(unused)]
fn main() {
// DANGEROUS: SQL Injection vulnerability
let query = format!(
    "SELECT * FROM users WHERE name = '{}'", 
    user_input  // What if user_input is: ' OR '1'='1
);
}

If user_input is ' OR '1'='1, the query becomes:

SELECT * FROM users WHERE name = '' OR '1'='1'

This returns ALL users, bypassing the intended filter.

Attack Examples

Attack	Payload	Result
Data exfiltration	`' UNION SELECT password FROM users--`	Leaks passwords
Bypass authentication	`' OR '1'='1`	Returns all rows
Delete data	`'; DROP TABLE users;--`	Destroys table
Read files	`' UNION SELECT load_extension('...`	System compromise

Defense Layer 1: Parameterized Queries

Always use parameterized queries for any user-controlled values:

#![allow(unused)]
fn main() {
// SAFE: Parameterized query
let users = sqlx::query_as::<_, User>(
    "SELECT * FROM users WHERE name = ?"
)
.bind(&user_input)  // Value is escaped/handled by the driver
.fetch_all(&pool)
.await?;
}

The database driver handles escaping—the user input can never become SQL code.

When to Use Parameters

#![allow(unused)]
fn main() {
// ✅ SAFE: Values as parameters
sqlx::query("SELECT * FROM users WHERE id = ?")
    .bind(user_id)

sqlx::query("SELECT * FROM orders WHERE date > ? AND status = ?")
    .bind(start_date)
    .bind(status)

// ❌ UNSAFE: String formatting
format!("SELECT * FROM users WHERE id = {}", user_id)
format!("SELECT * FROM {} WHERE id = ?", table_name)  // Table names can't be parameterized!
}

The Table Name Problem

You cannot parameterize table or column names:

#![allow(unused)]
fn main() {
// This WON'T work - table names can't be parameters
sqlx::query("SELECT * FROM ? WHERE id = ?")
    .bind(table_name)  // Error! 
    .bind(id)
}

For dynamic table/column names, use allowlisting (see Layer 2).

Defense Layer 2: Allowlisting

When you can't use parameters (table names, column names, ORDER BY), use strict allowlists:

#![allow(unused)]
fn main() {
/// Tables that users are allowed to query
const ALLOWED_TABLES: &[&str] = &[
    "customers",
    "orders", 
    "products",
    "invoices",
];

/// Validate a table name against the allowlist
fn validate_table(table: &str) -> Result<&str> {
    let table_lower = table.to_lowercase();
    
    ALLOWED_TABLES
        .iter()
        .find(|&&t| t == table_lower)
        .map(|&t| t)
        .ok_or_else(|| anyhow!("Table '{}' is not accessible", table))
}

// Usage
let table = validate_table(&input.table)?;
let query = format!("SELECT * FROM {} WHERE id = ?", table);
}

Column Name Allowlisting

#![allow(unused)]
fn main() {
fn validate_order_column(table: &str, column: &str) -> Result<&'static str> {
    let allowed = match table {
        "customers" => &["id", "name", "email", "created_at"][..],
        "orders" => &["id", "customer_id", "total", "order_date"][..],
        "products" => &["id", "name", "price", "category"][..],
        _ => return Err(anyhow!("Unknown table")),
    };
    
    allowed
        .iter()
        .find(|&&c| c == column.to_lowercase())
        .copied()
        .ok_or_else(|| anyhow!("Cannot sort by '{}'", column))
}

// Usage in ORDER BY
let order_col = validate_order_column("customers", &input.sort_by)?;
let query = format!(
    "SELECT * FROM customers ORDER BY {} {}",
    order_col,
    if input.ascending { "ASC" } else { "DESC" }
);
}

Defense Layer 3: Query Validation

For MCP servers that accept raw SQL (like our query tool), validate the query structure:

#![allow(unused)]
fn main() {
/// Validate that a query is safe to execute
fn validate_query(sql: &str) -> Result<()> {
    let sql_upper = sql.trim().to_uppercase();
    
    // Must start with SELECT
    if !sql_upper.starts_with("SELECT") {
        return Err(anyhow!("Only SELECT queries are allowed"));
    }
    
    // Block dangerous keywords
    let blocked = [
        "INSERT", "UPDATE", "DELETE", "DROP", "CREATE", "ALTER",
        "TRUNCATE", "EXEC", "EXECUTE", "GRANT", "REVOKE",
        "INTO OUTFILE", "INTO DUMPFILE", "LOAD_FILE",
    ];
    
    for keyword in blocked {
        if sql_upper.contains(keyword) {
            return Err(anyhow!("'{}' is not allowed in queries", keyword));
        }
    }
    
    // Block multiple statements
    if sql.contains(';') {
        let parts: Vec<_> = sql.split(';').filter(|s| !s.trim().is_empty()).collect();
        if parts.len() > 1 {
            return Err(anyhow!("Multiple statements are not allowed"));
        }
    }
    
    // Block comments (often used in injection attacks)
    if sql.contains("--") || sql.contains("/*") {
        return Err(anyhow!("SQL comments are not allowed"));
    }
    
    Ok(())
}
}

Limitations of Query Validation

Query validation is a defense in depth measure, not a primary defense:

#![allow(unused)]
fn main() {
// These attacks might bypass simple validation:

// Unicode tricks
"SELECT * FROM users WHERE name = 'admin'--" // Normal
"SELECT * FROM users WHERE name = 'admin'－－" // Unicode dash

// Case variations
"sElEcT * fRoM users" // Mixed case

// Encoded characters
"SELECT%20*%20FROM%20users" // URL encoded

// Comments
"SELECT/**/*/**/FROM/**/users" // Block comments
}

Never rely on query validation alone. Use it alongside:

Database user with minimal privileges
Row limits
Query timeouts
Audit logging

Defense Layer 4: Database Permissions

The MCP server's database user should have minimal privileges:

-- Create a read-only user for the MCP server
CREATE USER 'mcp_reader'@'localhost' IDENTIFIED BY 'secure_password';

-- Grant only SELECT on specific tables
GRANT SELECT ON mydb.customers TO 'mcp_reader'@'localhost';
GRANT SELECT ON mydb.orders TO 'mcp_reader'@'localhost';
GRANT SELECT ON mydb.products TO 'mcp_reader'@'localhost';

-- Explicitly deny dangerous operations
-- (Usually not needed if you only GRANT SELECT, but good practice)
REVOKE ALL PRIVILEGES ON mydb.* FROM 'mcp_reader'@'localhost';
GRANT SELECT ON mydb.customers, mydb.orders, mydb.products TO 'mcp_reader'@'localhost';

For SQLite, use a read-only connection:

#![allow(unused)]
fn main() {
let pool = SqlitePoolOptions::new()
    .connect("sqlite:./data.db?mode=ro")  // Read-only mode
    .await?;
}

Defense Layer 5: Query Timeouts

Prevent denial-of-service via expensive queries:

#![allow(unused)]
fn main() {
use tokio::time::{timeout, Duration};

async fn execute_with_timeout(
    pool: &DbPool,
    query: &str,
    max_duration: Duration,
) -> Result<Vec<SqliteRow>> {
    timeout(max_duration, async {
        sqlx::query(query)
            .fetch_all(pool.as_ref())
            .await
    })
    .await
    .map_err(|_| anyhow!("Query timed out after {:?}", max_duration))?
    .map_err(|e| anyhow!("Query failed: {}", e))
}

// Usage
let rows = execute_with_timeout(
    &pool, 
    &query, 
    Duration::from_secs(30)
).await?;
}

Defense Layer 6: Result Limits

Always limit result sizes to prevent memory exhaustion:

#![allow(unused)]
fn main() {
const MAX_ROWS: i32 = 10_000;
const DEFAULT_ROWS: i32 = 100;

fn apply_limit(query: &str, requested_limit: Option<i32>) -> String {
    let limit = requested_limit
        .unwrap_or(DEFAULT_ROWS)
        .min(MAX_ROWS);
    
    let query_upper = query.to_uppercase();
    
    if query_upper.contains("LIMIT") {
        // Already has LIMIT - don't add another
        // But we should validate the existing limit isn't too high
        query.to_string()
    } else {
        format!("{} LIMIT {}", query.trim_end_matches(';'), limit)
    }
}
}

Defense Layer 7: Audit Logging

Log all queries for security monitoring:

#![allow(unused)]
fn main() {
use tracing::{info, warn};

async fn execute_query(
    pool: &DbPool,
    query: &str,
    user_id: &str,
) -> Result<QueryOutput> {
    let start = std::time::Instant::now();
    
    // Log the query attempt
    info!(
        user_id = %user_id,
        query_preview = %query.chars().take(100).collect::<String>(),
        "Query execution started"
    );
    
    let result = sqlx::query(query)
        .fetch_all(pool.as_ref())
        .await;
    
    let duration = start.elapsed();
    
    match &result {
        Ok(rows) => {
            info!(
                user_id = %user_id,
                row_count = rows.len(),
                duration_ms = duration.as_millis(),
                "Query completed successfully"
            );
        }
        Err(e) => {
            warn!(
                user_id = %user_id,
                error = %e,
                duration_ms = duration.as_millis(),
                "Query failed"
            );
        }
    }
    
    // Convert result...
    Ok(result?)
}
}

Complete Secure Query Implementation

Here's a production-ready query tool with all defenses:

#![allow(unused)]
fn main() {
use anyhow::{Result, anyhow};
use tokio::time::{timeout, Duration};
use tracing::{info, warn};

const MAX_ROWS: i32 = 10_000;
const DEFAULT_ROWS: i32 = 100;
const QUERY_TIMEOUT: Duration = Duration::from_secs(30);

const BLOCKED_KEYWORDS: &[&str] = &[
    "INSERT", "UPDATE", "DELETE", "DROP", "CREATE", "ALTER",
    "TRUNCATE", "EXEC", "EXECUTE", "GRANT", "REVOKE",
    "INTO OUTFILE", "INTO DUMPFILE", "LOAD_FILE",
];

pub async fn secure_query(
    pool: &DbPool,
    input: QueryInput,
    user_context: &UserContext,
) -> Result<QueryOutput> {
    // Layer 3: Query validation
    validate_query(&input.query)?;
    
    // Layer 6: Apply row limit
    let limit = input.limit.unwrap_or(DEFAULT_ROWS).min(MAX_ROWS);
    let limited_query = apply_limit(&input.query, limit);
    
    // Layer 7: Audit logging
    info!(
        user_id = %user_context.user_id,
        query = %limited_query,
        "Executing query"
    );
    
    // Layer 5: Timeout
    let result = timeout(QUERY_TIMEOUT, async {
        sqlx::query(&limited_query)
            .fetch_all(pool.as_ref())
            .await
    })
    .await
    .map_err(|_| anyhow!("Query timed out"))?
    .map_err(|e| anyhow!("Query failed: {}", e))?;
    
    // Check truncation
    let truncated = result.len() > limit as usize;
    let rows: Vec<_> = result.into_iter().take(limit as usize).collect();
    
    info!(
        user_id = %user_context.user_id,
        row_count = rows.len(),
        truncated = truncated,
        "Query completed"
    );
    
    Ok(format_output(rows, truncated))
}

fn validate_query(sql: &str) -> Result<()> {
    let sql_upper = sql.trim().to_uppercase();
    
    if !sql_upper.starts_with("SELECT") {
        return Err(anyhow!("Only SELECT queries are allowed"));
    }
    
    for keyword in BLOCKED_KEYWORDS {
        if sql_upper.contains(keyword) {
            return Err(anyhow!("'{}' is not allowed", keyword));
        }
    }
    
    if sql.matches(';').count() > 1 {
        return Err(anyhow!("Multiple statements not allowed"));
    }
    
    Ok(())
}

fn apply_limit(query: &str, limit: i32) -> String {
    if query.to_uppercase().contains("LIMIT") {
        query.to_string()
    } else {
        format!("{} LIMIT {}", query.trim_end_matches(';'), limit + 1)
    }
}
}

User Context and Token Pass-Through

The user_context parameter in the examples above is more than just a logging convenience—in production, it represents the authenticated user and should flow through to your backend systems.

Where Does UserContext Come From?

In production, UserContext is extracted from the OAuth access token in the MCP request:

#![allow(unused)]
fn main() {
/// User context extracted from OAuth access token
pub struct UserContext {
    /// User ID from the identity provider
    pub user_id: String,

    /// The raw access token - pass this to backend systems
    pub access_token: String,

    /// User's roles/groups from token claims
    pub roles: Vec<String>,
}

impl UserContext {
    /// Extract from MCP request metadata (simplified)
    pub fn from_request(extra: &RequestExtra) -> Result<Self> {
        let token = extra.headers
            .get("authorization")
            .and_then(|h| h.strip_prefix("Bearer "))
            .ok_or_else(|| anyhow!("Missing authorization header"))?;

        // Validate token and extract claims
        let claims = validate_jwt(token)?;

        Ok(Self {
            user_id: claims.sub,
            access_token: token.to_string(),
            roles: claims.groups,
        })
    }
}
}

Pass Tokens to Backend Systems

The MCP server should not be the source of truth for permissions. Pass the user's access token to your backend data systems and let them enforce authorization:

#![allow(unused)]
fn main() {
pub async fn secure_query_with_passthrough(
    pool: &DbPool,
    input: QueryInput,
    user_context: &UserContext,
) -> Result<QueryOutput> {
    // For databases that support session context (PostgreSQL, Oracle):
    // Pass the user identity so row-level security policies apply
    sqlx::query("SELECT set_config('app.current_user', $1, true)")
        .bind(&user_context.user_id)
        .execute(pool.as_ref())
        .await?;

    // Now queries are filtered by database RLS policies
    let result = sqlx::query(&input.query)
        .fetch_all(pool.as_ref())
        .await?;

    // ...
}
}

For external APIs, pass the token in the request:

#![allow(unused)]
fn main() {
pub async fn call_backend_api(
    client: &reqwest::Client,
    user_context: &UserContext,
    endpoint: &str,
) -> Result<serde_json::Value> {
    // Pass the user's token - let the backend validate permissions
    let response = client.get(endpoint)
        .header("Authorization", format!("Bearer {}", user_context.access_token))
        .send()
        .await?;

    // Backend enforces what this user can access
    Ok(response.json().await?)
}
}

Learn More: See Part 5: Security for complete OAuth integration patterns, including extracting tokens from MCP requests and configuring row-level security in PostgreSQL.

Security Checklist

Before deploying your database MCP server:

Layer	Check	Status
Authentication	OAuth required for all requests	☐
Token Pass-Through	Access tokens passed to backend systems	☐
Parameterization	All user values use `.bind()`	☐
Allowlisting	Table/column names validated against lists	☐
Query Validation	Dangerous keywords blocked	☐
Permissions	Database user has SELECT only	☐
Timeouts	Queries timeout after reasonable duration	☐
Limits	Result size is bounded	☐
Logging	All queries are logged with user context	☐
Sensitive Data	PII/secrets columns are filtered	☐

Common Mistakes to Avoid

❌ Blocklisting Instead of Allowlisting

#![allow(unused)]
fn main() {
// BAD: Trying to block known bad things
if !input.contains("DROP") && !input.contains("DELETE") {
    // Still vulnerable to: DrOp, DEL/**/ETE, etc.
}

// GOOD: Only allow known good things
if ALLOWED_TABLES.contains(&table) {
    // Secure - we control the list
}
}

❌ Trusting Client-Side Validation

#![allow(unused)]
fn main() {
// BAD: Assuming the schema validation caught everything
// JsonSchema regex can be bypassed by determined attackers
#[schemars(regex(pattern = r"^SELECT"))]
query: String,  // Don't rely on this alone!

// GOOD: Always validate server-side
fn validate_query(sql: &str) -> Result<()> {
    // Server-side validation that can't be bypassed
}
}

❌ Logging Sensitive Data

#![allow(unused)]
fn main() {
// BAD: Logging full query might expose sensitive filters
info!("Query: {}", query);  // Might contain: WHERE ssn = '123-45-6789'

// GOOD: Log query structure, not values
info!(
    query_type = "SELECT",
    tables = ?extract_tables(&query),
    "Query executed"
);
}

Continue to Resource-Based Data Patterns →

Resource-Based Data Patterns

MCP offers two ways to expose data: tools and resources. Understanding when to use each is key to building intuitive database servers.

Tools vs Resources: When to Use Each

Aspect	Tools	Resources
Nature	Actions, operations, queries	Documentation, reference data, metadata
Data	Dynamic, user-specific	Static or slowly-changing
Parameters	Flexible input parameters	URI-based, limited parameters
Use case	"Do something"	"Read about something"
Caching	Usually not cached	Often cached

Use Resources For:

Database schema documentation - Table structures, column descriptions
Reference data - Country codes, status enums, category lists
Configuration - Database settings, connection info
Metadata - Relationships, indexes, constraints
Help/documentation - Query examples, usage guides

Use Tools For:

Data queries - SELECT with filters, joins, aggregations
Entity lookups - Finding customers, orders, products
Search - Full-text search, fuzzy matching
Analytics - Aggregations, reports, dashboards

Why Not `db://customers/12345`?

You might think resources are good for entity lookups like db://customers/12345. But consider:

Resource approach:
  Claude: "I need customer 12345"
  → Read db://customers/12345
  → Returns one customer
  → Claude: "Now I need their orders"
  → Read db://customers/12345/orders
  → Returns orders
  → Claude: "What's their total spend?"
  → ??? No resource for aggregations

Tool approach:
  Claude: "I need customer 12345 with their order history and total spend"
  → query("SELECT c.*, SUM(o.total) as total_spend 
           FROM customers c 
           JOIN orders o ON c.id = o.customer_id 
           WHERE c.id = 12345
           GROUP BY c.id")
  → Returns everything in one call

Tools are more flexible for data access. Resources shine for metadata and documentation.

Practical Resource Examples

Example 1: Database Schema Resource

Expose the database schema as a readable resource that Claude can reference:

#![allow(unused)]
fn main() {
use pmcp::resource::{Resource, ResourceContent, ResourceInfo};

/// Database schema documentation as a resource
pub struct SchemaResource {
    pool: DbPool,
}

impl Resource for SchemaResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://schema".to_string(),
            name: "Database Schema".to_string(),
            description: Some(
                "Complete database schema with tables, columns, types, and relationships. \
                 Use this to understand the database structure before writing queries."
                    .to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let schema = self.build_schema_documentation().await?;
        Ok(ResourceContent::json(&schema)?)
    }
}

#[derive(Serialize)]
struct SchemaDocumentation {
    database_name: String,
    tables: Vec<TableDocumentation>,
    relationships: Vec<Relationship>,
    notes: Vec<String>,
}

#[derive(Serialize)]
struct TableDocumentation {
    name: String,
    description: String,
    columns: Vec<ColumnDocumentation>,
    primary_key: Vec<String>,
    row_count: i64,
    example_query: String,
}

#[derive(Serialize)]
struct ColumnDocumentation {
    name: String,
    data_type: String,
    nullable: bool,
    description: String,  // Can be populated from comments or a separate config
}

#[derive(Serialize)]
struct Relationship {
    from_table: String,
    from_column: String,
    to_table: String,
    to_column: String,
    relationship_type: String,  // "one-to-many", "many-to-many", etc.
}

impl SchemaResource {
    async fn build_schema_documentation(&self) -> Result<SchemaDocumentation> {
        let tables = self.get_all_tables().await?;
        let relationships = self.get_foreign_keys().await?;
        
        Ok(SchemaDocumentation {
            database_name: "Chinook Music Store".to_string(),
            tables,
            relationships,
            notes: vec![
                "All timestamps are in UTC".to_string(),
                "Monetary values are in USD".to_string(),
                "Use JOINs on foreign key relationships for related data".to_string(),
            ],
        })
    }

    async fn get_foreign_keys(&self) -> Result<Vec<Relationship>> {
        // Query SQLite's foreign key info
        let mut relationships = Vec::new();
        
        let tables: Vec<(String,)> = sqlx::query_as(
            "SELECT name FROM sqlite_master WHERE type='table'"
        )
        .fetch_all(self.pool.as_ref())
        .await?;

        for (table,) in tables {
            let fks = sqlx::query(&format!("PRAGMA foreign_key_list({})", table))
                .fetch_all(self.pool.as_ref())
                .await?;
            
            for fk in fks {
                relationships.push(Relationship {
                    from_table: table.clone(),
                    from_column: fk.get("from"),
                    to_table: fk.get("table"),
                    to_column: fk.get("to"),
                    relationship_type: "many-to-one".to_string(),
                });
            }
        }
        
        Ok(relationships)
    }
}
}

How Claude uses this:

User: "What tables are related to customers?"

Claude: [Reads db://schema resource]
        
Based on the schema, the customers table is related to:
- invoices (customers.CustomerId → invoices.CustomerId) - one-to-many
- Each customer can have multiple invoices

The invoices table connects to:
- invoice_items (invoices.InvoiceId → invoice_items.InvoiceId)
- Which connects to tracks for the actual purchased items

Example 2: Table-Specific Schema Resource

Provide detailed documentation for each table:

#![allow(unused)]
fn main() {
/// Individual table documentation
pub struct TableSchemaResource {
    pool: DbPool,
}

impl Resource for TableSchemaResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri_template: "db://schema/{table_name}".to_string(),
            name: "Table Schema".to_string(),
            description: Some(
                "Detailed schema for a specific table including columns, \
                 types, constraints, and example queries.".to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, uri: &str) -> Result<ResourceContent> {
        let table_name = uri.strip_prefix("db://schema/")
            .ok_or_else(|| anyhow!("Invalid URI"))?;
        
        // Validate table exists
        let valid_tables = self.get_table_names().await?;
        if !valid_tables.contains(&table_name.to_string()) {
            return Err(anyhow!("Table '{}' not found", table_name));
        }
        
        let doc = self.build_table_documentation(table_name).await?;
        Ok(ResourceContent::json(&doc)?)
    }
}

impl TableSchemaResource {
    async fn build_table_documentation(&self, table: &str) -> Result<TableDocumentation> {
        let columns = self.get_columns(table).await?;
        let pk = self.get_primary_key(table).await?;
        let row_count = self.get_row_count(table).await?;
        
        Ok(TableDocumentation {
            name: table.to_string(),
            description: self.get_table_description(table),
            columns,
            primary_key: pk,
            row_count,
            example_query: format!(
                "SELECT * FROM {} LIMIT 10", 
                table
            ),
        })
    }
    
    fn get_table_description(&self, table: &str) -> String {
        // In production, this might come from a config file or database comments
        match table {
            "customers" => "Customer information including contact details and location",
            "invoices" => "Sales transactions with date, customer, and billing info",
            "invoice_items" => "Line items for each invoice, linking to tracks",
            "tracks" => "Music tracks with duration, genre, and pricing",
            "albums" => "Music albums with artist reference",
            "artists" => "Music artists/bands",
            "genres" => "Music genre categories",
            "playlists" => "User-created playlists",
            "employees" => "Company employees with reporting structure",
            _ => "No description available",
        }.to_string()
    }
}
}

Example 3: Reference Data Resources

Static lookup tables work well as resources:

#![allow(unused)]
fn main() {
/// Reference data: All available genres
pub struct GenresResource {
    pool: DbPool,
}

impl Resource for GenresResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://reference/genres".to_string(),
            name: "Music Genres".to_string(),
            description: Some(
                "List of all music genres in the database. \
                 Use these values when filtering tracks by genre.".to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let genres: Vec<Genre> = sqlx::query_as(
            "SELECT GenreId, Name FROM genres ORDER BY Name"
        )
        .fetch_all(self.pool.as_ref())
        .await?;
        
        Ok(ResourceContent::json(&genres)?)
    }
}

/// Reference data: All media types
pub struct MediaTypesResource {
    pool: DbPool,
}

impl Resource for MediaTypesResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://reference/media-types".to_string(),
            name: "Media Types".to_string(),
            description: Some(
                "Available media formats (MP3, AAC, etc.). \
                 Use when filtering or understanding track formats.".to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let types: Vec<MediaType> = sqlx::query_as(
            "SELECT MediaTypeId, Name FROM media_types ORDER BY Name"
        )
        .fetch_all(self.pool.as_ref())
        .await?;
        
        Ok(ResourceContent::json(&types)?)
    }
}
}

Example 4: Query Examples Resource

Help Claude write better queries:

#![allow(unused)]
fn main() {
/// Example queries for common operations
pub struct QueryExamplesResource;

impl Resource for QueryExamplesResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://help/query-examples".to_string(),
            name: "Query Examples".to_string(),
            description: Some(
                "Example SQL queries for common operations. \
                 Reference these patterns when writing queries.".to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let examples = vec![
            QueryExample {
                name: "Customer with orders",
                description: "Get a customer and their order history",
                sql: r#"
                    SELECT c.FirstName, c.LastName, c.Email,
                           i.InvoiceId, i.InvoiceDate, i.Total
                    FROM customers c
                    JOIN invoices i ON c.CustomerId = i.CustomerId
                    WHERE c.CustomerId = ?
                    ORDER BY i.InvoiceDate DESC
                "#.to_string(),
            },
            QueryExample {
                name: "Top selling tracks",
                description: "Tracks ordered by number of sales",
                sql: r#"
                    SELECT t.Name as Track, ar.Name as Artist, 
                           COUNT(*) as TimesSold
                    FROM tracks t
                    JOIN invoice_items ii ON t.TrackId = ii.TrackId
                    JOIN albums al ON t.AlbumId = al.AlbumId
                    JOIN artists ar ON al.ArtistId = ar.ArtistId
                    GROUP BY t.TrackId
                    ORDER BY TimesSold DESC
                    LIMIT 10
                "#.to_string(),
            },
            QueryExample {
                name: "Revenue by country",
                description: "Total sales grouped by customer country",
                sql: r#"
                    SELECT c.Country, 
                           COUNT(DISTINCT c.CustomerId) as Customers,
                           SUM(i.Total) as Revenue
                    FROM customers c
                    JOIN invoices i ON c.CustomerId = i.CustomerId
                    GROUP BY c.Country
                    ORDER BY Revenue DESC
                "#.to_string(),
            },
            QueryExample {
                name: "Genre popularity",
                description: "Number of tracks per genre",
                sql: r#"
                    SELECT g.Name as Genre, COUNT(*) as TrackCount
                    FROM genres g
                    JOIN tracks t ON g.GenreId = t.GenreId
                    GROUP BY g.GenreId
                    ORDER BY TrackCount DESC
                "#.to_string(),
            },
        ];
        
        Ok(ResourceContent::json(&examples)?)
    }
}

#[derive(Serialize)]
struct QueryExample {
    name: &'static str,
    description: &'static str,
    sql: String,
}
}

Example 5: Loading Resources from Files

Not all documentation comes from developers. DBAs, data analysts, and domain experts often maintain documentation in markdown or text files. Loading resources from the filesystem lets non-developers contribute without touching Rust code.

Directory structure:

db-explorer/
├── src/
│   └── main.rs
├── docs/                          # Maintained by DBAs/analysts
│   ├── database-guide.md
│   ├── tables/
│   │   ├── customers.md
│   │   ├── invoices.md
│   │   └── tracks.md
│   └── query-patterns.md
└── Cargo.toml

Example markdown file (docs/tables/customers.md):

# Customers Table

The customers table stores contact information for all registered customers.

## Columns

| Column | Type | Description |
|--------|------|-------------|
| CustomerId | INTEGER | Primary key, auto-increment |
| FirstName | TEXT | Customer's first name (required) |
| LastName | TEXT | Customer's last name (required) |
| Email | TEXT | Unique email address (required) |
| Company | TEXT | Company name (optional) |
| Phone | TEXT | Contact phone number |
| Country | TEXT | Billing country |

## Common Queries

Find customers by country:
```sql
SELECT * FROM customers WHERE Country = 'USA' ORDER BY LastName;

Find customers with their total spend:

SELECT c.FirstName, c.LastName, SUM(i.Total) as TotalSpend
FROM customers c
JOIN invoices i ON c.CustomerId = i.CustomerId
GROUP BY c.CustomerId
ORDER BY TotalSpend DESC;

Business Rules

Email must be unique across all customers
All monetary values are stored in USD
Customer deletion is soft-delete only (sets DeletedAt timestamp)


**Loading markdown files as resources:**

```rust
use std::path::{Path, PathBuf};
use tokio::fs;

/// Documentation loaded from markdown files
pub struct FileDocumentationResource {
    docs_dir: PathBuf,
}

impl FileDocumentationResource {
    pub fn new(docs_dir: impl AsRef<Path>) -> Self {
        Self {
            docs_dir: docs_dir.as_ref().to_path_buf(),
        }
    }
}

impl Resource for FileDocumentationResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://docs/tables/{table_name}".to_string(),
            name: "Table Documentation".to_string(),
            description: Some(
                "Human-written documentation for database tables. \
                 Includes column descriptions, business rules, and example queries. \
                 Maintained by DBAs and data analysts.".to_string()
            ),
            mime_type: Some("text/markdown".to_string()),
        }
    }

    async fn read(&self, uri: &str) -> Result<ResourceContent> {
        let table_name = uri.strip_prefix("db://docs/tables/")
            .ok_or_else(|| anyhow!("Invalid URI format"))?;
        
        // Prevent path traversal attacks
        if table_name.contains("..") || table_name.contains('/') {
            return Err(anyhow!("Invalid table name"));
        }
        
        let file_path = self.docs_dir
            .join("tables")
            .join(format!("{}.md", table_name));
        
        // Check file exists within docs directory
        let canonical = file_path.canonicalize()
            .map_err(|_| anyhow!("Documentation not found for table '{}'", table_name))?;
        
        if !canonical.starts_with(self.docs_dir.canonicalize()?) {
            return Err(anyhow!("Invalid path"));
        }
        
        let content = fs::read_to_string(&file_path).await
            .map_err(|_| anyhow!("Documentation not found for table '{}'", table_name))?;
        
        Ok(ResourceContent::text(content))
    }
}

/// Database guide - single file resource
pub struct DatabaseGuideResource {
    docs_dir: PathBuf,
}

impl Resource for DatabaseGuideResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://docs/guide".to_string(),
            name: "Database Guide".to_string(),
            description: Some(
                "Comprehensive database guide written by the DBA team. \
                 Includes naming conventions, relationships, and best practices.".to_string()
            ),
            mime_type: Some("text/markdown".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let file_path = self.docs_dir.join("database-guide.md");
        let content = fs::read_to_string(&file_path).await
            .map_err(|_| anyhow!("Database guide not found"))?;
        
        Ok(ResourceContent::text(content))
    }
}

Listing available documentation files:

#![allow(unused)]
fn main() {
/// List all available table documentation
pub struct TableDocsListResource {
    docs_dir: PathBuf,
}

impl Resource for TableDocsListResource {
    fn info(&self) -> ResourceInfo {
        ResourceInfo {
            uri: "db://docs/tables".to_string(),
            name: "Available Table Documentation".to_string(),
            description: Some(
                "Lists all tables that have documentation available.".to_string()
            ),
            mime_type: Some("application/json".to_string()),
        }
    }

    async fn read(&self, _uri: &str) -> Result<ResourceContent> {
        let tables_dir = self.docs_dir.join("tables");
        let mut entries = fs::read_dir(&tables_dir).await?;
        
        let mut tables = Vec::new();
        while let Some(entry) = entries.next_entry().await? {
            let path = entry.path();
            if path.extension().map_or(false, |ext| ext == "md") {
                if let Some(stem) = path.file_stem() {
                    tables.push(stem.to_string_lossy().to_string());
                }
            }
        }
        
        tables.sort();
        
        Ok(ResourceContent::json(&serde_json::json!({
            "tables": tables,
            "note": "Use db://docs/tables/{name} to read specific documentation"
        }))?)
    }
}
}

Why file-based resources?

Approach	Best For
Rust code (hardcoded)	Static strings, compile-time constants
Database queries	Dynamic data, schema introspection
File system	Human-maintained docs, external contributions

Benefits of file-based documentation:

Non-developer contributions - DBAs edit markdown, not Rust
Version control - Documentation changes tracked in git
No recompilation - Update docs without rebuilding
Rich formatting - Markdown supports tables, code blocks, links
External tools - Documentation can be generated by other tools

Hot reloading pattern:

For development, you might want to reload documentation without restarting:

#![allow(unused)]
fn main() {
impl Resource for FileDocumentationResource {
    fn cache_hint(&self) -> Option<Duration> {
        // In development: no caching, always fresh
        #[cfg(debug_assertions)]
        return None;
        
        // In production: cache for 5 minutes
        #[cfg(not(debug_assertions))]
        return Some(Duration::from_secs(300));
    }
}
}

Registering Resources

Add resources alongside your tools:

#![allow(unused)]
fn main() {
let docs_dir = PathBuf::from("./docs");

let server = ServerBuilder::new("db-explorer", "1.0.0")
    .capabilities(ServerCapabilities {
        tools: Some(ToolCapabilities::default()),
        resources: Some(ResourceCapabilities::default()),
        ..Default::default()
    })
    // Tools for dynamic queries
    .tool(ListTables::new(pool.clone()).into_tool())
    .tool(Query::new(pool.clone()).into_tool())
    // Resources from database introspection
    .resource(SchemaResource::new(pool.clone()))
    .resource(TableSchemaResource::new(pool.clone()))
    .resource(GenresResource::new(pool.clone()))
    .resource(MediaTypesResource::new(pool.clone()))
    // Resources from code
    .resource(QueryExamplesResource)
    // Resources from filesystem (maintained by DBAs)
    .resource(DatabaseGuideResource::new(docs_dir.clone()))
    .resource(TableDocsListResource::new(docs_dir.clone()))
    .resource(FileDocumentationResource::new(docs_dir))
    .build()?;
}

How Claude Uses Resources

When Claude connects to your server, it discovers available resources:

Available Resources:
- db://schema - Complete database schema
- db://schema/{table_name} - Schema for specific table
- db://reference/genres - Music genre list
- db://reference/media-types - Media format list
- db://help/query-examples - Example SQL queries
- db://docs/guide - Database guide (from file)
- db://docs/tables - List of documented tables
- db://docs/tables/{table_name} - Table documentation (from file)

Claude's workflow:

User: "What genres of music are in the database?"

Claude thinking:
  - This is asking about reference data
  - I can read db://reference/genres
  - No need to write a query

Claude: [Reads db://reference/genres]
        
The database contains 25 music genres:
Alternative, Blues, Classical, Comedy, Country...

User: "Show me the top 5 rock artists by sales"

Claude thinking:
  - I need to write a query
  - Let me check db://schema for table structure
  - And db://help/query-examples for patterns

Claude: [Reads db://schema]
        [Reads db://help/query-examples]
        [Uses query tool with adapted SQL]

Benefits of This Pattern

1. Better AI Understanding

Resources give Claude context without requiring queries:

Without resources:
  Claude must guess table/column names or call list_tables first

With resources:
  Claude reads schema once, understands the entire database

2. Reduced Tool Calls

Without resources:
  1. list_tables() - What tables exist?
  2. query("PRAGMA table_info(customers)") - What columns?
  3. query("PRAGMA foreign_key_list(customers)") - Relationships?
  4. query("SELECT...") - Finally, the actual query

With resources:
  1. Read db://schema - Understand everything
  2. query("SELECT...") - Execute the query

3. Cacheable Documentation

Resources can be cached since they change infrequently:

#![allow(unused)]
fn main() {
impl Resource for SchemaResource {
    fn cache_hint(&self) -> Option<Duration> {
        Some(Duration::from_secs(300))  // Cache for 5 minutes
    }
}
}

4. Clear Separation of Concerns

Resource	Purpose
`db://schema`	Understand the database
`db://reference/*`	Lookup valid values
`db://help/*`	Learn query patterns

Tool	Purpose
`query`	Execute any SELECT
`list_tables`	Quick table overview

Summary

When to Use Each Approach

Data Type	Approach	Example
Table structures	Resource	`db://schema`
Column descriptions	Resource	`db://schema/customers`
Lookup tables (genres, countries)	Resource	`db://reference/genres`
Foreign key relationships	Resource	Part of `db://schema`
Query patterns/examples	Resource	`db://help/query-examples`
Human-written docs	Resource	`db://docs/tables/customers`
Entity data (customers, orders)	Tool	`query("SELECT...")`
Aggregations (totals, counts)	Tool	`query("SELECT SUM...")`
Search/filtering	Tool	`query("SELECT...WHERE...")`

Three Ways to Populate Resources

Source	Best For	Example
Database queries	Dynamic schema, reference tables	`db://schema`, `db://reference/genres`
Rust code	Static content, computed examples	`db://help/query-examples`
Filesystem	Human-maintained docs, external tools	`db://docs/tables/{name}`

Resources = Documentation. Tools = Operations.

Continue to Handling Large Results →

Handling Large Results

Enterprise databases contain millions of rows. When Claude asks "Show me all customers," you can't return everything at once. This section covers patterns for handling large result sets safely and efficiently.

The Problem with Large Results

Returning too much data causes multiple problems:

Problem	Impact
Memory exhaustion	Server crashes with OOM
Slow responses	Users wait forever
Context overflow	AI can't process millions of rows
Network costs	Unnecessary data transfer
Poor UX	Information overload

Pagination Strategies

Strategy 1: Offset Pagination (Simple but Limited)

SELECT * FROM customers ORDER BY id LIMIT 100 OFFSET 0    -- Page 1
SELECT * FROM customers ORDER BY id LIMIT 100 OFFSET 100  -- Page 2
SELECT * FROM customers ORDER BY id LIMIT 100 OFFSET 200  -- Page 3

Implementation:

#![allow(unused)]
fn main() {
#[derive(Debug, Deserialize, JsonSchema)]
pub struct OffsetPaginatedInput {
    pub query: String,
    
    #[serde(default = "default_page")]
    pub page: i32,
    
    #[serde(default = "default_page_size")]
    pub page_size: i32,
}

fn default_page() -> i32 { 0 }
fn default_page_size() -> i32 { 50 }

#[derive(Debug, Serialize, JsonSchema)]
pub struct OffsetPaginatedOutput {
    pub rows: Vec<Vec<serde_json::Value>>,
    pub columns: Vec<String>,
    pub page: i32,
    pub page_size: i32,
    pub has_more: bool,
}

async fn paginated_query(pool: &DbPool, input: OffsetPaginatedInput) -> Result<OffsetPaginatedOutput> {
    let page_size = input.page_size.min(100);  // Cap at 100
    let offset = input.page * page_size;
    
    // Fetch one extra to detect if there are more
    let query = format!(
        "{} LIMIT {} OFFSET {}",
        input.query.trim_end_matches(';'),
        page_size + 1,
        offset
    );
    
    let rows = execute_query(pool, &query).await?;
    let has_more = rows.len() > page_size as usize;
    let rows: Vec<_> = rows.into_iter().take(page_size as usize).collect();
    
    Ok(OffsetPaginatedOutput {
        rows,
        columns: vec![],  // Extract from first row
        page: input.page,
        page_size,
        has_more,
    })
}
}

Problems with offset pagination:

Page 1:     OFFSET 0    → Scans 100 rows      ✓ Fast
Page 100:   OFFSET 9900 → Scans 10,000 rows   ⚠ Slow
Page 10000: OFFSET 999900 → Scans 1M rows    ✗ Very slow

The database must skip all offset rows before returning results. This gets slower as you paginate deeper.

Strategy 2: Cursor Pagination (Recommended)

Cursor pagination uses the last seen value to fetch the next page:

-- First page
SELECT * FROM customers ORDER BY id LIMIT 100

-- Next page (where 12345 was the last ID)
SELECT * FROM customers WHERE id > 12345 ORDER BY id LIMIT 100

This is O(1) regardless of how deep you paginate—the database uses an index seek, not a scan.

Implementation:

#![allow(unused)]
fn main() {
use base64::{Engine as _, engine::general_purpose::STANDARD as BASE64};

/// Opaque cursor containing pagination state
#[derive(Debug, Serialize, Deserialize)]
struct Cursor {
    /// The last seen ID
    last_id: i64,
    /// Table name (for validation)
    table: String,
    /// Sort column
    sort_column: String,
    /// Sort direction
    ascending: bool,
}

impl Cursor {
    /// Encode cursor to opaque string
    fn encode(&self) -> String {
        let json = serde_json::to_string(self).unwrap();
        BASE64.encode(json.as_bytes())
    }
    
    /// Decode cursor from opaque string
    fn decode(encoded: &str) -> Result<Self> {
        let bytes = BASE64.decode(encoded)
            .map_err(|_| anyhow!("Invalid cursor"))?;
        let json = String::from_utf8(bytes)
            .map_err(|_| anyhow!("Invalid cursor encoding"))?;
        serde_json::from_str(&json)
            .map_err(|_| anyhow!("Invalid cursor format"))
    }
}

#[derive(Debug, Deserialize, JsonSchema)]
pub struct CursorPaginatedInput {
    /// Table to query
    pub table: String,
    
    /// Number of results per page (max 100)
    #[serde(default = "default_page_size")]
    pub page_size: i32,
    
    /// Cursor from previous response (omit for first page)
    pub cursor: Option<String>,
}

#[derive(Debug, Serialize, JsonSchema)]
pub struct CursorPaginatedOutput {
    pub rows: Vec<serde_json::Value>,
    pub columns: Vec<String>,
    pub count: usize,
    
    /// Cursor to fetch next page (null if no more data)
    pub next_cursor: Option<String>,
    
    /// Human-readable pagination status
    pub status: String,
}

const ALLOWED_TABLES: &[&str] = &["customers", "orders", "products", "invoices"];

async fn cursor_paginated_query(
    pool: &DbPool,
    input: CursorPaginatedInput,
) -> Result<CursorPaginatedOutput> {
    // Validate table
    if !ALLOWED_TABLES.contains(&input.table.as_str()) {
        return Err(anyhow!("Table '{}' not allowed", input.table));
    }
    
    let page_size = input.page_size.min(100);
    
    // Decode cursor if provided
    let (start_id, sort_col, ascending) = match &input.cursor {
        Some(cursor_str) => {
            let cursor = Cursor::decode(cursor_str)?;
            
            // Validate cursor is for the same table
            if cursor.table != input.table {
                return Err(anyhow!("Cursor is for different table"));
            }
            
            (cursor.last_id, cursor.sort_column, cursor.ascending)
        }
        None => (0, "id".to_string(), true),
    };
    
    // Build query with cursor condition
    let comparison = if ascending { ">" } else { "<" };
    let order = if ascending { "ASC" } else { "DESC" };
    
    let query = format!(
        "SELECT * FROM {} WHERE {} {} ? ORDER BY {} {} LIMIT ?",
        input.table,
        sort_col,
        comparison,
        start_id,
        sort_col,
        order
    );
    
    let rows = sqlx::query(&query)
        .bind(start_id)
        .bind(page_size + 1)  // Fetch one extra to detect more
        .fetch_all(pool.as_ref())
        .await?;
    
    let has_more = rows.len() > page_size as usize;
    let rows: Vec<_> = rows.into_iter().take(page_size as usize).collect();
    
    // Create next cursor if there are more rows
    let next_cursor = if has_more && !rows.is_empty() {
        let last_row = rows.last().unwrap();
        let last_id: i64 = last_row.try_get(&sort_col)?;
        
        Some(Cursor {
            last_id,
            table: input.table.clone(),
            sort_column: sort_col,
            ascending,
        }.encode())
    } else {
        None
    };
    
    let count = rows.len();
    let status = if count == 0 {
        "No results found.".to_string()
    } else if next_cursor.is_some() {
        format!("Showing {} results. Use next_cursor to see more.", count)
    } else {
        format!("Showing all {} results.", count)
    };
    
    Ok(CursorPaginatedOutput {
        rows: convert_rows(rows),
        columns: vec![],  // Extract from schema
        count,
        next_cursor,
        status,
    })
}
}

Why Include Table in Cursor?

The cursor includes the table name for security:

#![allow(unused)]
fn main() {
// Attacker tries to use a customers cursor on the users table
cursor = { last_id: 12345, table: "customers", ... }
input.table = "users"  // Trying to access different table

// Validation catches this:
if cursor.table != input.table {
    return Err(anyhow!("Cursor is for different table"));
}
}

Without this check, an attacker could:

Get a cursor for a public table
Use it to paginate through a private table

Streaming Large Results

For very large exports, consider streaming:

#![allow(unused)]
fn main() {
use futures::StreamExt;

async fn stream_query(
    pool: &DbPool,
    query: &str,
) -> impl futures::Stream<Item = Result<serde_json::Value>> {
    sqlx::query(query)
        .fetch(pool.as_ref())
        .map(|row_result| {
            row_result
                .map(|row| row_to_json(&row))
                .map_err(|e| anyhow!("Row error: {}", e))
        })
}

// Usage for large exports
async fn export_table(pool: &DbPool, table: &str, output: &mut File) -> Result<()> {
    let query = format!("SELECT * FROM {}", table);
    let mut stream = stream_query(pool, &query);
    
    while let Some(row_result) = stream.next().await {
        let row = row_result?;
        writeln!(output, "{}", serde_json::to_string(&row)?)?;
    }
    
    Ok(())
}
}

Note: Streaming isn't directly supported in MCP responses, but you can use it for:

File exports
Background processing
Chunked responses (if your transport supports it)

Memory-Safe Patterns

Pattern 1: Always Limit

#![allow(unused)]
fn main() {
const MAX_ROWS: i32 = 10_000;
const DEFAULT_ROWS: i32 = 100;

fn safe_limit(requested: Option<i32>) -> i32 {
    requested
        .unwrap_or(DEFAULT_ROWS)
        .min(MAX_ROWS)
        .max(1)  // At least 1
}
}

Pattern 2: Early Termination

#![allow(unused)]
fn main() {
async fn fetch_limited(pool: &DbPool, query: &str, max: usize) -> Result<Vec<Row>> {
    let mut rows = Vec::with_capacity(max);
    let mut stream = sqlx::query(query).fetch(pool.as_ref());
    
    while let Some(row) = stream.next().await {
        rows.push(row?);
        if rows.len() >= max {
            break;  // Stop fetching, even if more exist
        }
    }
    
    Ok(rows)
}
}

Pattern 3: Result Size Estimation

#![allow(unused)]
fn main() {
async fn check_result_size(pool: &DbPool, query: &str) -> Result<i64> {
    // Wrap query in COUNT to check size first
    let count_query = format!(
        "SELECT COUNT(*) FROM ({}) as subquery",
        query.trim_end_matches(';')
    );
    
    let count: (i64,) = sqlx::query_as(&count_query)
        .fetch_one(pool.as_ref())
        .await?;
    
    Ok(count.0)
}

async fn safe_query(pool: &DbPool, query: &str, limit: i32) -> Result<QueryOutput> {
    let estimated_size = check_result_size(pool, query).await?;
    
    if estimated_size > 100_000 {
        return Err(anyhow!(
            "Query would return {} rows. Please add filters or use pagination.",
            estimated_size
        ));
    }
    
    // Proceed with actual query
    execute_query(pool, query, limit).await
}
}

AI-Friendly Pagination Messages

Help Claude understand pagination state:

#![allow(unused)]
fn main() {
fn pagination_message(count: usize, total: Option<i64>, has_more: bool) -> String {
    match (total, has_more) {
        (Some(t), true) => format!(
            "Showing {} of {} total results. Use the next_cursor to fetch more.",
            count, t
        ),
        (Some(t), false) => format!(
            "Showing all {} results.",
            t
        ),
        (None, true) => format!(
            "Showing {} results. More are available - use next_cursor to continue.",
            count
        ),
        (None, false) => format!(
            "Showing {} results. This is the complete result set.",
            count
        ),
    }
}
}

Claude can then naturally say:

"I found 50 customers matching your criteria. There are more results available. Would you like me to fetch the next page?"

Performance Comparison

Approach	Page 1	Page 100	Page 10,000
No pagination	✗ OOM	✗ OOM	✗ OOM
OFFSET	10ms	100ms	5000ms
Cursor	10ms	10ms	10ms

Cursor pagination maintains constant performance regardless of depth.

When to Use Each Strategy

Scenario	Recommended Strategy
Simple UI pagination	Offset (if depth < 100 pages)
API pagination	Cursor
Search results	Cursor
Infinite scroll	Cursor
Admin data export	Streaming
Real-time feeds	Cursor + polling

Complete Pagination Implementation

#![allow(unused)]
fn main() {
/// Paginated query tool with cursor-based pagination
pub struct PaginatedQuery {
    pool: DbPool,
}

impl PaginatedQuery {
    pub fn new(pool: DbPool) -> Self {
        Self { pool }
    }

    pub fn into_tool(self) -> TypedTool<CursorPaginatedInput, CursorPaginatedOutput> {
        let pool = self.pool.clone();
        
        TypedTool::new(
            "paginated_query",
            "Query a table with cursor-based pagination. Returns a cursor for fetching additional pages.",
            move |input: CursorPaginatedInput| {
                let pool = pool.clone();
                Box::pin(async move {
                    cursor_paginated_query(&pool, input).await
                })
            },
        )
    }
}
}

Summary

Problem	Solution
Too many rows	Always use LIMIT
Deep pagination slow	Use cursor pagination
Memory exhaustion	Stream or chunk
AI can't process all data	Provide clear pagination status
Cursor tampering	Include table in cursor, validate

Continue to Chapter 3 Exercises to practice these patterns →

Chapter 3 Exercises

These exercises focus on database integration - the "killer app" for enterprise MCP servers.

Exercises

Building a Database Query Tool ⭐⭐ Intermediate (35 min)
- Create list_tables and execute_query tools
- Learn to structure database results for AI consumption
SQL Injection Code Review ⭐⭐ Intermediate (25 min)
- Identify SQL injection vulnerabilities
- Learn parameterized queries and allowlisting
Pagination Patterns ⭐⭐ Intermediate (30 min)
- Implement cursor-based pagination
- Handle large result sets safely

Next Steps

After completing these exercises:

SQL Safety and Injection Prevention - Deep dive into security
Handling Large Results - Production patterns

Exercise: Database Query Basics

ch03-01-db-query-basics

⭐⭐ intermediate ⏱️ 35 min

Database access is the "killer app" for enterprise MCP servers. When employees need data for AI conversations, they shouldn't have to export CSVs and paste into chat windows. An MCP server can provide secure, direct access.

In this exercise, you'll build a database query tool that:

Lists available tables
Executes read-only SQL queries
Returns structured results

🎯 Learning Objectives

Thinking

Why read-only access is essential for AI tools
How to structure database results for AI consumption
The tradeoffs between flexibility and security in query tools

Doing

Create tools that interact with SQLite databases
Use sqlx for async database operations
Structure output for AI-friendly consumption

💬 Discussion

Why might you want an AI to query databases directly instead of using pre-built reports?
What's the risk of allowing arbitrary SQL queries? How would you mitigate it?
How should results be formatted so an AI can understand and explain them?
What metadata would help an AI write better queries?

src/main.rs

💡 Hints

Hint 1: Querying SQLite schema

To list tables in SQLite:

#![allow(unused)]
fn main() {
let tables = sqlx::query("SELECT name FROM sqlite_master WHERE type='table'")
    .fetch_all(pool.as_ref())
    .await?;
}

Hint 2: Validating SELECT queries

Check that the query is read-only:

#![allow(unused)]
fn main() {
let trimmed = input.query.trim().to_uppercase();
if !trimmed.starts_with("SELECT") {
    return Err(anyhow!("Only SELECT queries are allowed"));
}
}

Hint 3: Complete execute_query

#![allow(unused)]
fn main() {
async fn execute_query(pool: &DbPool, input: &QueryInput) -> Result<QueryResult> {
    let trimmed = input.query.trim().to_uppercase();
    if !trimmed.starts_with("SELECT") {
        return Err(anyhow!("Only SELECT queries are allowed"));
    }
let query = if !trimmed.contains("LIMIT") {
    format!("{} LIMIT {}", input.query, input.limit)
} else {
    input.query.clone()
};

let rows = sqlx::query(&amp;query)
    .fetch_all(pool.as_ref())
    .await?;

// Process rows into structured output
// ...
}

}

⚠️ Try the exercise first!Show Solution

#![allow(unused)]
fn main() {
use pmcp::{Server, ServerCapabilities, ToolCapabilities};
use pmcp::server::TypedTool;
use serde::{Deserialize, Serialize};
use schemars::JsonSchema;
use anyhow::{Result, anyhow};
use sqlx::{Pool, Sqlite, sqlite::SqlitePoolOptions, Row, Column};
use std::sync::Arc;
type DbPool = Arc<Pool<Sqlite>>;
#[derive(Deserialize, JsonSchema)]
struct ListTablesInput {}
#[derive(Serialize)]
struct TableInfo {
name: String,
row_count: i64,
}
#[derive(Deserialize, JsonSchema)]
struct QueryInput {
query: String,
#[serde(default = "default_limit")]
limit: i32,
}
fn default_limit() -> i32 { 100 }
#[derive(Serialize)]
struct QueryResult {
columns: Vec<String>,
rows: Vec<Vec<serde_json::Value>>,
row_count: usize,
}
async fn list_tables(pool: &DbPool) -> Result<Vec<TableInfo>> {
let tables: Vec<(String,)> = sqlx::query_as(
"SELECT name FROM sqlite_master WHERE type='table' AND name NOT LIKE 'sqlite_%'"
)
.fetch_all(pool.as_ref())
.await?;
let mut result = Vec::new();
for (name,) in tables {
    let count: (i64,) = sqlx::query_as(&amp;format!(&quot;SELECT COUNT(*) FROM {}&quot;, name))
        .fetch_one(pool.as_ref())
        .await?;
    result.push(TableInfo { name, row_count: count.0 });
}

Ok(result)
}

}
async fn execute_query(pool: &DbPool, input: &QueryInput) -> Result<QueryResult> {
let trimmed = input.query.trim().to_uppercase();
if !trimmed.starts_with("SELECT") {
return Err(anyhow!("Only SELECT queries are allowed"));
}
let query = if !trimmed.contains(&quot;LIMIT&quot;) {
    format!(&quot;{} LIMIT {}&quot;, input.query, input.limit)
} else {
    input.query.clone()
};

let rows = sqlx::query(&amp;query)
    .fetch_all(pool.as_ref())
    .await?;

let columns: Vec&lt;String&gt; = if let Some(row) = rows.first() {
    row.columns().iter().map(|c| c.name().to_string()).collect()
} else {
    vec![]
};

let data: Vec&lt;Vec&lt;serde_json::Value&gt;&gt; = rows.iter().map(|row| {
    columns.iter().enumerate().map(|(i, _)| {
        row.try_get::&lt;String, _&gt;(i)
            .map(serde_json::Value::String)
            .unwrap_or(serde_json::Value::Null)
    }).collect()
}).collect();

Ok(QueryResult {
    row_count: data.len(),
    columns,
    rows: data,
})

}
#[tokio::main]
async fn main() -> Result<()> {
let database_url = std::env::var("DATABASE_URL")
.unwrap_or_else(|_| "sqlite:./data.db".to_string());
let pool = Arc::new(
    SqlitePoolOptions::new()
        .max_connections(5)
        .connect(&amp;database_url)
        .await?
);

let pool_for_tables = pool.clone();
let pool_for_query = pool.clone();

let server = Server::builder()
    .name(&quot;db-query&quot;)
    .version(&quot;1.0.0&quot;)
    .capabilities(ServerCapabilities {
        tools: Some(ToolCapabilities::default()),
        ..Default::default()
    })
    .tool(&quot;list_tables&quot;, TypedTool::new(&quot;list_tables&quot;, move |_: ListTablesInput| {
        let pool = pool_for_tables.clone();
        Box::pin(async move {
            let tables = list_tables(&amp;pool).await?;
            Ok(serde_json::to_value(tables)?)
        })
    }))
    .tool(&quot;execute_query&quot;, TypedTool::new(&quot;execute_query&quot;, move |input: QueryInput| {
        let pool = pool_for_query.clone();
        Box::pin(async move {
            let result = execute_query(&amp;pool, &amp;input).await?;
            Ok(serde_json::to_value(result)?)
        })
    }))
    .build()?;

println!(&quot;Database query server ready!&quot;);
Ok(())

}

Explanation

Connection Pooling: Using Arc<Pool> allows sharing the connection pool across multiple tool handlers efficiently.

Read-Only Validation: Checking for SELECT prevents destructive queries, though this is a basic check - production systems need more robust validation.

Result Structuring: Returning columns and rows separately helps AI understand the data schema.

LIMIT Enforcement: Adding a default LIMIT prevents accidentally returning millions of rows.

🤔 Reflection

What SQL injection risks remain even with SELECT-only validation?
How would you handle different data types (integers, dates, blobs)?
What additional metadata would help an AI write better queries?
How would you add pagination for large result sets?

Exercise: SQL Injection Review

ch03-02-sql-injection-review

⭐⭐ intermediate ⏱️ 25 min

You've been asked to review a database query tool before it goes to production. The developer is new to security and made several classic mistakes. SQL injection vulnerabilities can lead to data breaches, data loss, and complete system compromise.

This exercise builds on your code review skills from Chapter 2, now with a security focus. SQL injection is consistently in the OWASP Top 10 - it's one of the most common and dangerous vulnerabilities in web applications.

Your task: Identify ALL security vulnerabilities, categorize them by severity, and propose secure alternatives using parameterized queries.

🎯 Learning Objectives

Thinking

How SQL injection attacks work and why they're dangerous
Why string concatenation for SQL is always wrong
The difference between blocklisting and allowlisting

Doing

Identify multiple SQL injection vulnerabilities
Propose fixes using parameterized queries
Recognize insufficient security controls

💬 Discussion

How does SQL injection work? What allows it to happen?
Why is checking for "DROP" and "DELETE" not sufficient protection?
What's the fundamental problem with string concatenation in SQL?
How do parameterized queries prevent injection?

src/main.rs

💡 Hints

Hint 1

Look for string concatenation patterns like format!() or push_str() that include user input directly in SQL queries.

Hint 2

The blocklist approach (checking for "DROP" and "DELETE") can be bypassed. Consider: '; SELECT * FROM users WHERE role='admin' --

Hint 3

Issues to find:

Name filter: SQL injection via string concatenation
Email domain filter: SQL injection (no validation)
Sort column: SQL injection (arbitrary column/expression)
Sort order: Injection possible (only checks exact match)
get_user: user_id is String, concatenated without validation
update_nickname: Direct string concatenation
Architecture: UPDATE tool on "read-only" server

⚠️ Try the exercise first!Show Solution

#![allow(unused)]
fn main() {
// Secure implementation of search_users using parameterized queries
async fn search_users(pool: &DbPool, input: SearchUsersInput) -> anyhow::Result<Vec<User>> {
    let mut conditions = vec!["1=1".to_string()];
    let mut params: Vec<String> = vec![];
if let Some(name) = &amp;input.name {
    conditions.push(&quot;name LIKE ?&quot;.to_string());
    params.push(format!(&quot;%{}%&quot;, name));
}

if let Some(domain) = &amp;input.email_domain {
    conditions.push(&quot;email LIKE ?&quot;.to_string());
    params.push(format!(&quot;%@{}&quot;, domain));
}

// For ORDER BY, use an allowlist - can&#x27;t parameterize column names
let allowed_columns = [&quot;id&quot;, &quot;name&quot;, &quot;email&quot;];
let order_clause = match &amp;input.sort_by {
    Some(col) if allowed_columns.contains(&amp;col.as_str()) =&gt; {
        let direction = match &amp;input.sort_order {
            Some(o) if o.to_lowercase() == &quot;desc&quot; =&gt; &quot;DESC&quot;,
            _ =&gt; &quot;ASC&quot;,
        };
        format!(&quot; ORDER BY {} {}&quot;, col, direction)
    }
    _ =&gt; String::new(),
};

let query = format!(
    &quot;SELECT id, name, email, role FROM users WHERE {} LIMIT 100{}&quot;,
    conditions.join(&quot; AND &quot;),
    order_clause
);

// Build query with dynamic binding
let mut query_builder = sqlx::query_as::&lt;_, (i64, String, String, String)&gt;(&amp;query);
for param in &amp;params {
    query_builder = query_builder.bind(param);
}

let rows = query_builder.fetch_all(pool.as_ref()).await?;

Ok(rows.into_iter().map(|(id, name, email, role)| {
    User { id, name, email, role }
}).collect())
}

}
// Key security principles:
// - Never use string concatenation for SQL with user input
// - Blocklists can always be bypassed - use allowlists instead
// - Parameterized queries separate SQL structure from data
// - Defense in depth: read-only connections, least privilege, audit logging
// - Code comments don't enforce security - "read-only server" with UPDATE tool

🤔 Reflection

Why can't you parameterize ORDER BY column names?
What's the difference between escaping quotes and parameterized queries?
If the database user only has SELECT permission, is SQL injection still dangerous?
How would you test for SQL injection in an automated way?

Exercise: Pagination Patterns

ch03-03-pagination-patterns

⭐⭐ intermediate ⏱️ 30 min

Your database query tool from the previous exercise works great for small result sets, but what happens when a table has millions of rows? Without proper pagination:

Memory exhaustion: Loading 10M rows into memory crashes your server
Timeouts: Long queries block the connection pool
Poor UX: AI assistants can't process massive JSON responses effectively

This exercise teaches cursor-based pagination - the production pattern for handling large datasets efficiently. You'll learn why it's superior to offset-based pagination and how to implement it safely.

🎯 Learning Objectives

Thinking

Why offset pagination fails at scale (OFFSET 1000000 is slow)
How cursor-based pagination maintains consistent performance
Tradeoffs between different pagination strategies

Doing

Implement cursor-based pagination with a 'next' token
Handle edge cases (empty results, last page, invalid cursors)
Design API responses that guide AI assistants to fetch more

💬 Discussion

If you have 10 million rows and an AI asks for "all customers", what should happen?
Why is `OFFSET 999000 LIMIT 1000` slower than `WHERE id > 999000 LIMIT 1000`?
How should an MCP response indicate that more data is available?
What makes a good pagination cursor? (hint: not just a page number)

src/main.rs

💡 Hints

Hint 1

Start by validating the table is in the allowlist:

#![allow(unused)]
fn main() {
if !ALLOWED_TABLES.contains(&input.table.as_str()) {
    return Err(anyhow::anyhow!("Table not allowed"));
}
}

Hint 2

Build the query with cursor support:

#![allow(unused)]
fn main() {
let start_id = if let Some(cursor_str) = &input.cursor {
    let cursor = Cursor::decode(cursor_str)?;
    if cursor.table != input.table {
        return Err(anyhow::anyhow!("Cursor table mismatch"));
    }
    cursor.last_id
} else {
    0
};
let query = format!(
"SELECT * FROM {} WHERE id > {} ORDER BY id LIMIT {}",
input.table, start_id, input.page_size + 1
);
}

Hint 3

Complete implementation with has_more detection:

#![allow(unused)]
fn main() {
async fn paginated_query(pool: &DbPool, input: PaginatedQueryInput) -> Result<PaginatedResult> {
    // Validate table
    if !ALLOWED_TABLES.contains(&input.table.as_str()) {
        return Err(anyhow::anyhow!("Table '{}' not allowed", input.table));
    }
// Limit page size
let page_size = input.page_size.min(100);

// Decode cursor
let start_id = match &amp;input.cursor {
    Some(c) =&gt; {
        let cursor = Cursor::decode(c)?;
        if cursor.table != input.table {
            return Err(anyhow::anyhow!("Cursor was for different table"));
        }
        cursor.last_id
    }
    None =&gt; 0,
};

// Build and execute query - fetch N+1 to detect more pages
let query = format!(
    "SELECT * FROM {} WHERE id &gt; {} ORDER BY id LIMIT {}",
    input.table, start_id, page_size + 1
);

let rows = sqlx::query(&amp;query)
    .fetch_all(pool.as_ref())
    .await?;

// Check for more results
let has_more = rows.len() &gt; page_size as usize;
let rows: Vec&lt;_&gt; = rows.into_iter().take(page_size as usize).collect();

// Build next_cursor if more pages exist...
}

}

⚠️ Try the exercise first!Show Solution

#![allow(unused)]
fn main() {
async fn paginated_query(pool: &DbPool, input: PaginatedQueryInput) -> Result<PaginatedResult> {
    // Validate table is in allowlist
    if !ALLOWED_TABLES.contains(&input.table.as_str()) {
        return Err(anyhow::anyhow!("Table '{}' not in allowlist", input.table));
    }
// Limit page size to max 100
let page_size = input.page_size.min(100).max(1);

// Decode cursor if provided
let start_id = match &amp;input.cursor {
    Some(cursor_str) =&gt; {
        let cursor = Cursor::decode(cursor_str)?;
        // Validate cursor is for same table (security check)
        if cursor.table != input.table {
            return Err(anyhow::anyhow!(
                &quot;Cursor was created for table &#x27;{}&#x27;, not &#x27;{}&#x27;&quot;,
                cursor.table, input.table
            ));
        }
        cursor.last_id
    }
    None =&gt; 0,
};

// Build query - fetch page_size + 1 to detect if more pages exist
let query = format!(
    &quot;SELECT * FROM {} WHERE id &gt; ? ORDER BY id LIMIT ?&quot;,
    input.table
);

let all_rows = sqlx::query(&amp;query)
    .bind(start_id)
    .bind(page_size + 1)
    .fetch_all(pool.as_ref())
    .await?;

// Determine if there are more results
let has_more = all_rows.len() &gt; page_size as usize;
let rows: Vec&lt;_&gt; = all_rows.into_iter().take(page_size as usize).collect();

// Extract column names
let columns: Vec&lt;String&gt; = if let Some(first_row) = rows.first() {
    first_row.columns().iter().map(|c| c.name().to_string()).collect()
} else {
    vec![]
};

// Convert rows to JSON values
let row_data: Vec&lt;Vec&lt;serde_json::Value&gt;&gt; = rows.iter().map(|row| {
    columns.iter().enumerate().map(|(i, _)| {
        // Try to get as different types
        if let Ok(v) = row.try_get::&lt;i64, _&gt;(i) {
            serde_json::Value::Number(v.into())
        } else if let Ok(v) = row.try_get::&lt;String, _&gt;(i) {
            serde_json::Value::String(v)
        } else {
            serde_json::Value::Null
        }
    }).collect()
}).collect();

// Get last ID for cursor
let last_id = row_data.last()
    .and_then(|row| row.first())
    .and_then(|v| v.as_i64());

// Create next cursor if more data exists
let next_cursor = if has_more {
    last_id.map(|id| Cursor {
        last_id: id,
        table: input.table.clone(),
    }.encode())
} else {
    None
};

// Human-readable status for AI
let status = if has_more {
    format!(
        &quot;Showing {} rows. More data available - pass next_cursor to continue.&quot;,
        row_data.len()
    )
} else {
    format!(&quot;Showing {} rows. This is all available data.&quot;, row_data.len())
};

Ok(PaginatedResult {
    columns,
    rows: row_data,
    count: row_data.len(),
    next_cursor,
    status,
})
}

}
// Key patterns demonstrated:
// 1. Opaque Cursors - base64 JSON hides implementation details
// 2. Fetch N+1 Pattern - efficiently detect more pages without COUNT
// 3. Table Validation in Cursor - prevent cursor reuse attacks
// 4. Human-Readable Status - helps AI understand pagination state

🧪 Tests

Run these tests locally with:

cargo test

View Test Code

#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    use super::*;
#[tokio::test]
async fn test_first_page() {
    // First page should return results and a next_cursor
}

#[tokio::test]
async fn test_continue_with_cursor() {
    // Second page should have no overlap with first
}

#[tokio::test]
async fn test_last_page() {
    // Final page should have no next_cursor
}

#[tokio::test]
async fn test_invalid_table() {
    // Tables not in allowlist should error
}

#[tokio::test]
async fn test_cursor_table_mismatch() {
    // Cursor from table A shouldn&#x27;t work for table B
}
}

}

🤔 Reflection

Why do we include the table name in the cursor?
What would happen if rows were deleted between page fetches?
How would you support sorting by a non-unique column?
Why is the cursor base64-encoded JSON instead of just an ID?

Beyond Tool Sprawl

You've built your first MCP servers. They work. Tools respond, resources load, tests pass. But working code isn't the same as well-designed code—especially in the MCP ecosystem.

This chapter challenges a dangerous assumption: that converting an existing API to MCP tools is sufficient. It's not. MCP operates in a fundamentally different environment than traditional APIs, and understanding this difference is critical to building servers that actually succeed in production.

The MCP Environment Is Not What You Think

When you build a REST API, you control:

Which endpoints exist
How clients authenticate
The order operations are called
Error handling and retries
Rate limiting and quotas

When you build an MCP server, you control almost none of this.

You Don't Control Other Servers

Your MCP server isn't alone. The MCP client (Claude Desktop, Cursor, ChatGPT, or a custom application) may have multiple servers connected simultaneously:

┌─────────────────────────────────────────────────────────────┐
│                      MCP Client                             │
│                   (Claude Desktop)                          │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐       │
│  │ Your Server  │  │ Google Drive │  │ Asana        │       │
│  │ (db-explorer)│  │ Server       │  │ Server       │       │
│  │              │  │              │  │              │       │
│  │ • query_db   │  │ • get_doc    │  │ • get_task   │       │
│  │ • list_tables│  │ • create_doc │  │ • create_task│       │
│  │ • get_schema │  │ • list_docs  │  │ • list_tasks │       │
│  └──────────────┘  └──────────────┘  └──────────────┘       │
│                                                             │
│  The AI sees ALL tools from ALL servers simultaneously      │
└─────────────────────────────────────────────────────────────┘

If your db-explorer server has a tool called list, and another server also has list, you've created ambiguity. The AI must choose between them based on descriptions alone. Poor naming, vague descriptions, or overlapping functionality leads to unpredictable behavior.

You Don't Control the Client

The MCP client—typically an AI model—decides:

Which tools to call: Based on the user's request and tool descriptions
In what order: The AI determines the sequence of operations
With what parameters: The AI constructs the arguments
How many times: The AI may retry, iterate, or abandon

You cannot force the AI to call your tools in a specific order. You cannot prevent it from calling tools you didn't intend for a particular workflow. You cannot guarantee it will use the "right" tool for a task.

User: "Show me the sales data"

AI's internal reasoning (you don't see this):
- Found 3 potential tools: query_db, get_report, fetch_data
- query_db description mentions "SQL queries"
- get_report description mentions "sales reports"
- fetch_data description is vague: "fetches data"
- Choosing: get_report (best match for "sales")

What if get_report is from a DIFFERENT server than you expected?

The User Has Some Control (But Not You)

Modern MCP clients like Claude Desktop and ChatGPT provide users with control mechanisms:

Server Selection: Users can enable/disable MCP servers per conversation:

"Use only the database server for this task"
"Don't use the Asana server right now"

Prompt Templates: Users can invoke pre-defined prompts that guide the AI:

/analyze-schema - A prompt that structures how schema analysis should proceed
/generate-report - A prompt that defines report generation workflow

But notice: the user has this control, not you as the developer. Your job is to design servers that work well regardless of what other servers are connected, and to provide prompts that give users meaningful control over workflows.

What You Actually Control

As an MCP server developer, your influence is limited to three things:

1. Tool Design

How you name, describe, and structure your tools determines whether the AI will use them correctly:

#![allow(unused)]
fn main() {
// Poor design: vague, overlapping with common names
Tool::new("get")
    .description("Gets data")

// Better design: specific, clear purpose
Tool::new("query_sales_database")
    .description("Execute read-only SQL queries against the sales PostgreSQL database. Returns results as JSON. Use for retrieving sales records, customer data, and transaction history.")
}

2. Resource Design

How you expose data as resources affects discoverability and appropriate usage:

#![allow(unused)]
fn main() {
// Resources are for stable, addressable data
Resource::new("sales://schema/customers")
    .description("Customer table schema including all columns and constraints")
    .mime_type("application/json")
}

3. Prompt Design

Prompts are your most powerful tool for guiding complex workflows:

#![allow(unused)]
fn main() {
// Prompts give users control over multi-step operations
Prompt::new("analyze-sales-trend")
    .description("Analyze sales trends over a specified period")
    .arguments(vec![
        PromptArgument::new("period").description("Time period: daily, weekly, monthly"),
        PromptArgument::new("metric").description("Metric to analyze: revenue, units, customers"),
    ])
}

The Design Imperative

This chapter covers three critical design principles:

Avoid Anti-Patterns: Why "50 confusing tools" fails and what to do instead
Design for Cohesion: How to create tool sets that work together naturally
Single Responsibility: Why each tool should do one thing well

These principles aren't academic—they determine whether your MCP server will be reliably selected and correctly used by AI clients in a multi-server environment.

Let's start by examining what goes wrong when these principles are ignored.

The Anti-Pattern: 50 Confusing Tools

The most common mistake when building MCP servers is treating them like REST APIs. "We have 47 endpoints, so we'll create 47 tools." This approach fails spectacularly in the MCP environment.

The API Conversion Trap

Consider a typical e-commerce API:

POST   /api/products              # Create product
GET    /api/products              # List products
GET    /api/products/{id}         # Get product
PUT    /api/products/{id}         # Update product
DELETE /api/products/{id}         # Delete product
POST   /api/products/{id}/images  # Add image
DELETE /api/products/{id}/images/{img_id}  # Remove image
GET    /api/products/{id}/reviews # Get reviews
POST   /api/products/{id}/reviews # Add review
PUT    /api/products/{id}/inventory # Update inventory
GET    /api/categories            # List categories
POST   /api/categories            # Create category
# ... 35 more endpoints

The naive approach converts each endpoint to a tool:

#![allow(unused)]
fn main() {
// DON'T DO THIS
let tools = vec![
    Tool::new("create_product"),
    Tool::new("list_products"),
    Tool::new("get_product"),
    Tool::new("update_product"),
    Tool::new("delete_product"),
    Tool::new("add_product_image"),
    Tool::new("remove_product_image"),
    Tool::new("get_product_reviews"),
    Tool::new("add_product_review"),
    Tool::new("update_inventory"),
    Tool::new("list_categories"),
    Tool::new("create_category"),
    // ... 35 more tools
];
}

This creates a nightmare for AI clients.

Why This Fails

Problem 1: Tool Selection Overload

When an AI sees 47 tools, it must evaluate each one against the user's request. The cognitive load increases non-linearly:

User: "Add a new laptop to the store"

AI must consider:
- create_product? (probably)
- add_product_image? (maybe needed after?)
- update_inventory? (should set initial stock?)
- list_categories? (need to find Electronics category first?)
- create_category? (if Electronics doesn't exist?)

With 47 tools, the AI might:
- Choose the wrong tool
- Call tools in a suboptimal order
- Miss required steps
- Get confused and ask for clarification

Problem 2: Name Collisions

Your 47 tools don't exist in isolation. Other MCP servers connected to the same client may have similar names:

Your server:                  Asana server:            Google Drive server:
- create_product             - create_task            - create_document
- update_product             - update_task            - update_document
- delete_product             - delete_task            - delete_document
- list_products              - list_tasks             - list_documents
- get_product                - get_task               - get_document

A business user might have your e-commerce server connected alongside their project management (Asana, Notion) and document storage (Google Drive, SharePoint). The AI sees a sea of create_*, update_*, delete_*, list_*, get_* tools. Without excellent descriptions, it will make mistakes.

Problem 3: Implicit Workflows Hidden

APIs encode workflows implicitly through endpoint sequences. MCP tools are independent—there's no built-in way to say "call A, then B, then C":

REST workflow (implicit in client code):
1. POST /api/products → get product_id
2. POST /api/products/{id}/images → attach image
3. PUT /api/products/{id}/inventory → set stock

MCP reality:
- AI sees 3 independent tools
- No indication they should be called together
- User must know to request all three steps
- Or AI must infer the workflow (unreliable)

Problem 4: Description Burden

Each of your 47 tools needs a description good enough for an AI to understand when to use it. Most API endpoints don't have descriptions written for this purpose:

#![allow(unused)]
fn main() {
// Typical API-converted tool (inadequate)
Tool::new("update_inventory")
    .description("Updates inventory")  // Useless for AI decision-making

// What the AI actually needs
Tool::new("update_product_stock_level")
    .description(
        "Set the available quantity for a product in the inventory system. \
        Use this after creating a new product or when restocking. \
        Requires product_id and quantity. Quantity must be non-negative. \
        Returns the updated inventory record with last_modified timestamp."
    )
}

Writing 47 descriptions of this quality is significant work—and maintaining them as the API evolves is even harder.

Real-World Consequences

Case Study: The 73-Tool Disaster

A team converted their entire internal API to MCP tools: 73 tools covering user management, billing, reporting, and admin functions. Results:

AI accuracy dropped to 34% for multi-step tasks
Response latency increased 5x as the AI evaluated all 73 tools
Support tickets tripled as users got unexpected results
Rollback within 2 weeks to a 12-tool focused design

Case Study: The Naming Collision

A database tool server used query as a tool name. When connected alongside a logging server (which also had query), the AI would randomly choose between them based on subtle description differences. Users reported "sometimes it queries the database, sometimes it searches logs, I can't predict which."

The Better Approach: Purposeful Design

Instead of converting APIs to tools 1:1, design for how AI clients actually work:

1. Focus on User Tasks, Not API Operations

#![allow(unused)]
fn main() {
// Instead of 7 product CRUD tools, one task-focused tool:
Tool::new("manage_product_catalog")
    .description(
        "Create, update, or manage products in the catalog. \
        Handles product details, images, categories, and initial inventory. \
        Provide the operation type and relevant product data."
    )
    .input_schema(json!({
        "type": "object",
        "properties": {
            "operation": {
                "type": "string",
                "enum": ["create", "update", "add_image", "set_category", "discontinue"]
            },
            "product": {
                "type": "object",
                "properties": {
                    "id": { "type": "string" },
                    "name": { "type": "string" },
                    "description": { "type": "string" },
                    "price": { "type": "number" },
                    "category": { "type": "string" },
                    "initial_stock": { "type": "integer" }
                }
            }
        }
    }))
}

2. Use Prompts for Workflows

Instead of hoping the AI calls tools in the right order, define workflows as prompts:

#![allow(unused)]
fn main() {
Prompt::new("add-new-product")
    .description("Complete workflow to add a new product with images and inventory")
    .template(
        "I'll help you add a new product to the catalog. \
        This will:\n\
        1. Create the product with basic details\n\
        2. Upload any product images\n\
        3. Set initial inventory levels\n\
        4. Assign to appropriate categories\n\n\
        Please provide the product details..."
    )
}

3. Use Resources for Reference Data

Instead of list_categories and get_category tools, expose categories as resources:

#![allow(unused)]
fn main() {
Resource::new("catalog://categories")
    .description("All product categories with IDs and hierarchy")
    .mime_type("application/json")
}

The AI can read this resource to understand available categories without making a tool call.

Summary: From API to MCP

API Thinking	MCP Thinking
One endpoint = one tool	One user task = one tool
CRUD operations	High-level actions
Client controls workflow	Prompts guide workflow
Endpoints are independent	Tools designed for multi-server environment
Minimal descriptions	AI-decision-quality descriptions
47 endpoints → 47 tools	47 endpoints → 8-12 focused tools + prompts + resources

The next section covers how to design tool sets that are cohesive—tools that work together naturally and are easily distinguished by AI clients.

Cohesive API Design

Cohesion in MCP server design means your tools, resources, and prompts form a unified, understandable whole—both for AI clients that must choose between them and for users who need predictable behavior.

The Multi-Server Reality

Your MCP server operates in an environment you don't control. Consider what an AI client sees when a user has multiple servers connected:

Connected MCP Servers (typical business user setup):

1. google-drive-server (document storage)
   - create_document, update_document, delete_document,
   - list_documents, search_documents, share_document

2. asana-server (task management)
   - create_task, update_task, delete_task, list_tasks,
   - create_project, assign_task, set_due_date

3. salesforce-server (CRM)
   - query_accounts, update_opportunity, list_contacts, log_activity

4. your-server (you're building this)
   - ???

Total tools visible to AI: 20+ (and growing)

Your server's tools must be instantly distinguishable in this crowded environment.

Principles of Cohesive Design

1. Domain Prefixing

Prefix tool names with your domain to avoid collisions:

#![allow(unused)]
fn main() {
// Collision risk: generic names
Tool::new("query")           // Collides with postgres-server
Tool::new("search")          // Collides with Google Drive search_documents
Tool::new("list")            // Collides with everything

// Cohesive: domain-specific names
Tool::new("sales_query")     // Clearly your sales system
Tool::new("sales_report")    // Consistent prefix
Tool::new("sales_forecast")  // AI understands these are related
}

The AI can now reason: "The user asked about sales, I'll use the sales_* tools."

2. Consistent Verb Patterns

Choose a verb convention and stick to it across all tools:

#![allow(unused)]
fn main() {
// Inconsistent verbs (confusing)
Tool::new("get_customer")       // "get"
Tool::new("fetch_orders")       // "fetch" - same meaning, different word
Tool::new("retrieve_products")  // "retrieve" - yet another synonym
Tool::new("load_inventory")     // "load" - and another

// Consistent verbs (cohesive)
Tool::new("get_customer")
Tool::new("get_orders")
Tool::new("get_products")
Tool::new("get_inventory")
}

Consistent patterns help the AI predict tool names and understand tool relationships.

3. Hierarchical Organization

Structure tools to reflect their relationships:

#![allow(unused)]
fn main() {
// Flat structure (hard to understand relationships)
vec![
    Tool::new("create_order"),
    Tool::new("add_item"),
    Tool::new("remove_item"),
    Tool::new("apply_discount"),
    Tool::new("calculate_total"),
    Tool::new("submit_order"),
    Tool::new("cancel_order"),
]

// Hierarchical structure (clear relationships)
// Order lifecycle tools
Tool::new("order_create")
    .description("Create a new order. Returns order_id for subsequent operations.")

Tool::new("order_modify")
    .description("Add items, remove items, or apply discounts to an existing order.")
    .input_schema(json!({
        "properties": {
            "order_id": { "type": "string" },
            "action": {
                "type": "string",
                "enum": ["add_item", "remove_item", "apply_discount"]
            }
        }
    }))

Tool::new("order_finalize")
    .description("Calculate totals and submit the order, or cancel it.")
    .input_schema(json!({
        "properties": {
            "order_id": { "type": "string" },
            "action": {
                "type": "string",
                "enum": ["submit", "cancel"]
            }
        }
    }))
}

Three tools instead of seven, with clear lifecycle stages.

Designing for AI Understanding

Description Templates

Use consistent description structures across all tools:

#![allow(unused)]
fn main() {
// Template: What it does | When to use it | What it returns

Tool::new("sales_query")
    .description(
        "Execute SQL queries against the sales database. \
        Use for retrieving sales records, revenue data, and transaction history. \
        Returns query results as JSON array of records."
    )

Tool::new("sales_report")
    .description(
        "Generate formatted sales reports for a date range. \
        Use when the user needs summaries, trends, or printable reports. \
        Returns report data with totals, averages, and visualizable metrics."
    )

Tool::new("sales_forecast")
    .description(
        "Predict future sales based on historical data. \
        Use when the user asks about projections, predictions, or planning. \
        Returns forecast data with confidence intervals."
    )
}

The AI can now distinguish:

Raw data needs → sales_query
Summaries/reports → sales_report
Future predictions → sales_forecast

Negative Descriptions

Sometimes it helps to say what a tool is not for:

#![allow(unused)]
fn main() {
Tool::new("sales_query")
    .description(
        "Execute read-only SQL queries against the sales database. \
        Use for retrieving sales records and transaction history. \
        \
        NOTE: This tool CANNOT modify data. For updates, use sales_admin. \
        NOTE: For reports and summaries, use sales_report instead (faster)."
    )
}

Output Consistency

Tools in the same domain should return consistent structures:

#![allow(unused)]
fn main() {
// All sales tools return a consistent envelope
{
    "success": true,
    "data": { /* tool-specific data */ },
    "metadata": {
        "query_time_ms": 45,
        "source": "sales_db_replica",
        "cached": false
    }
}
}

This helps the AI chain tools together—it knows what to expect.

Cohesion Across Tool-Resource-Prompt

True cohesion spans all three MCP primitives:

#![allow(unused)]
fn main() {
// TOOLS: Actions on the sales domain
Tool::new("sales_query")
Tool::new("sales_report")
Tool::new("sales_forecast")

// RESOURCES: Reference data for sales operations
Resource::new("sales://schema")
    .description("Sales database schema - tables, columns, relationships")
Resource::new("sales://regions")
    .description("List of sales regions with IDs and territories")
Resource::new("sales://products")
    .description("Product catalog with IDs, names, and categories")

// PROMPTS: Guided workflows combining tools and resources
Prompt::new("quarterly-sales-analysis")
    .description("Comprehensive quarterly sales analysis with trends and forecasts")
Prompt::new("sales-territory-review")
    .description("Review sales performance by territory with recommendations")
}

The AI sees a complete, cohesive sales domain:

Resources provide context (what data exists)
Tools provide actions (what can be done)
Prompts provide workflows (how to accomplish complex tasks)

Testing Cohesion

The "50 Tools" Test

List all tools from your server plus common business servers (Google Drive, Asana, Salesforce). Can an AI easily distinguish yours?

google-drive: create_document, update_document, list_documents
asana: create_task, update_task, list_tasks
salesforce: query_accounts, update_opportunity, list_contacts
your-server: ???

If your tools are "query", "list", "get" - FAIL
If your tools are "sales_query", "sales_report", "sales_forecast" - PASS

The "Explain It" Test

Describe your server to a colleague in one sentence. If you can't, your tools aren't cohesive.

FAIL: "It queries databases, generates reports, and also manages inventory
       and does some customer stuff"

PASS: "It provides sales analytics - querying historical data, generating
       reports, and forecasting future sales"

The "New Tool" Test

When you add a new tool, does its name and description obviously fit with existing tools?

Existing: sales_query, sales_report, sales_forecast

Adding customer support?
FAIL: support_ticket, help_request  (different domain)
PASS: Create a new server for customer support

Adding sales alerts?
PASS: sales_alert_create, sales_alert_list (same domain, consistent naming)

Advanced: Foundation and Domain Servers

As your organization scales MCP adoption, cohesion becomes even more critical. In Part VIII: Server Composition, we explore a powerful pattern: Foundation Servers wrapped by Domain Servers.

The Pattern

Instead of building monolithic servers or having every team create their own database tools, you create a layered architecture:

┌─────────────────────────────────────────────────────────────┐
│                    Business Users                            │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  ┌──────────────────┐  ┌──────────────────┐                 │
│  │  Sales Manager   │  │  Finance Manager │  Domain Servers │
│  │  Domain Server   │  │  Domain Server   │  (department-   │
│  │                  │  │                  │   specific)     │
│  │ • pipeline_view  │  │ • budget_check   │                 │
│  │ • territory_perf │  │ • expense_report │                 │
│  │ • forecast_q4    │  │ • revenue_audit  │                 │
│  └────────┬─────────┘  └────────┬─────────┘                 │
│           │                     │                            │
│           └──────────┬──────────┘                            │
│                      │                                       │
│           ┌──────────▼──────────┐                            │
│           │   Foundation Server │  Foundation Server         │
│           │   (db-explorer)     │  (general-purpose)         │
│           │                     │                            │
│           │   • db_query        │                            │
│           │   • db_schema       │                            │
│           │   • db_export       │                            │
│           └─────────────────────┘                            │
│                                                              │
└─────────────────────────────────────────────────────────────┘

Why This Matters for Cohesion

Foundation Servers are general-purpose, reusable across the organization:

db-explorer: Generic database access
file-manager: Document and file operations
api-gateway: External API integrations

Domain Servers wrap foundations with business-specific cohesion:

Focused on one department's workflows
Pre-configured with relevant schemas and permissions
Include prompts tailored to that department's tasks
Hide complexity that's irrelevant to those users

#![allow(unused)]
fn main() {
// Sales Manager Domain Server
// Wraps db-explorer but exposes only sales-relevant operations

Tool::new("pipeline_view")
    .description("View sales pipeline with deal stages and probabilities")
    // Internally calls db_query with pre-built sales pipeline query

Tool::new("territory_performance")
    .description("Compare territory performance against targets")
    // Internally calls db_query + db_export for territory reports

Prompt::new("weekly-forecast")
    .description("Generate weekly sales forecast for your territories")
    // Guides the manager through a structured forecasting workflow
}

Benefits

User-Appropriate Cohesion: Sales managers see sales tools, not raw SQL
Controlled Access: Domain servers enforce what each role can access
Maintainability: Update the foundation; all domain servers benefit
Reduced Tool Sprawl: Each user sees only 5-10 relevant tools, not 50

When to Use This Pattern

Multiple departments need different views of the same data
You want to control what each role can access
Business users shouldn't need to understand database schemas
You're scaling from one team to organization-wide MCP adoption

We cover this pattern in depth in Chapter 19: Server Composition, including implementation details, authentication flows, and real-world examples.

Summary

Cohesive design makes your MCP server:

Distinguishable: AI easily identifies your tools among many servers
Predictable: Users know what to expect from your domain
Maintainable: New tools fit naturally into existing patterns

The key insight: design for the multi-server environment from the start. Your tools don't exist in isolation—they compete for the AI's attention alongside dozens of other tools.

Next, we'll examine the single responsibility principle—why each tool should do one thing well.

Single Responsibility for Tools

The single responsibility principle for MCP tools isn't about code organization—it's about AI comprehension. A tool that does one thing well is a tool that gets used correctly.

The Problem with Multi-Purpose Tools

Consider this "swiss army knife" tool:

#![allow(unused)]
fn main() {
Tool::new("data_operation")
    .description("Perform data operations - query, insert, update, delete, export, import, validate, transform")
    .input_schema(json!({
        "properties": {
            "operation": {
                "type": "string",
                "enum": ["query", "insert", "update", "delete", "export", "import", "validate", "transform"]
            },
            "table": { "type": "string" },
            "data": { "type": "object" },
            "format": { "type": "string" },
            "options": { "type": "object" }
        }
    }))
}

What's wrong with this design?

1. AI Decision Paralysis

The AI must understand 8 different behaviors from one tool. When a user says "get me the sales data," the AI must reason:

User: "get me the sales data"

AI reasoning about data_operation:
- Is this a "query" operation?
- Or should I "export" to get the data?
- What's the difference between query and export here?
- The description doesn't clarify...
- Maybe I should ask the user?

2. Parameter Confusion

Different operations need different parameters, but they share one schema:

#![allow(unused)]
fn main() {
// For "query": table and maybe some filter options
// For "insert": table and data object
// For "export": table and format
// For "transform": data and transformation options

// All crammed into one ambiguous schema
{
    "table": "???",     // Required for some, ignored by others
    "data": "???",      // Sometimes input, sometimes not
    "format": "???",    // Only for export
    "options": "???"    // Means different things per operation
}
}

3. Error Messages Are Vague

When something goes wrong, what failed?

{
    "error": "Invalid parameters for data_operation"
}

Did the query syntax fail? The data format? The export path? The tool is too broad to give useful feedback.

Single Responsibility Refactoring

Split the swiss army knife into focused tools:

#![allow(unused)]
fn main() {
// READ operations
Tool::new("db_query")
    .description(
        "Execute read-only SQL queries. \
        Use for retrieving data from any table. \
        Returns results as JSON array."
    )
    .input_schema(json!({
        "required": ["sql"],
        "properties": {
            "sql": { "type": "string" },
            "limit": { "type": "integer", "default": 100 }
        }
    }))

// WRITE operations (separate from read for safety)
Tool::new("db_modify")
    .description(
        "Insert, update, or delete records. \
        Use when the user explicitly requests data changes. \
        Returns affected row count."
    )
    .input_schema(json!({
        "required": ["operation", "table"],
        "properties": {
            "operation": { "enum": ["insert", "update", "delete"] },
            "table": { "type": "string" },
            "data": { "type": "object" },
            "where": { "type": "string" }
        }
    }))

// EXPORT operations
Tool::new("db_export")
    .description(
        "Export table data to file formats (CSV, JSON, Parquet). \
        Use when user needs to download or share data. \
        Returns file path or download URL."
    )
    .input_schema(json!({
        "required": ["table", "format"],
        "properties": {
            "table": { "type": "string" },
            "format": { "enum": ["csv", "json", "parquet"] },
            "filter": { "type": "string" }
        }
    }))

// VALIDATION operations
Tool::new("db_validate")
    .description(
        "Check data integrity and validate against schemas. \
        Use before imports or to diagnose data issues. \
        Returns validation report."
    )
}

Now the AI's job is clear:

User wants data? → db_query
User wants to change data? → db_modify
User wants a file? → db_export
User wants to check data? → db_validate

Helping AI Generate Correct SQL

When your tool accepts SQL queries, the AI must generate syntactically correct SQL for your specific database. Different databases have vastly different SQL dialects:

Database	Date Literal	String Concat	Window Functions	JSON Access
PostgreSQL	`'2024-01-15'::date`	`		`
MySQL	`STR_TO_DATE('2024-01-15', '%Y-%m-%d')`	`CONCAT()`	MySQL 8+ only	`JSON_EXTRACT()`
Oracle	`TO_DATE('2024-01-15', 'YYYY-MM-DD')`	`		`
Amazon Athena	`DATE '2024-01-15'`	`CONCAT()`	Full support	`json_extract_scalar()`
SQLite	`'2024-01-15'`	`		`

Always specify the database flavor in your tool description:

#![allow(unused)]
fn main() {
// POOR: AI doesn't know which SQL dialect to use
Tool::new("db_query")
    .description(
        "Execute read-only SQL queries. \
        Returns results as JSON array."
    )

// BETTER: AI knows the exact database engine
Tool::new("db_query")
    .description(
        "Execute read-only SQL queries against PostgreSQL 15. \
        Supports all PostgreSQL features including WINDOW functions, \
        CTEs, LATERAL joins, and JSON operators (->>, @>). \
        Use PostgreSQL-specific date functions (DATE_TRUNC, EXTRACT). \
        Returns results as JSON array."
    )

// FOR ATHENA: Specify Presto/Trino SQL dialect
Tool::new("athena_query")
    .description(
        "Execute read-only queries against Amazon Athena (Trino SQL). \
        Use Presto SQL syntax: CONCAT() for strings, DATE '2024-01-15' \
        for date literals, json_extract_scalar() for JSON. \
        Supports WINDOW functions and CTEs. \
        Returns results as JSON array with max 1000 rows."
    )
}

Why This Matters

When a user asks "show me sales by month for 2024," the AI must generate SQL:

Without dialect information:

-- AI might generate generic SQL that fails
SELECT MONTH(sale_date), SUM(amount)
FROM sales
WHERE YEAR(sale_date) = 2024
GROUP BY MONTH(sale_date)
-- Fails on PostgreSQL: MONTH() doesn't exist

With PostgreSQL specified:

-- AI generates PostgreSQL-correct SQL
SELECT DATE_TRUNC('month', sale_date) AS month, SUM(amount)
FROM sales
WHERE sale_date >= '2024-01-01' AND sale_date < '2025-01-01'
GROUP BY DATE_TRUNC('month', sale_date)
ORDER BY month

With Amazon Athena specified:

-- AI generates Athena/Presto-correct SQL
SELECT DATE_TRUNC('month', sale_date) AS month, SUM(amount)
FROM sales
WHERE sale_date >= DATE '2024-01-01' AND sale_date < DATE '2025-01-01'
GROUP BY DATE_TRUNC('month', sale_date)
ORDER BY month

Include Capability Hints

Beyond the engine name, mention key capabilities the AI can leverage:

#![allow(unused)]
fn main() {
Tool::new("analytics_query")
    .description(
        "Execute analytical queries against ClickHouse. \
        Optimized for aggregations over large datasets. \
        Supports: WINDOW functions, Array functions (arrayJoin, groupArray), \
        approximate functions (uniq, quantile), sampling (SAMPLE 0.1). \
        Use ClickHouse date functions: toStartOfMonth(), toYear(). \
        Column-oriented: SELECT only columns you need for best performance."
    )
}

This enables the AI to use advanced features when appropriate:

-- AI can leverage ClickHouse-specific features
SELECT
    toStartOfMonth(sale_date) AS month,
    uniq(customer_id) AS unique_customers,  -- Approximate count, very fast
    quantile(0.95)(amount) AS p95_amount    -- 95th percentile
FROM sales
WHERE sale_date >= '2024-01-01'
GROUP BY month
ORDER BY month

Database Version Matters

Different versions have different capabilities:

#![allow(unused)]
fn main() {
// MySQL 5.7 - limited window function support
Tool::new("legacy_query")
    .description(
        "Query against MySQL 5.7. \
        Note: WINDOW functions not supported. \
        Use subqueries or temporary tables for ranking/running totals."
    )

// MySQL 8.0 - full modern SQL support
Tool::new("modern_query")
    .description(
        "Query against MySQL 8.0. \
        Full WINDOW function support (ROW_NUMBER, RANK, LAG/LEAD). \
        Supports CTEs (WITH clause) and JSON_TABLE()."
    )
}

The "One Sentence" Rule

If you can't describe what a tool does in one clear sentence, it's doing too much:

#![allow(unused)]
fn main() {
// FAIL: Multiple responsibilities
"Perform data operations - query, insert, update, delete, export, import, validate, transform"

// PASS: Single responsibility
"Execute read-only SQL queries against the database"
"Export table data to file formats"
"Validate data integrity against schemas"
}

Balancing Granularity

Single responsibility doesn't mean creating hundreds of micro-tools. Find the right level of abstraction:

Too Granular (tool explosion)

#![allow(unused)]
fn main() {
Tool::new("select_from_customers")
Tool::new("select_from_orders")
Tool::new("select_from_products")
Tool::new("select_with_where")
Tool::new("select_with_join")
Tool::new("select_with_group_by")
// 50 more query variations...
}

Too Coarse (swiss army knife)

#![allow(unused)]
fn main() {
Tool::new("database")  // Does everything database-related
}

Just Right (task-oriented)

#![allow(unused)]
fn main() {
Tool::new("db_query")      // Read data with SQL
Tool::new("db_schema")     // Explore table structures
Tool::new("db_export")     // Export to files
Tool::new("db_admin")      // Administrative operations (with appropriate guards)
}

Responsibility and Safety

Single responsibility also enables better safety controls:

#![allow(unused)]
fn main() {
// Read operations: safe, can be used freely
Tool::new("db_query")
    .description("Read-only queries - safe for exploration")

// Write operations: need confirmation
Tool::new("db_modify")
    .description("Modifies data - AI should confirm with user before destructive operations")

// Admin operations: restricted
Tool::new("db_admin")
    .description("Administrative operations - requires explicit user authorization")
    .annotations(json!({
        "requires_confirmation": true,
        "risk_level": "high"
    }))
}

With separate tools, you can apply different security policies to each.

The Composition Principle

Single-responsibility tools compose better than multi-purpose tools:

#![allow(unused)]
fn main() {
// Multi-purpose tools can't be combined
Tool::new("analyze_and_report")  // Does analysis AND reporting
// What if user wants analysis without report? Too bad.

// Single-purpose tools compose flexibly
Tool::new("db_query")           // Get the data
Tool::new("data_analyze")       // Analyze it
Tool::new("report_generate")    // Create report

// AI can now:
// - Query without analysis
// - Analyze without report
// - Query, analyze, AND report
// - Any combination the user needs
}

Testing Single Responsibility

The "What If" Test

For each tool, ask: "What if the user only wants part of what this tool does?"

#![allow(unused)]
fn main() {
// FAIL: Can't partially use
Tool::new("fetch_and_format_data")
// What if user wants raw data without formatting?

// PASS: Separable concerns
Tool::new("fetch_data")
Tool::new("format_data")
}

The "Who Cares" Test

For each operation in a tool, ask: "Would a different user care about just this operation?"

#![allow(unused)]
fn main() {
// In "data_operation":
// - query: Data analysts care about this
// - insert: Application developers care about this
// - export: Business users care about this
// - validate: Data engineers care about this

// Different audiences = different tools
}

The "Change Impact" Test

If the tool's behavior needs to change, how much else breaks?

#![allow(unused)]
fn main() {
// Multi-purpose: changing export format affects everything
Tool::new("data_operation")  // Export format change touches all code paths

// Single-purpose: changes are isolated
Tool::new("db_export")  // Only export code needs to change
}

Summary

Single responsibility for MCP tools means:

Principle	Benefit
One clear purpose per tool	AI selects correctly
Focused parameter schemas	Less confusion, better errors
Separable concerns	Users get exactly what they need
Composable operations	Flexible workflows
Isolated safety controls	Appropriate permissions per operation

Remember: you're not writing code for other developers. You're writing tools for AI clients that must choose correctly from dozens of options. Make their job easy.

Chapter 4 Exercises

These exercises will help you practice designing cohesive, well-structured MCP tool sets.

Quiz

Test your understanding of the design principles covered in this chapter:

Exercises

Tool Design Review ⭐⭐ Intermediate (30 min)
- Review a poorly designed MCP server
- Identify anti-patterns and propose improvements
- Apply domain prefixing and single responsibility

Key Concepts to Practice

Domain Prefixing: Use sales_, customer_, order_ prefixes to avoid collisions
Single Responsibility: Each tool does one thing well
The 50 Tools Test: Would your tools be distinguishable in a crowded environment?
The One Sentence Rule: Can you describe each tool in one clear sentence?

Next Steps

After completing these exercises, continue to:

Input Validation and Output Schemas - Make your tools AI-friendly
Resources, Prompts, and Workflows - Beyond just tools

Exercise: Tool Design Review

ch04-01-tool-design-review

⭐⭐ intermediate ⏱️ 30 min

A startup has asked you to review their MCP server design before they deploy it to production. Their server started as a direct conversion of their REST API, and they're concerned about usability.

Your task is to identify the design problems and propose a refactored design that follows the principles from this chapter.

🎯 Learning Objectives

Thinking

Recognize anti-patterns in tool design (API-to-MCP trap, swiss army knife)
Apply cohesive naming with domain prefixes
Understand single responsibility for tools
Design for the multi-server environment

Doing

Analyze a problematic tool set and identify issues
Refactor tool names for cohesion and discoverability
Split multi-purpose tools into focused single-purpose tools
Write clear descriptions that help AI distinguish tools

💬 Discussion

What would happen if a user connects this server alongside GitHub and filesystem servers?
How would an AI decide which tool to use for "show me recent activity"?
What's the difference between a good tool set for humans vs. AI?

review.md

💡 Hints

Hint 1: Identifying collision risks

Look for generic names that other servers might also use:

query - postgres-server also has query
list - many servers have list operations
get - very generic, could mean anything
action - what kind of action?

Ask: "If I saw just this tool name, would I know which server it came from?"

Hint 2: Domain groupings

Consider organizing by business domain, not by operation type:

Customer domain:

customer_get, customer_list, customer_update

Order domain:

order_get, order_list, order_create, order_cancel

Reporting domain:

report_sales, report_inventory, report_customers

Admin domain:

admin_send_email, admin_create_ticket, admin_export

Hint 3: Refactoring the report tool

The report tool does 4 different things. Split by report type:

report_sales
  - Description: "Generate sales report with revenue, units, and trends.
    Use when user asks about sales performance, revenue, or sales trends.
    Returns report data with totals, comparisons, and visualizable metrics."
  - Parameters: { date_range, group_by, include_forecast }
report_inventory

Description: "Generate inventory status report with stock levels and alerts.
Use when user asks about stock, inventory, or supply levels.
Returns current stock, reorder alerts, and turnover metrics."
Parameters: { warehouse, category, include_projections }

report_customers

Description: "Generate customer analytics report with segments and health.
Use when user asks about customer behavior, churn, or segments.
Returns segment breakdown, health scores, and trend analysis."
Parameters: { segment, time_period, include_cohort_analysis }

⚠️ Try the exercise first!Show Solution

# MCP Server Design Review - Solution
Problem Analysis
Tool 1: query
Problems:

❌ Generic name - collides with postgres-server's query
❌ Vague description - "Query data" tells AI nothing
❌ Swiss army knife - queries any table with dynamic type

Tool 2: modify
Problems:

❌ Swiss army knife - insert, update, AND delete in one tool
❌ Dangerous - no separation between safe and destructive operations
❌ Vague description and parameters

Tool 3: get
Problems:

❌ Generic name - get is used everywhere
❌ Swiss army knife - gets customers, orders, products, or users
❌ Description "Get something" is useless

Tool 4: list
Problems:

❌ Generic name - collides with many servers
❌ Swiss army knife - lists any entity type
❌ AI must guess what "things" to list

Tool 5: report
Problems:

❌ Swiss army knife - 4 different report types
❌ AI must know all report types exist
❌ Different reports need different parameters

Tool 6: action
Problems:

❌ Extremely generic - "Perform action" on what?
❌ Mixes unrelated operations (email, tickets, archive, export)
❌ AI can't discover what actions are available


Refactored Design
Customer Domain

  <div class="solution-explanation"><h4>Explanation</h4><p>customer_get

Description: "Get customer details by ID. Use when user asks about a specific customer. Returns profile, contact info, and account status."

customer_list Description: "List customers with optional filters. Use when user asks to see customers or search for customers. Returns paginated customer list with summary info."

customer_update Description: "Update customer information. Use when user explicitly requests customer changes. Returns updated customer record." order_get Description: "Get order details by ID. Use for order lookups and status checks. Returns order with items, status, and tracking."

order_list Description: "List orders with filters. Use for order history and order searches. Returns paginated orders with summary."

order_create Description: "Create a new order. Use when user wants to place an order. Returns created order with ID." report_sales Description: "Generate sales performance report. Use for revenue analysis, sales trends, and performance reviews. Returns totals, comparisons, and trend data."

report_inventory Description: "Generate inventory status report. Use for stock levels, reorder alerts, and supply planning. Returns stock levels and projections."

report_customer_analytics Description: "Generate customer analytics report. Use for churn analysis, segmentation, and customer health. Returns segment data and health metrics." admin_send_email Description: "Send email to customer or internal recipient. Use when user explicitly requests sending an email. Returns send confirmation and tracking ID."

admin_export_data Description: "Export data to file format. Use when user needs data download or file export. Returns file path or download URL."

🧪 Tests

Run these tests locally with:

cargo test

View Test Code

#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    // These are conceptual tests for the exercise
#[test]
fn tool_names_have_domain_prefix() {
    let tool_names = vec![
        &quot;customer_get&quot;,
        &quot;customer_list&quot;,
        &quot;order_create&quot;,
        &quot;report_sales&quot;,
    ];

    for name in tool_names {
        assert!(
            name.contains(&quot;_&quot;),
            &quot;Tool {} should have domain prefix&quot;,
            name
        );
    }
}

#[test]
fn descriptions_follow_template() {
    let description = &quot;Execute read-only queries against the customer database. \
        Use for retrieving customer records. \
        Returns query results as JSON array.&quot;;

    assert!(description.contains(&quot;Use for&quot;),
        &quot;Description should explain when to use&quot;);
    assert!(description.contains(&quot;Returns&quot;),
        &quot;Description should explain what it returns&quot;);
}
}

}

🤔 Reflection

How would you handle a case where a tool legitimately needs to do multiple things?
What's the trade-off between fewer multi-purpose tools and many focused tools?
How might you document the relationships between related tools?
Should you ever break the domain prefix convention? When?