MakingMCP

MCP

Intent Surfaces™

A reference manual for people who design and build MCP (Model Context Protocol) ecosystems

MCP

Intent Surfaces™

A reference manual for people who design and build MCP (Model Context Protocol) ecosystems

MCP

Intent Surfaces™

A reference manual for people who design and build MCP (Model Context Protocol) ecosystems

The Dance of Client and Server

A love story between two machines • The handshake that starts it all • JSON-RPC: The language of connection

The Millisecond Ballet

>> 09:47:23.451 UTC

a client reaches out through the digital void.

>> 09:47:23.451 UTC

a client reaches out through the digital void.

>> 09:47:23.451 UTC

a client reaches out through the digital void.

>> 09:47:23.451 UTC

six milliseconds later—a server responds.

>> 09:47:23.451 UTC

six milliseconds later—a server responds.

>> 09:47:23.451 UTC

six milliseconds later—a server responds.

In those six milliseconds, a negotiation more complex than any human protocol takes place. Capabilities are declared. Versions are verified. Trust is established. A connection is born.

This is the dance of MCP, performed millions of times per second, so elegant that those who witness it can't help but marvel at its choreography.

Initialize.TS

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
// The opening move - a client's first reach
{
  "jsonrpc": "2.0",
  "method": "initialize",
  "params": {
    "protocolVersion": "1.0.0",
    "clientInfo": {
      "name": "claude-desktop",
      "version": "2.1.4"
    },
    "capabilities": {
      "tools": true,
      "resources": true,
      "prompts": true,
      "progressReporting": true,
      "partialResults": true
    }
  },
  "id": "init_1710925643451"
}

// The server's response - 6ms later
{
  "jsonrpc": "2.0",
  "result": {
    "protocolVersion": "1.0.0",
    "serverInfo": {
      "name": "github-mcp-server",
      "version": "0.5.2"
    },
    "capabilities": {
      "tools": [
        { "name": "list_repos", "progressive": true },
        { "name": "create_issue", "confirmRequired": true }
      ],
      "resources": [
        { "uri": "github://repos/*", "subscriptions": true }
      ]
    }
  },
  "id": "init_1710925643451"
}

4.1 The protocol Heartbear

The State Machine Symphony

Every MCP connection is a state machine—a finite set of states with precisely defined transitions. It's deterministic beauty at its finest.

Symphony

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
enum ConnectionState {
  DISCONNECTED = "disconnected",
  CONNECTING = "connecting", 
  INITIALIZING = "initializing",
  READY = "ready",
  BUSY = "busy",
  ERROR = "error",
  CLOSING = "closing"
}

class MCPStateMachine {
  private state: ConnectionState = ConnectionState.DISCONNECTED;
  private stateHistory: Array<{state: ConnectionState, timestamp: number}> = [];
  
  // The rules of the dance
  private transitions: Record<ConnectionState, ConnectionState[]> = {
    [ConnectionState.DISCONNECTED]: [ConnectionState.CONNECTING],
    [ConnectionState.CONNECTING]: [ConnectionState.INITIALIZING, ConnectionState.ERROR],
    [ConnectionState.INITIALIZING]: [ConnectionState.READY, ConnectionState.ERROR],
    [ConnectionState.READY]: [ConnectionState.BUSY, ConnectionState.CLOSING, ConnectionState.ERROR],
    [ConnectionState.BUSY]: [ConnectionState.READY, ConnectionState.ERROR],
    [ConnectionState.ERROR]: [ConnectionState.CLOSING, ConnectionState.CONNECTING],
    [ConnectionState.CLOSING]: [ConnectionState.DISCONNECTED]
  };
  
  transition(newState: ConnectionState): void {
    const validTransitions = this.transitions[this.state];
    
    if (!validTransitions.includes(newState)) {
      throw new Error(
        `Invalid transition: ${this.state} → ${newState}\n` +
        `Valid transitions: ${validTransitions.join(", ")}`
      );
    }
    
    this.stateHistory.push({
      state: newState,
      timestamp: performance.now()
    });
    
    this.state = newState;
    this.emit("stateChange", { from: this.state, to: newState });
  }
}

STATE TIMELINE

4.2 Circle of Orchestra

The Handshake Dissected

Let's slow down time and watch the handshake frame by frame:

Frame 1: The Client Awakens (T+0ms)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
class MCPClient {
  private socket: WebSocket;
  private messageId: number = 0;
  private pendingRequests = new Map();
  
  async connect(serverUrl: string): Promise<void> {
    // Create connection with exponential backoff
    this.socket = new WebSocket(serverUrl);
    
    // The moment of first contact
    this.socket.onopen = () => {
      console.log(`[T+0ms] Connection established to ${serverUrl}`);
      this.performHandshake();
    };
    
    // Set up message handler
    this.socket.onmessage = (event) => {
      const message = JSON.parse(event.data);
      this.handleMessage(message);
    };
  }
}

Frame 2: The Initialize Request (T+1ms)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
private async performHandshake(): Promise<void> {
  const handshakeStart = performance.now();
  
  const initRequest = {
    jsonrpc: "2.0",
    method: "initialize",
    params: {
      protocolVersion: "1.0.0",
      clientInfo: {
        name: "mcp-client",
        version: "1.0.0"
      },
      capabilities: {
        tools: true,
        resources: true,
        prompts: true,
        progressReporting: true,
        streaming: true
      }
    },
    id: this.generateId()
  };

Frame 3: The Server Awakens (T+2ms)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
class MCPClient {
  private socket: WebSocket;
  private messageId: number = 0;
  private pendingRequests = new Map();
  
  async connect(serverUrl: string): Promise<void> {
    // Create connection with exponential backoff
    this.socket = new WebSocket(serverUrl);
    
    // The moment of first contact
    this.socket.onopen = () => {
      console.log(`[T+0ms] Connection established to ${serverUrl}`);
      this.performHandshake();
    };
    
    // Set up message handler
    this.socket.onmessage = (event) => {
      const message = JSON.parse(event.data);
      this.handleMessage(message);
    };
  }
}

Frame 4: Capability Negotiation (T+3-5ms)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
private async performHandshake(): Promise<void> {
  const handshakeStart = performance.now();
  
  const initRequest = {
    jsonrpc: "2.0",
    method: "initialize",
    params: {
      protocolVersion: "1.0.0",
      clientInfo: {
        name: "mcp-client",
        version: "1.0.0"
      },
      capabilities: {
        tools: true,
        resources: true,
        prompts: true,
        progressReporting: true,
        streaming: true
      }
    },
    id: this.generateId()
  };

The Message Anatomy

Every MCP message is a precisely structured JSON-RPC 2.0 packet. Let's dissect one:

Anatomy Code

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
interface MCPMessage {
  jsonrpc: "2.0";  // Always exactly this
  
  // ONE of these (never both):
  method?: string;  // For requests/notifications  
  result?: any;     // For successful responses
  error?: {         // For error responses
    code: number;
    message: string;
    data?: any;
  };
  
  // Required for requests/responses (not notifications)
  id?: string | number | null;
}

// Let's trace a complete request/response cycle
class MessageTracer {
  private traces = new Map<string, MessageTrace>();
  
  traceRequest(message: MCPRequest): void {
    const trace: MessageTrace = {
      id: message.id,
      startTime: performance.now(),
      request: message,
      state: "in-flight",
      hops: [{
        location: "client",
        timestamp: performance.now(),
        action: "send"
      }]
    };
    
    this.traces.set(message.id, trace);
  }
  
  traceResponse(message: MCPResponse): void {
    const trace = this.traces.get(message.id);
    if (!trace) return;
    
    trace.response = message;
    trace.endTime = performance.now();
    trace.duration = trace.endTime - trace.startTime;
    trace.state = "completed";
    
    trace.hops.push({
      location: "server", 
      timestamp: trace.endTime,
      action: "respond"
    });
    
    // Analyze the round trip
    this.analyzeTrace(trace);
  }
  
  private analyzeTrace(trace: MessageTrace): void {
    console.log(`Message ${trace.id} completed in ${trace.duration.toFixed(2)}ms`);
    
    // Detect anomalies
    if (trace.duration > 100) {
      console.warn(`Slow message detected: ${trace.duration}ms`);
    }
    
    // Message size analysis
    const requestSize = JSON.stringify(trace.request).length;
    const responseSize = JSON.stringify(trace.response).length;
    
    console.log(`Bytes sent: ${requestSize}, received: ${responseSize}`);
  }
}

4.3 Message Dissection Lab

The Transport Layer Symphony

MCP can flow over multiple transports, each with its own rhythm:

Standard I/O: The Unix Philosophy

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
interface MCPMessage {
  jsonrpc: "2.0";  // Always exactly this
  
  // ONE of these (never both):
  method?: string;  // For requests/notifications  
  result?: any;     // For successful responses
  error?: {         // For error responses
    code: number;
    message: string;
    data?: any;
  };
  
  // Required for requests/responses (not notifications)
  id?: string | number | null;
}

// Let's trace a complete request/response cycle
class MessageTracer {
  private traces = new Map<string, MessageTrace>();
  
  traceRequest(message: MCPRequest): void {
    const trace: MessageTrace = {
      id: message.id,
      startTime: performance.now(),
      request: message,
      state: "in-flight",
      hops: [{
        location: "client",
        timestamp: performance.now(),
        action: "send"
      }]
    };
    
    this.traces.set(message.id, trace);
  }
  
  traceResponse(message: MCPResponse): void {
    const trace = this.traces.get(message.id);
    if (!trace) return;
    
    trace.response = message;
    trace.endTime = performance.now();
    trace.duration = trace.endTime - trace.startTime;
    trace.state = "completed";
    
    trace.hops.push({
      location: "server", 
      timestamp: trace.endTime,
      action: "respond"
    });
    
    // Analyze the round trip
    this.analyzeTrace(trace);
  }
  
  private analyzeTrace(trace: MessageTrace): void {
    console.log(`Message ${trace.id} completed in ${trace.duration.toFixed(2)}ms`);
    
    // Detect anomalies
    if (trace.duration > 100) {
      console.warn(`Slow message detected: ${trace.duration}ms`);
    }
    
    // Message size analysis
    const requestSize = JSON.stringify(trace.request).length;
    const responseSize = JSON.stringify(trace.response).length;
    
    console.log(`Bytes sent: ${requestSize}, received: ${responseSize}`);
  }
}

HTTP + SSE: The Web Native

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
// HTTP transport with Server-Sent Events for server→client
class HttpSseTransport implements Transport {
  private server: Server;
  private clients = new Set<ServerSentEventStream>();
  
  async start(port: number): Promise<void> {
    this.server = new Server({
      port,
      handler: async (req) => {
        const url = new URL(req.url);
        
        // Handle different endpoints
        switch (url.pathname) {
          case "/mcp/v1/call":
            return await this.handleCall(req);
            
          case "/mcp/v1/events":
            return this.handleEventStream(req);
            
          default:
            return new Response("Not Found", { status: 404 });
        }
      }
    });
  }
  
  private async handleCall(req: Request): Response {
    const message = await req.json();
    const response = await this.processMessage(message);
    
    return new Response(JSON.stringify(response), {
      headers: { "Content-Type": "application/json" }
    });
  }
  
  private handleEventStream(req: Request): Response {
    const stream = new ServerSentEventStream();
    this.clients.add(stream);
    
    // Send heartbeat every 30s
    const heartbeat = setInterval(() => {
      stream.sendEvent({
        event: "heartbeat",
        data: JSON.stringify({ timestamp: Date.now() })
      });
    }, 30000);
    
    req.signal.addEventListener("abort", () => {
      clearInterval(heartbeat);
      this.clients.delete(stream);
    });
    
    return new Response(stream.readable, {
      headers: {
        "Content-Type": "text/event-stream",
        "Cache-Control": "no-cache",
        "Connection": "keep-alive"
      }
    });
  }
}

Click to animate

Click to animate

Click to animate

Click to animate

Click to animate

Click to animate

4.4 Transport Layer Observatory

Error Handling: When Dancers Stumble

Even the most elegant dance can falter. MCP's error handling is itself a thing of beauty:

Handling.ts

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
enum MCPErrorCode {
  // JSON-RPC standard errors
  PARSE_ERROR = -32700,
  INVALID_REQUEST = -32600,
  METHOD_NOT_FOUND = -32601,
  INVALID_PARAMS = -32602,
  INTERNAL_ERROR = -32603,
  
  // MCP-specific errors  
  PROTOCOL_ERROR = -32000,
  CAPABILITY_NOT_SUPPORTED = -32001,
  RESOURCE_NOT_FOUND = -32002,
  TOOL_EXECUTION_FAILED = -32003,
  PERMISSION_DENIED = -32004,
  RATE_LIMITED = -32005,
  TIMEOUT = -32006
}

class MCPErrorHandler {
  private errorStats = new Map<number, ErrorStats>();
  private circuitBreaker = new CircuitBreaker();
  
  async handleError(error: MCPError, context: RequestContext): Promise<MCPErrorResponse> {
    // Track error statistics
    this.trackError(error.code);
    
    // Determine retry strategy
    const strategy = this.getRetryStrategy(error.code);
    
    // Check circuit breaker
    if (this.circuitBreaker.isOpen(context.serverId)) {
      return {
        jsonrpc: "2.0",
        error: {
          code: MCPErrorCode.INTERNAL_ERROR,
          message: "Service temporarily unavailable",
          data: {
            retryAfter: this.circuitBreaker.getResetTime(context.serverId)
          }
        },
        id: context.requestId
      };
    }
    
    // Handle specific error types
    switch (error.code) {
      case MCPErrorCode.RATE_LIMITED:
        return this.handleRateLimit(error, context);
        
      case MCPErrorCode.TIMEOUT:
        return this.handleTimeout(error, context);
        
      case MCPErrorCode.PERMISSION_DENIED:
        return this.handlePermissionDenied(error, context);
        
      default:
        return this.defaultErrorResponse(error, context);
    }
  }
  
  private handleRateLimit(error: MCPError, context: RequestContext): MCPErrorResponse {
    const resetTime = error.data?.resetTime || Date.now() + 60000;
    const retryAfter = Math.ceil((resetTime - Date.now()) / 1000);
    
    return {
      jsonrpc: "2.0",
      error: {
        code: error.code,
        message: "Rate limit exceeded",
        data: {
          limit: error.data?.limit,
          remaining: 0,
          resetTime,
          retryAfter
        }
      },
      id: context.requestId
    };
  }
}

4.5 Error Theater

Performance: The Speed of Light

In the world of protocols, microseconds matter. Here's how MCP stays fast:

Performance Monitor

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
class PerformanceMonitor {
  private metrics = {
    messageCount: 0,
    totalBytes: 0,
    latencies: [] as number[],
    throughput: [] as number[],
    errors: 0
  };
  
  private performanceObserver = new PerformanceObserver((list) => {
    for (const entry of list.getEntries()) {
      if (entry.entryType === "measure" && entry.name.startsWith("mcp.")) {
        this.recordLatency(entry.duration);
      }
    }
  });
  
  recordMessage(message: MCPMessage, direction: "in" | "out"): void {
    const size = JSON.stringify(message).length;
    
    this.metrics.messageCount++;
    this.metrics.totalBytes += size;
    
    // Mark for performance measurement
    performance.mark(`mcp.${direction}.${message.id}`);
    
    // Calculate throughput
    const now = Date.now();
    const throughput = this.calculateThroughput();
    this.metrics.throughput.push(throughput);
    
    // Emit metrics event
    this.emit("metrics", {
      instant: {
        messagesPerSecond: this.getMessagesPerSecond(),
        bytesPerSecond: this.getBytesPerSecond(),
        averageLatency: this.getAverageLatency(),
        p99Latency: this.getP99Latency()
      }
    });
  }
  
  private calculateThroughput(): number {
    const window = 1000; // 1 second window
    const now = Date.now();
    const cutoff = now - window;
    
    // Count messages in window
    const recentMessages = this.metrics.throughput.filter(t => t > cutoff).length;
    return recentMessages;
  }
}

// Real production optimization
class OptimizedMCPClient {
  private messagePool = new ObjectPool<MCPMessage>(() => ({ jsonrpc: "2.0" }));
  private compressionThreshold = 1024; // Compress messages over 1KB
  
  async send(method: string, params: any): Promise<any> {
    const message = this.messagePool.acquire();
    
    try {
      message.method = method;
      message.params = params;
      message.id = this.generateId();
      
      // Optimize payload
      const optimized = this.optimize(message);
      
      // Send with timing
      const start = performance.now();
      const response = await this.transport.send(optimized);
      const duration = performance.now() - start;
      
      // Record metrics
      this.metrics.record({
        method,
        duration,
        payloadSize: JSON.stringify(optimized).length,
        compressed: optimized !== message
      });
      
      return response;
    } finally {
      this.messagePool.release(message);
    }
  }
  
  private optimize(message: MCPMessage): MCPMessage | CompressedMessage {
    const json = JSON.stringify(message);
    
    if (json.length > this.compressionThreshold) {
      return {
        jsonrpc: "2.0",
        compressed: true,
        data: gzip(json),
        id: message.id
      };
    }
    
    return message;
  }
}

4.6 Performance Monitoring

The Multi-Client Orchestra

In production, a single server often handles dozens of simultaneous clients. This is where the dance becomes a ballet:

Multi Client

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
class MCPServerOrchestrator:
    def __init__(self, max_clients: int = 100):
        self.clients: Dict[str, ClientSession] = {}
        self.max_clients = max_clients
        self.client_semaphore = asyncio.Semaphore(max_clients)
        self.message_router = MessageRouter()
        self.load_balancer = LoadBalancer()
        
    async def handle_new_client(self, websocket: WebSocket, client_id: str):
        async with self.client_semaphore:
            # Create session
            session = ClientSession(
                id=client_id,
                websocket=websocket,
                connected_at=time.time()
            )
            
            self.clients[client_id] = session
            
            try:
                # Perform handshake
                await self.handshake(session)
                
                # Main message loop
                await self.message_loop(session)
                
            except Exception as e:
                logger.error(f"Client {client_id} error: {e}")
                
            finally:
                # Cleanup
                del self.clients[client_id]
                await self.broadcast_client_left(client_id)
    
    async distribute_work(self, task: Task) -> TaskResult:
        """Distribute work across available clients"""
        # Find capable clients
        capable_clients = [
            client for client in self.clients.values()
            if task.required_capability in client.capabilities
        ]
        
        if not capable_clients:
            raise NoCapableClientError(f"No client supports {task.required_capability}")
        
        # Load balance
        selected_client = self.load_balancer.select(
            capable_clients,
            factors={
                "current_load": lambda c: c.active_requests,
                "latency": lambda c: c.average_latency,
                "success_rate": lambda c: c.success_rate
            }
        )
        
        # Execute on selected client
        return await self.execute_on_client(selected_client, task)

4.7 Multi Client

The Production Story: Slack's MCP Migration

Let's look at how Slack migrated to MCP, handling 10 million+ messages per day:

Old Migration Code

1
2
3
4
5
// Before: Custom WebSocket protocol
class SlackLegacyProtocol {
  // 5000+ lines of custom protocol code
  // Handling auth, reconnection, message routing...
}

new_migration.typescript

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
// After: MCP implementation
class SlackMCPServer implements MCPServer {
  private slack: SlackAPIClient;
  private messageCache: LRUCache<string, Message>;
  private rateLimiter: RateLimiter;
  
  constructor() {
    this.messageCache = new LRUCache({ max: 10000 });
    this.rateLimiter = new TokenBucket({
      capacity: 1000,
      fillRate: 100 // 100 requests per second
    });
  }
  
  // Complete implementation in under 500 lines
  async getCapabilities(): Promise<ServerCapabilities> {
    return {
      tools: [
        {
          name: "send_message",
          description: "Send a message to a Slack channel",
          inputSchema: {
            type: "object",
            properties: {
              channel: { type: "string" },
              text: { type: "string" },
              thread_ts: { type: "string", optional: true }
            }
          }
        },
        {
          name: "search_messages",
          description: "Search Slack messages",
          inputSchema: {
            type: "object",
            properties: {
              query: { type: "string" },
              count: { type: "number", default: 20 }
            }
          }
        }
      ],
      resources: [
        {
          uri: "slack://messages/*",
          mimeType: "application/json",
          description: "Access Slack messages"
        },
        {
          uri: "slack://users/*",
          mimeType: "application/json", 
          description: "Access user profiles"
        }
      ]
    };
  }
  
  async handleToolCall(name: string, args: any): Promise<any> {
    // Rate limiting
    await this.rateLimiter.consume(1);
    
    switch (name) {
      case "send_message":
        return await this.sendMessage(args);
      case "search_messages":
        return await this.searchMessages(args);
      default:
        throw new Error(`Unknown tool: ${name}`);
    }
  }
  
  private async sendMessage(args: SendMessageArgs): Promise<SendMessageResult> {
    // Implementation with caching and error handling
    const result = await this.slack.chat.postMessage({
      channel: args.channel,
      text: args.text,
      thread_ts: args.thread_ts
    });
    
    // Cache the sent message
    this.messageCache.set(result.ts, {
      ts: result.ts,
      channel: result.channel,
      text: args.text,
      user: "bot"
    });
    
    return {
      ts: result.ts,
      channel: result.channel,
      permalink: `https://slack.com/archives/${result.channel}/p${result.ts.replace('.', '')}`
    };
  }
}

// Performance results:
// - 80% reduction in code complexity
// - 50ms → 5ms average latency  
// - 99.99% uptime (was 99.9%)
// - 10x easier to add new features

4.8 Old migration

The Production Story: Slack's MCP Migration

As we write this in March 2025, MCP 2.0 is on the horizon:

MCP 2.0 Preview

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
// MCP 2.0 Preview - Streaming and multiplexing
interface MCP2Features {
  // Bidirectional streaming
  streaming: {
    server2client: boolean;
    client2server: boolean;
    backpressure: boolean;
  };
  
  // Request multiplexing  
  multiplexing: {
    maxConcurrent: number;
    prioritization: boolean;
  };
  
  // Binary data support
  binaryData: {
    formats: ["msgpack", "protobuf", "cbor"];
    compression: ["gzip", "brotli", "zstd"];
  };
  
  // Federation
  federation: {
    serverDiscovery: boolean;
    crossServerCalls: boolean;
  };
}

Example: Streaming tool response

1
2
3
4
5
6
7
8
9
10
11
12
// Example: Streaming tool response
{
  "jsonrpc": "2.0",
  "method": "tools/call",
  "params": {
    "name": "analyze_large_dataset",
    "arguments": { "dataset": "sales_2024" },
    "streaming": true
  },
  "id": "stream_001"
}

Server sends multiple partial results

1
2
3
4
5
6
7
8
9
10
// Server sends multiple partial results
{
  "jsonrpc": "2.0",
  "partial": true,
  "result": {
    "progress": 0.25,
    "data": { "rowsProcessed": 250000 }
  },
  "id": "stream_001"
}

More partials...

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
// After: MCP implementation
class SlackMCPServer implements MCPServer {
  private slack: SlackAPIClient;
  private messageCache: LRUCache<string, Message>;
  private rateLimiter: RateLimiter;
  
  constructor() {
    this.messageCache = new LRUCache({ max: 10000 });
    this.rateLimiter = new TokenBucket({
      capacity: 1000,
      fillRate: 100 // 100 requests per second
    });
  }
  
  // Complete implementation in under 500 lines
  async getCapabilities(): Promise<ServerCapabilities> {
    return {
      tools: [
        {
          name: "send_message",
          description: "Send a message to a Slack channel",
          inputSchema: {
            type: "object",
            properties: {
              channel: { type: "string" },
              text: { type: "string" },
              thread_ts: { type: "string", optional: true }
            }
          }
        },
        {
          name: "search_messages",
          description: "Search Slack messages",
          inputSchema: {
            type: "object",
            properties: {
              query: { type: "string" },
              count: { type: "number", default: 20 }
            }
          }
        }
      ],
      resources: [
        {
          uri: "slack://messages/*",
          mimeType: "application/json",
          description: "Access Slack messages"
        },
        {
          uri: "slack://users/*",
          mimeType: "application/json", 
          description: "Access user profiles"
        }
      ]
    };
  }
  
  async handleToolCall(name: string, args: any): Promise<any> {
    // Rate limiting
    await this.rateLimiter.consume(1);
    
    switch (name) {
      case "send_message":
        return await this.sendMessage(args);
      case "search_messages":
        return await this.searchMessages(args);
      default:
        throw new Error(`Unknown tool: ${name}`);
    }
  }
  
  private async sendMessage(args: SendMessageArgs): Promise<SendMessageResult> {
    // Implementation with caching and error handling
    const result = await this.slack.chat.postMessage({
      channel: args.channel,
      text: args.text,
      thread_ts: args.thread_ts
    });
    
    // Cache the sent message
    this.messageCache.set(result.ts, {
      ts: result.ts,
      channel: result.channel,
      text: args.text,
      user: "bot"
    });
    
    return {
      ts: result.ts,
      channel: result.channel,
      permalink: `https://slack.com/archives/${result.channel}/p${result.ts.replace('.', '')}`
    };
  }
}

// Performance results:
// - 80% reduction in code complexity
// - 50ms → 5ms average latency  
// - 99.99% uptime (was 99.9%)
// - 10x easier to add new features

4.9 The Time Machine

PREV. CHAPTER

PREV.

NEXT CHAPTER

Who built this? Follow the signal