Lab 3: Architecture Deep Dive¶

This page provides detailed architectural information for the AgentCore with MCP Server pattern used in Lab 3.

🏗️ System Architecture¶

High-Level Overview¶

graph TB subgraph "Frontend Layer" UI[React Search UI
App Runner] end subgraph "API Layer" AG[API Gateway
HTTP API] AUTH[Cognito
Authentication] end subgraph "Processing Layer" RUNTIME[AgentCore Runtime
Lambda Function] end subgraph "AWS Bedrock AgentCore" AGENT[Agent Runtime
Container
Nova Lite v1] MEMORY[AgentCore Memory
Session + Cross-session] end subgraph "MCP Server Container" MCP[MCP Server
Python FastAPI] T1[search_coveo
Tool] T2[passage_retrieval
Tool] T3[answer_question
Tool] end subgraph "External Services" COVEO[Coveo Platform
Search + Passages + Answer APIs] end UI --> AG AG --> AUTH AG --> RUNTIME RUNTIME --> AGENT AGENT --> MEMORY AGENT --> MCP MCP --> T1 MCP --> T2 MCP --> T3 T1 --> COVEO T2 --> COVEO T3 --> COVEO style UI fill:#e1f5fe style AGENT fill:#f3e5f5 style MCP fill:#fff3e0 style COVEO fill:#e8f5e9

Pattern 3: Overall Request Flow¶

%%{init: {'theme':'base', 'themeVariables': { 'primaryColor':'#f3e5f5','primaryTextColor':'#000','primaryBorderColor':'#9c27b0','lineColor':'#9c27b0','secondaryColor':'#e1f5fe','tertiaryColor':'#fff3e0'}}}%% sequenceDiagram participant User participant UI as Search UI participant Runtime as AgentCore Runtime participant MCP as Coveo MCP Server participant Coveo as Coveo APIs participant Memory as AgentCore Memory User->>+UI: Question UI->>+Runtime: Invoke agent Runtime->>Runtime: Analyze question Runtime->>+MCP: Call MCP tool 1
(search_coveo) MCP->>+Coveo: Search API Coveo-->>-MCP: Search results MCP-->>-Runtime: Tool response Runtime->>+MCP: Call MCP tool 2
(passage_retrieval) MCP->>+Coveo: Passages API Coveo-->>-MCP: Relevant passages MCP-->>-Runtime: Tool response Runtime->>+MCP: Call MCP tool 3
(answer_question) MCP->>+Coveo: Answer API Coveo-->>-MCP: AI answer MCP-->>-Runtime: Tool response Runtime->>Runtime: Synthesize response Runtime->>+Memory: Store conversation Memory-->>-Runtime: Context saved Runtime-->>-UI: Comprehensive response UI-->>-User: Answer + citations Note over User,Memory: Pattern 3: AgentCore + MCP Note over Runtime,MCP: Multi-Tool Orchestration

Component Details¶

AgentCore Runtime Lambda¶

Purpose: Invokes AgentCore Agent Runtime
Responsibilities:
- Extract user query from request
- Manage session IDs
- Invoke Agent Runtime container
- Stream responses back to UI
- Handle errors and retries

Agent Runtime Container¶

Deployment: Containerized on AWS Bedrock AgentCore
Foundation Model: Amazon Nova Lite v1:0
Capabilities:
- Natural language understanding
- Multi-tool orchestration
- Response generation
- Memory management
Configuration:
- System prompt
- MCP server connection
- Memory settings
- Model parameters

MCP Server Container¶

Technology: Python FastAPI application
Protocol: Model Context Protocol (MCP)
Deployment: Containerized on AWS Bedrock AgentCore
Tools Provided:
1. search_coveo: Search the knowledge base
2. passage_retrieval: Get relevant text passages
3. answer_question: Generate AI answers
Communication: HTTP/JSON over MCP protocol

AgentCore Memory¶

Type: Session + Cross-session memory
Storage: Managed by AWS Bedrock
Capabilities:
- Within-session context retention
- Cross-session memory (remember previous conversations)
- Configurable retention period
- Automatic cleanup

🔄 Request Flow Sequences¶

Single Query with Multi-Tool Orchestration¶

sequenceDiagram participant User participant UI as React UI participant AG as API Gateway participant RUNTIME as Runtime Lambda participant AGENT as Agent Runtime participant MCP as MCP Server participant COVEO as Coveo API participant MEM as Memory User->>UI: Enter complex question UI->>AG: POST /chat (backend=coveoMCP) AG->>RUNTIME: Invoke Lambda RUNTIME->>RUNTIME: Generate/retrieve session ID RUNTIME->>AGENT: Invoke agent with query Note over AGENT: Agent analyzes query
Determines multiple tools needed AGENT->>MCP: Call search_coveo(query) MCP->>COVEO: Search API request COVEO-->>MCP: Search results MCP-->>AGENT: Formatted search results Note over AGENT: Agent evaluates results
Decides more detail needed AGENT->>MCP: Call passage_retrieval(query) MCP->>COVEO: Passages API request COVEO-->>MCP: Relevant passages MCP-->>AGENT: Formatted passages Note over AGENT: Agent has enough information
Generates comprehensive response AGENT->>MEM: Store conversation context AGENT-->>RUNTIME: Response with citations RUNTIME->>RUNTIME: Format response RUNTIME-->>AG: JSON response AG-->>UI: HTTP 200 + response UI-->>User: Display comprehensive answer

Key Steps:

User enters complex question
Agent Runtime analyzes query complexity
Agent calls search_coveo for overview
Agent evaluates if more detail needed
Agent calls passage_retrieval for specifics
Agent synthesizes comprehensive response
Conversation stored in memory
Response returned with citations from multiple tools

Multi-Turn Conversation with Tool Orchestration¶

sequenceDiagram participant User participant AGENT as Agent Runtime participant MCP as MCP Server participant MEM as Memory Note over User,MEM: Turn 1: Initial Complex Question User->>AGENT: "Compare traditional IRA vs Roth IRA" AGENT->>MEM: Check for context (none) AGENT->>MCP: search_coveo("traditional IRA") MCP-->>AGENT: Traditional IRA results AGENT->>MCP: search_coveo("Roth IRA") MCP-->>AGENT: Roth IRA results AGENT->>MCP: passage_retrieval("IRA comparison") MCP-->>AGENT: Comparison passages AGENT->>MEM: Store: User comparing IRAs AGENT-->>User: Comprehensive comparison Note over User,MEM: Turn 2: Follow-up Question User->>AGENT: "Which is better for my situation?" AGENT->>MEM: Retrieve context (IRA comparison) Note over AGENT: Agent knows context:
Traditional vs Roth IRA AGENT->>MCP: passage_retrieval("IRA selection criteria") MCP-->>AGENT: Decision criteria passages AGENT->>MEM: Update context AGENT-->>User: Guidance on selection Note over User,MEM: Turn 3: Specific Detail User->>AGENT: "What are the contribution limits?" AGENT->>MEM: Retrieve full context Note over AGENT: Agent knows we're discussing
IRA contribution limits AGENT->>MCP: search_coveo("IRA contribution limits 2024") MCP-->>AGENT: Current limits AGENT->>MEM: Update context AGENT-->>User: Specific limit information

🔧 Technical Implementation¶

AgentCore Runtime Lambda¶

import json
import boto3
from uuid import uuid4

bedrock_agentcore = boto3.client('bedrock-agent-runtime')

def lambda_handler(event, context):
    """
    Handles chat requests for AgentCore integration
    """
    # 1. Extract request parameters
    body = json.loads(event['body'])
    query = body.get('query', '')
    session_id = body.get('sessionId') or str(uuid4())
    memory_id = body.get('memoryId')  # For cross-session memory

    # 2. Invoke AgentCore Runtime
    try:
        response = bedrock_agentcore.invoke_agent(
            agentId=os.environ['AGENT_RUNTIME_ARN'],
            sessionId=session_id,
            memoryId=memory_id,
            inputText=query,
            enableTrace=True  # Enable detailed tracing
        )

        # 3. Process streaming response
        answer = ""
        citations = []
        tool_invocations = []

        for event in response['completion']:
            if 'chunk' in event:
                chunk = event['chunk']
                if 'bytes' in chunk:
                    answer += chunk['bytes'].decode('utf-8')

            if 'trace' in event:
                # Extract tool invocations and citations
                trace = event['trace']
                tool_invocations.append(extract_tool_info(trace))
                citations.extend(extract_citations(trace))

        # 4. Format and return response
        return {
            'statusCode': 200,
            'headers': {
                'Content-Type': 'application/json',
                'Access-Control-Allow-Origin': '*'
            },
            'body': json.dumps({
                'answer': answer,
                'citations': citations,
                'toolInvocations': tool_invocations,
                'sessionId': session_id,
                'memoryId': memory_id
            })
        }

    except Exception as e:
        return {
            'statusCode': 500,
            'body': json.dumps({
                'error': 'AgentCore invocation failed',
                'message': str(e)
            })
        }

MCP Server Implementation¶

from fastapi import FastAPI
from mcp import MCPServer, Tool
import requests
import os

app = FastAPI()
mcp_server = MCPServer()

# Tool 1: Search Coveo
@mcp_server.tool(
    name="search_coveo",
    description="Search the Coveo knowledge base for relevant information",
    parameters={
        "query": {
            "type": "string",
            "description": "The search query"
        }
    }
)
async def search_coveo(query: str):
    """Search Coveo knowledge base"""
    coveo_request = {
        'q': query,
        'organizationId': os.environ['COVEO_ORG_ID'],
        'searchHub': 'workshop',
        'numberOfResults': 10
    }

    headers = {
        'Authorization': f'Bearer {os.environ["COVEO_API_KEY"]}',
        'Content-Type': 'application/json'
    }

    response = requests.post(
        'https://platform.cloud.coveo.com/rest/search/v2/',
        json=coveo_request,
        headers=headers
    )

    results = response.json()
    return {
        'results': results.get('results', []),
        'totalCount': results.get('totalCount', 0)
    }

# Tool 2: Passage Retrieval
@mcp_server.tool(
    name="passage_retrieval",
    description="Retrieve relevant text passages from Coveo knowledge base",
    parameters={
        "query": {
            "type": "string",
            "description": "The query to find relevant passages"
        }
    }
)
async def passage_retrieval(query: str):
    """Retrieve passages from Coveo"""
    coveo_request = {
        'q': query,
        'organizationId': os.environ['COVEO_ORG_ID'],
        'searchHub': 'workshop',
        'numberOfPassages': 5
    }

    headers = {
        'Authorization': f'Bearer {os.environ["COVEO_API_KEY"]}',
        'Content-Type': 'application/json'
    }

    response = requests.post(
        'https://platform.cloud.coveo.com/rest/search/v2/passages',
        json=coveo_request,
        headers=headers
    )

    passages = response.json()
    return {
        'passages': passages.get('passages', [])
    }

# Tool 3: Answer Question
@mcp_server.tool(
    name="answer_question",
    description="Generate an AI answer using Coveo Answer API",
    parameters={
        "query": {
            "type": "string",
            "description": "The question to answer"
        }
    }
)
async def answer_question(query: str):
    """Generate answer using Coveo Answer API"""
    coveo_request = {
        'q': query,
        'organizationId': os.environ['COVEO_ORG_ID'],
        'searchHub': 'workshop',
        'answerConfigurationId': os.environ['COVEO_ANSWER_CONFIG_ID']
    }

    headers = {
        'Authorization': f'Bearer {os.environ["COVEO_API_KEY"]}',
        'Content-Type': 'application/json'
    }

    response = requests.post(
        'https://platform.cloud.coveo.com/rest/search/v2/answer',
        json=coveo_request,
        headers=headers
    )

    answer_data = response.json()
    return {
        'answer': answer_data.get('answer', ''),
        'citations': answer_data.get('citations', [])
    }

# MCP Server endpoints
@app.post("/mcp/tools/list")
async def list_tools():
    """List available tools"""
    return mcp_server.list_tools()

@app.post("/mcp/tools/call")
async def call_tool(request: dict):
    """Call a specific tool"""
    return await mcp_server.call_tool(
        request['tool_name'],
        request['parameters']
    )

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=3000)

Scalability¶

Agent Runtime: Containerized, auto-scales
MCP Server: Containerized, auto-scales
Concurrent Sessions: 1000+ per runtime
Tool Concurrency: Parallel tool invocations supported
Memory Storage: Managed by AWS, scales automatically

🔒 Security Implementation¶

Multi-Layer Security¶

graph TB A[User Request] --> B[Cognito JWT] B --> C[API Gateway] C --> D{Validate Token} D -->|Valid| E[Runtime Lambda] D -->|Invalid| F[401 Error] E --> G[IAM Role] G --> H[Agent Runtime] H --> I[MCP Server] I --> J[IAM Role] J --> K[Coveo API] style B fill:#fff3e0 style D fill:#f3e5f5 style G fill:#e8f5e9 style J fill:#e8f5e9

Security Layers:

User Authentication: Cognito JWT tokens
API Authorization: API Gateway validates tokens
Lambda Execution: IAM roles with least privilege
Container Security: Isolated runtime environments
MCP Communication: Encrypted internal communication
API Access: Secure Coveo API key management

Data Protection¶

In Transit: All communications over HTTPS/TLS
At Rest: Memory encrypted in AWS storage
Container Isolation: Each runtime isolated
API Keys: Stored in environment variables
Session Data: Automatically expires

🎯 Benefits of This Architecture¶

Advantages¶

🔧 Multi-Tool
Orchestrate multiple tools for comprehensive answers

🔌 Standardized
MCP protocol for consistent tool integration

📊 Observable
Detailed logs and traces for debugging

🚀 Scalable
Containerized components auto-scale

🔄 Flexible
Easy to add new tools and capabilities

Trade-offs¶

Aspect	Benefit	Limitation
Latency	Acceptable (3-5s)	Slower than simpler patterns
Complexity	Powerful capabilities	More components to manage
Cost	Pay for value	Higher than direct API
Flexibility	Highly extensible	Requires container management

🔄 Comparison with Other Patterns¶

Aspect	Coveo Direct	Bedrock Agent	AgentCore MCP
Architecture	Simple (3 components)	Moderate (6 components)	Complex (8+ components)
Deployment	Serverless	Managed service	Containerized
Tools	N/A	1 tool (supports multiple)	3+ tools
Orchestration	None	Basic	Advanced
Protocol	REST API	Agent-driven via action-group interface	MCP (standard)
Observability	Basic logs	Traces	Enhanced traces + logs
Latency	~200ms	~2-3s	~3-5s
Scalability	High	High	Very High
Flexibility	Low	Medium	High
Use Case	FAQ	Support Chat	Consultation

📚 Additional Resources¶

Coveo Documentation¶

AWS Documentation¶

MCP Protocol¶

🚀 Next Steps¶

Now that you understand the architecture, return to the Lab 3 introduction to complete the hands-on exercises.

Return to Lab 3 Introduction →