Plugin Framework¶

Production Ready

The plugin framework is production ready with comprehensive hook coverage, robust error handling, and battle-tested implementations. Supports both native and external service plugins.

Overview¶

The MCP Context Forge Plugin Framework provides a comprehensive, production-grade system for extending gateway functionality through pre/post processing hooks at various points in the MCP request lifecycle. The framework supports both high-performance native plugins and sophisticated external AI service integrations.

Key Capabilities¶

AI Safety Middleware - Integration with LlamaGuard, OpenAI Moderation, custom ML models
Content Security - PII detection and masking, input validation, output sanitization
Policy Enforcement - Business rules, compliance checking, audit trails
Performance Protection - Timeout handling, resource limits, graceful degradation
Operational Excellence - Health‑oriented design, clear errors, sensible defaults
Enterprise Features - Multi-tenant isolation, conditional execution, sophisticated context management

Enabling Plugins¶

Environment Configuration¶

Enable the plugin framework in your .env file:

# Enable plugin framework
PLUGINS_ENABLED=true

# Optional: Custom plugin config path
PLUGIN_CONFIG_FILE=plugins/config.yaml

If deploying the gateway as a container, set these environment variables in your compose file or in the container's run command.

Architecture¶

Plugin Framework Specification

Check the specification docs for a detailed design of the plugin system.

The plugin framework implements a hybrid architecture supporting both native and external service integrations:

Native Plugins¶

In-Process Execution: Written in Python, run directly within the gateway process
High Performance: Sub-millisecond latency, no network overhead
Direct Access: Full access to gateway internals and context
Use Cases: PII filtering, regex transformations, input validation, simple business rules
Examples: PIIFilterPlugin, SearchReplacePlugin, DenyListPlugin

External Service Plugins¶

MCP Integration: External plugins communicate via MCP using STDIO or Streamable HTTP
Enterprise AI Support: LlamaGuard, OpenAI Moderation, custom ML models
Independent Scaling: Services run outside the gateway and can scale separately
Use Cases: Advanced AI safety, complex ML inference, policy engines (e.g., OPA)
Examples: OPA external plugin server, LlamaGuard integration, OpenAI Moderation

Unified Plugin Interface¶

Both plugin types implement the same interface, enabling seamless switching between deployment models:

class Plugin:
    async def prompt_pre_fetch(self, payload: PromptPrehookPayload,
                              context: PluginContext) -> PromptPrehookResult: ...
    async def prompt_post_fetch(self, payload: PromptPosthookPayload,
                               context: PluginContext) -> PromptPosthookResult: ...
    async def tool_pre_invoke(self, payload: ToolPreInvokePayload,
                             context: PluginContext) -> ToolPreInvokeResult: ...
    async def tool_post_invoke(self, payload: ToolPostInvokePayload,
                              context: PluginContext) -> ToolPostInvokeResult: ...
    async def resource_pre_fetch(self, payload: ResourcePreFetchPayload,
                                context: PluginContext) -> ResourcePreFetchResult: ...
    async def resource_post_fetch(self, payload: ResourcePostFetchPayload,
                                 context: PluginContext) -> ResourcePostFetchResult: ...
    # ... additional hook methods

Build Your First Plugin (Quickstart)¶

Decide between a native (in‑process) or external (MCP) plugin:

Native: simplest path; write Python class extending Plugin, configure via plugins/config.yaml using fully‑qualified class path.
External: runs as a separate MCP server (STDIO or Streamable HTTP); great for independent scaling and isolation.

Plugins CLI

To bootstrap a plugin quickly (native or external), run mcpplugins bootstrap and follow the prompts.

Quick native skeleton:

from mcpgateway.plugins.framework import Plugin, PluginConfig, PluginContext, PromptPrehookPayload, PromptPrehookResult

class MyPlugin(Plugin):
    def __init__(self, config: PluginConfig):
        super().__init__(config)

    async def prompt_pre_fetch(self, payload: PromptPrehookPayload, context: PluginContext) -> PromptPrehookResult:
        # modify
        return PromptPrehookResult(modified_payload=payload)

        # or block
        # return PromptPrehookResult(
        #     continue_processing=False,
        #     violation=PluginViolation(
        #         reason=f"Blocked by {self.name}",
        #         description="...",
        #         code="...",
        #         details="..."
        #     )
        # )

Register it in plugins/config.yaml:

plugins:

  - name: "MyPlugin"
    kind: "plugins.my_plugin.plugin.MyPlugin"
    hooks: ["prompt_pre_fetch"]
    mode: "permissive"
    priority: 120

External plugin quickstart:

Plugins Lifecycle Guide

See the plugin lifecycle guide for building, testing, and serving extenal plugins.

plugins:

  - name: "MyExternal"
    kind: "external"
    priority: 10
    mcp:
      proto: STREAMABLEHTTP
      url: http://localhost:8000/mcp

Plugin Configuration¶

The plugin configuration file is used to configure a set of plugins that implement hook functions used to register to hook points throughout the MCP Context Forge. An example configuration is below. It contains two main sections: plugins and plugin_settings.

Plugin Configuration

Check here for detailed explanations of configuration options and fields.

# plugins/config.yaml
plugins:

  - name: "PIIFilterPlugin"                    # Unique plugin identifier
    kind: "plugins.pii_filter.pii_filter.PIIFilterPlugin"  # Plugin class path
    description: "Detects and masks PII"       # Human-readable description
    version: "1.0.0"                          # Plugin version
    author: "Security Team"                   # Plugin author
    hooks:                                    # Hook registration

      - "prompt_pre_fetch"
      - "tool_pre_invoke"
      - "tool_post_invoke"
    tags:                                     # Searchable tags

      - "security"
      - "pii"
      - "compliance"
    mode: "enforce"                           # enforce|enforce_ignore_error|permissive|disabled
    priority: 50                              # Execution priority (lower = higher)
    conditions:                               # Conditional execution

      - server_ids: ["prod-server"]
        tenant_ids: ["enterprise"]
        tools: ["sensitive-tool"]
    config:                                   # Plugin-specific configuration
      detect_ssn: true
      detect_credit_card: true
      mask_strategy: "partial"
      redaction_text: "[REDACTED]"

# Global plugin settings
plugin_settings:
  parallel_execution_within_band: false      # Execute same-priority plugins in parallel
  plugin_timeout: 30                         # Per-plugin timeout (seconds)
  fail_on_plugin_error: false                # Continue on plugin failures
  plugin_health_check_interval: 60           # Health check interval (seconds)

Getting Started (Native Plugins)¶

Use the native plugins out of the box:

Copy and adapt the example config (enable any subset):

# plugins/config.yaml
plugins:

  - name: "PIIFilterPlugin"
    kind: "plugins.pii_filter.pii_filter.PIIFilterPlugin"
    hooks: ["prompt_pre_fetch", "prompt_post_fetch", "tool_pre_invoke", "tool_post_invoke"]
    mode: "permissive"
    priority: 50
    config:
      detect_ssn: true
      detect_email: true
      detect_credit_card: true
      default_mask_strategy: "partial"

  - name: "ReplaceBadWordsPlugin"
    kind: "plugins.regex_filter.search_replace.SearchReplacePlugin"
    hooks: ["prompt_pre_fetch", "prompt_post_fetch", "tool_pre_invoke", "tool_post_invoke"]
    mode: "enforce"
    priority: 150
    config:
      words:

        - { search: "crap", replace: "crud" }
        - { search: "crud", replace: "yikes" }

  - name: "DenyListPlugin"
    kind: "plugins.deny_filter.deny.DenyListPlugin"
    hooks: ["prompt_pre_fetch"]
    mode: "enforce"
    priority: 100
    config:
      words: ["innovative", "groundbreaking", "revolutionary"]

  - name: "ResourceFilterExample"
    kind: "plugins.resource_filter.resource_filter.ResourceFilterPlugin"
    hooks: ["resource_pre_fetch", "resource_post_fetch"]
    mode: "enforce"
    priority: 75
    config:
      max_content_size: 1048576
      allowed_protocols: ["http", "https"]
      blocked_domains: ["malicious.example.com"]
      content_filters:

        - { pattern: "password\\s*[:=]\\s*\\S+", replacement: "password: [REDACTED]" }

plugin_settings:
  parallel_execution_within_band: false
  plugin_timeout: 30
  fail_on_plugin_error: false
  enable_plugin_api: true
  plugin_health_check_interval: 60

Ensure .env contains: PLUGINS_ENABLED=true and PLUGIN_CONFIG_FILE=plugins/config.yaml.
Start the gateway: make dev (or make serve).

That's it — the gateway now runs the enabled plugins at the selected hook points.

Plugin Configuration¶

The plugins section lists the set of configured plugins that will be loaded by the Context Forge at startup. Each plugin contains a set of standard configurations, and then a config section designed for plugin specific configurations. The attributes are defined as follows:

Field	Type	Required	Default	Description	Example Values
`name`	`string`	Yes	-	Unique plugin identifier within the configuration	`"PIIFilterPlugin"`, `"OpenAIModeration"`
`kind`	`string`	Yes	-	Plugin class path for native plugins or `"external"` for MCP servers	`"plugins.pii_filter.pii_filter.PIIFilterPlugin"`, `"external"`
`description`	`string`		`null`	Human-readable description of plugin functionality	`"Detects and masks PII in requests"`
`author`	`string`		`null`	Plugin author or team responsible for maintenance	`"Security Team"`, `"AI Safety Group"`
`version`	`string`		`null`	Plugin version for tracking and compatibility	`"1.0.0"`, `"2.3.1-beta"`
`hooks`	`string[]`		`[]`	List of hook points where plugin executes	`["prompt_pre_fetch", "tool_pre_invoke"]`
`tags`	`string[]`		`[]`	Searchable tags for plugin categorization	`["security", "pii", "compliance"]`
`mode`	`string`		`"enforce"`	Plugin execution mode controlling behavior on violations	`"enforce"`, `"enforce_ignore_error"`, `"permissive"`, `"disabled"`
`priority`	`integer`		`null`	Execution priority (lower number = higher priority)	`10`, `50`, `100`
`conditions`	`object[]`		`[]`	Conditional execution rules for targeting specific contexts	See Condition Fields below
`config`	`object`		`{}`	Plugin-specific configuration parameters	`{"detect_ssn": true, "mask_strategy": "partial"}`
`mcp`	`object`		`null`	External MCP server configuration (required for external plugins)	See MCP Configuration below

Hook Types¶

Available hook values for the hooks field:

MCP Protocol Hooks:

Hook Value	Description	Timing
`"prompt_pre_fetch"`	Process prompt requests before template processing	Before prompt template retrieval
`"prompt_post_fetch"`	Process prompt responses after template rendering	After prompt template processing
`"tool_pre_invoke"`	Process tool calls before execution	Before tool invocation
`"tool_post_invoke"`	Process tool results after execution	After tool completion
`"resource_pre_fetch"`	Process resource requests before fetching	Before resource retrieval
`"resource_post_fetch"`	Process resource content after loading	After resource content loading

HTTP Authentication & Middleware Hooks:

Hook Value	Description	Timing
`"http_pre_request"`	Transform HTTP headers before processing	Before authentication
`"http_auth_resolve_user"`	Implement custom authentication	During user authentication
`"http_auth_check_permission"`	Custom permission checking logic	Before RBAC checks
`"http_post_request"`	Process responses and add audit headers	After request completion

See the HTTP Authentication Hooks Guide for detailed implementation examples.

Condition Fields¶

Users may only want plugins to be invoked on specific servers, tools, and prompts. To address this, a set of conditionals can be applied to a plugin. The attributes in a conditional combine together in as a set of and operations, while each attribute list item is ored with other items in the list.

The conditions array contains objects that specify when plugins should execute:

Field	Type	Description	Example
`server_ids`	`string[]`	Execute only for specific virtual server IDs	`["prod-server", "api-gateway"]`
`tenant_ids`	`string[]`	Execute only for specific tenant/organization IDs	`["enterprise", "premium-tier"]`
`tools`	`string[]`	Execute only for specific tool names	`["file_reader", "web_scraper"]`
`prompts`	`string[]`	Execute only for specific prompt names	`["user_prompt", "system_message"]`
`resources`	`string[]`	Execute only for specific resource URI patterns	`["https://api.example.com/*"]`
`user_patterns`	`string[]`	Execute for users matching regex patterns	`["admin_.", ".@company.com"]`
`content_types`	`string[]`	Execute for specific content types	`["application/json", "text/plain"]`

MCP Configuration Fields¶

For external plugins (kind: "external"), the mcp object configures the MCP server connection:

Field	Type	Required	Description	Example
`proto`	`string`	Yes	MCP transport protocol	`"stdio"`, `"sse"`, `"streamablehttp"`, `"websocket"`
`url`	`string`		Service URL for HTTP-based transports	`"http://openai-plugin:3000/mcp"`
`script`	`string`		Script path for STDIO transport	`"/opt/plugins/custom-filter.py"`

Global Plugin Settings¶

The plugin_settings are as follows:

Setting	Type	Default	Description
`parallel_execution_within_band`	`boolean`	`false`	Execute plugins with same priority in parallel
`plugin_timeout`	`integer`	`30`	Per-plugin timeout in seconds
`fail_on_plugin_error`	`boolean`	`false`	Stop processing on plugin errors
`plugin_health_check_interval`	`integer`	`60`	Health check interval in seconds

Execution Modes¶

Each plugin can operate in one of four modes:

Mode	Behavior	Use Case
`"enforce"`	Block requests when plugin detects violations or errors	Production security plugins, critical compliance checks
`"enforce_ignore_error"`	Block on violations but continue on plugin errors	Security plugins that should block violations but not break on technical errors
`"permissive"`	Log violations and errors but allow requests to continue	Development environments, monitoring-only plugins
`"disabled"`	Plugin is loaded but never executed	Temporary plugin deactivation, maintenance mode

Priority and Execution Order¶

Plugins execute in priority order (ascending):

# Execution order example
plugins:

  - name: "Authentication"
    priority: 10      # Runs first

  - name: "RateLimiter"
    priority: 50      # Runs second

  - name: "ContentFilter"
    priority: 100     # Runs third

  - name: "Logger"
    priority: 200     # Runs last

Plugins with the same priority may execute in parallel if parallel_execution_within_band is enabled.

Available Hooks¶

The plugin framework provides comprehensive hook coverage across the entire MCP request lifecycle:

Production Hooks (Implemented)¶

Hook	Execution Point	Use Cases	Payload Type
`prompt_pre_fetch`	Before prompt template retrieval	Argument validation, PII scanning, input sanitization	`PromptPrehookPayload`
`prompt_post_fetch`	After prompt template rendering	Content filtering, output transformation, safety checks	`PromptPosthookPayload`
`tool_pre_invoke`	Before tool execution	Authorization, argument validation, dangerous operation blocking	`ToolPreInvokePayload`
`tool_post_invoke`	After tool execution	Result filtering, PII masking, audit logging, response transformation	`ToolPostInvokePayload`
`resource_pre_fetch`	Before resource fetching	URI validation, protocol checking, metadata injection	`ResourcePreFetchPayload`
`resource_post_fetch`	After resource content retrieval	Content filtering, size validation, sensitive data redaction	`ResourcePostFetchPayload`
`http_pre_request`	Before HTTP request processing	Header transformation (e.g., custom token to Bearer)	`HttpPreRequestPayload`
`http_auth_resolve_user`	During user authentication	Custom authentication systems (LDAP, mTLS, token-based)	`HttpAuthResolveUserPayload`
`http_auth_check_permission`	Before RBAC permission checks	Custom permission logic (token-based access, time-based rules)	`HttpAuthCheckPermissionPayload`
`http_post_request`	After HTTP request completion	Audit logging, response header injection	`HttpPostRequestPayload`
`agent_pre_invoke`	Before agent invocation	Message filtering, access control, tool restrictions	`AgentPreInvokePayload`
`agent_post_invoke`	After agent response	Response filtering, content moderation, audit logging	`AgentPostInvokePayload`

HTTP Authentication & Middleware Hooks

For detailed information on implementing custom authentication and authorization, see the HTTP Authentication Hooks Guide.

Agent-to-Agent (A2A) Hooks

Agent hooks enable filtering and monitoring of Agent-to-Agent interactions. These hooks allow you to: - Filter/transform messages before they reach agents - Control which tools are available to agents - Override model or system prompt settings - Filter agent responses for safety/compliance - Monitor tool invocations made by agents

See A2A Documentation for more information on Agent-to-Agent features.

Planned Hooks (Roadmap)¶

Hook	Purpose	Expected Release
`server_pre_register`	Server attestation and validation before admission	v0.9.0
`server_post_register`	Post-registration processing and setup	v0.9.0
`federation_pre_sync`	Gateway federation validation and filtering	v0.10.0
`federation_post_sync`	Post-federation data processing and reconciliation	v0.10.0

Prompt Hooks Details¶

The prompt hooks allow plugins to intercept and modify prompt retrieval and rendering:

prompt_pre_fetch: Receives the prompt name and arguments before prompt template retrieval. Can modify the arguments.
prompt_post_fetch: Receives the completed prompt after rendering. Can modify the prompt text or block it from being returned.

Example Use Cases:

Detect prompt injection attacks
Sanitize or anonymize prompts
Search and replace

Prompt Hook Payloads¶

PromptPrehookPayload: Payload for prompt pre-fetch hooks.

class PromptPrehookPayload(BaseModel):
    name: str                                    # Prompt template name
    args: Optional[dict[str, str]] = Field(default_factory=dict)  # Template arguments

Example:

payload = PromptPrehookPayload(
    name="user_greeting",
    args={"user_name": "Alice", "time_of_day": "morning"}
)

PromptPosthookPayload: Payload for prompt post-fetch hooks.

class PromptPosthookPayload(BaseModel):
    name: str                                    # Prompt name
    result: PromptResult                         # Rendered prompt result

Tool Hooks Details¶

The tool hooks enable plugins to intercept and modify tool invocations:

tool_pre_invoke: Receives the tool name and arguments before execution. Can modify arguments or block the invocation entirely.
tool_post_invoke: Receives the tool result after execution. Can modify the result or block it from being returned.

Example use cases:

PII detection and masking in tool inputs/outputs
Rate limiting specific tools
Audit logging of tool usage
Input validation and sanitization
Output filtering and transformation

Tool Hook Payloads¶

ToolPreInvokePayload: Payload for tool pre-invoke hooks.

class ToolPreInvokePayload(BaseModel):
    name: str                                    # Tool name
    args: Optional[dict[str, Any]] = Field(default_factory=dict)  # Tool arguments
    headers: Optional[HttpHeaderPayload] = None  # HTTP pass-through headers

ToolPostInvokePayload: Payload for tool post-invoke hooks.

class ToolPostInvokePayload(BaseModel):
    name: str                                    # Tool name
    result: Any                                  # Tool execution result

The associated HttpHeaderPayload object for the ToolPreInvokePayload is as follows:

Special payload for HTTP header manipulation.

class HttpHeaderPayload(RootModel[dict[str, str]]):
    # Provides dictionary-like access to HTTP headers
    # Supports: __iter__, __getitem__, __setitem__, __len__

Usage:

headers = HttpHeaderPayload({"Authorization": "Bearer token", "Content-Type": "application/json"})
headers["X-Custom-Header"] = "custom_value"
auth_header = headers["Authorization"]

Resource Hooks Details¶

The resource hooks enable plugins to intercept and modify resource fetching:

resource_pre_fetch: Receives the resource URI and metadata before fetching. Can modify the URI, add metadata, or block the fetch entirely.
resource_post_fetch: Receives the resource content after fetching. Can modify the content, redact sensitive information, or block it from being returned.

Example use cases:

Protocol validation (block non-HTTPS resources)
Domain blocklisting/allowlisting
Content size limiting
Sensitive data redaction
Content transformation and filtering
Resource caching metadata

Resource Hook Payloads¶

ResourcePreFetchPayload: Payload for resource pre-fetch hooks.

class ResourcePreFetchPayload(BaseModel):
    uri: str                                     # Resource URI
    metadata: Optional[dict[str, Any]] = Field(default_factory=dict)  # Request metadata

ResourcePostFetchPayload: Payload for resource post-fetch hooks.

class ResourcePostFetchPayload(BaseModel):
    uri: str                                     # Resource URI
    content: Any                                 # Fetched resource content

HTTP Authentication & Middleware Hooks¶

For HTTP request processing and authentication, see the dedicated guide:

HTTP Authentication Hooks Guide - Complete guide to http_pre_request, http_auth_resolve_user, http_auth_check_permission, and http_post_request hooks

Agent Hooks Details¶

The agent hooks allow plugins to intercept and modify Agent-to-Agent (A2A) interactions:

agent_pre_invoke: Receives agent invocation details before the agent processes the request. Can filter messages, restrict tools, override model settings, or block the request entirely.
agent_post_invoke: Receives the agent's response after processing. Can filter response content, redact sensitive information, or add audit metadata.

Example Use Cases:

Filter offensive or sensitive content in messages
Restrict which tools an agent can access
Override model selection or system prompts
Apply content moderation to agent responses
Log all agent interactions for compliance
Block agents from accessing certain resources

Agent Hook Payloads¶

AgentPreInvokePayload: Payload for agent pre-invoke hooks.

class AgentPreInvokePayload(BaseModel):
    agent_id: str                                    # Agent identifier (can be modified for routing)
    messages: List[Message]                          # Conversation messages (can be filtered/transformed)
    tools: Optional[List[str]] = None                # Available tools (can be restricted)
    headers: Optional[HttpHeaderPayload] = None      # HTTP headers
    model: Optional[str] = None                      # Model override
    system_prompt: Optional[str] = None              # System instructions override
    parameters: Optional[Dict[str, Any]] = None      # LLM parameters (temperature, max_tokens, etc.)

AgentPostInvokePayload: Payload for agent post-invoke hooks.

class AgentPostInvokePayload(BaseModel):
    agent_id: str                                    # Agent identifier
    messages: List[Message]                          # Response messages (can be filtered/transformed)
    tool_calls: Optional[List[Dict[str, Any]]] = None  # Tool invocations made by agent

Example Plugin:

from mcpgateway.plugins.framework import (
    Plugin,
    PluginContext,
    AgentPreInvokePayload,
    AgentPreInvokeResult,
    PluginViolation,
)

class AgentSafetyPlugin(Plugin):
    """Filter agent interactions for safety."""

    async def agent_pre_invoke(
        self,
        payload: AgentPreInvokePayload,
        context: PluginContext
    ) -> AgentPreInvokeResult:
        # Restrict dangerous tools
        if payload.tools:
            safe_tools = [t for t in payload.tools if t not in ["file_delete", "system_exec"]]
            if len(safe_tools) < len(payload.tools):
                payload.tools = safe_tools
                self.logger.info(f"Restricted tools for agent {payload.agent_id}")

        # Filter offensive content in messages
        for msg in payload.messages:
            if self._contains_offensive_content(msg.content):
                return AgentPreInvokeResult(
                    continue_processing=False,
                    violation=PluginViolation(
                        code="OFFENSIVE_CONTENT",
                        reason="Message contains offensive content",
                        description="Agent request blocked due to policy violation"
                    )
                )

        return AgentPreInvokeResult(
            modified_payload=payload,
            metadata={"safety_checked": True}
        )

Planned Hooks (Roadmap)¶

server_pre_register / server_post_register - Server validation
federation_pre_sync / federation_post_sync - Gateway federation

Writing Plugins¶

Understanding the Plugin Base Class¶

The Plugin class is an abstract base class (ABC) that provides the foundation for all plugins. You must subclass it and implement at least one hook method to create a functional plugin.

from abc import ABC
from mcpgateway.plugins.framework import Plugin, PluginConfig

class MyPlugin(Plugin):
    """Your plugin must inherit from Plugin."""

    def __init__(self, config: PluginConfig):
        super().__init__(config)
        # Initialize plugin-specific configuration
        self.my_setting = config.config.get("my_setting", "default")

Key Design Principle

Plugins implement only the hooks they need using one of three registration patterns. You don't need to implement all hooks - just the ones relevant to your plugin's purpose.

Three Hook Registration Patterns¶

Pattern 1: Convention-Based (Recommended)¶

The simplest approach - just name your method to match the hook type:

from mcpgateway.plugins.framework import (
    Plugin,
    PluginContext,
    ToolPreInvokePayload,
    ToolPreInvokeResult,
)

class ContentFilterPlugin(Plugin):
    """Convention-based hook - method name matches hook type."""

    async def tool_pre_invoke(
        self,
        payload: ToolPreInvokePayload,
        context: PluginContext
    ) -> ToolPreInvokeResult:
        """This hook is automatically discovered by its name."""

        # Block dangerous operations
        if payload.name == "file_delete" and "system" in str(payload.args):
            from mcpgateway.plugins.framework import PluginViolation
            return ToolPreInvokeResult(
                continue_processing=False,
                violation=PluginViolation(
                    code="DANGEROUS_OP",
                    reason="Dangerous operation blocked",
                    description=f"Cannot delete system files"
                )
            )

        # Modify arguments
        modified_args = {**payload.args, "processed": True}
        modified_payload = ToolPreInvokePayload(
            name=payload.name,
            args=modified_args,
            headers=payload.headers
        )

        return ToolPreInvokeResult(
            modified_payload=modified_payload,
            metadata={"processed_by": self.name}
        )

When to use: Default choice for implementing standard framework hooks.

Pattern 2: Decorator-Based (Custom Method Names)¶

Use the @hook decorator to register a hook with a custom method name:

from mcpgateway.plugins.framework import Plugin, PluginContext
from mcpgateway.plugins.framework.decorator import hook
from mcpgateway.plugins.framework import (
    ToolHookType,
    ToolPostInvokePayload,
    ToolPostInvokeResult,
)

class AuditPlugin(Plugin):
    """Decorator-based hook with descriptive method name."""

    @hook(ToolHookType.TOOL_POST_INVOKE)
    async def audit_tool_execution(
        self,
        payload: ToolPostInvokePayload,
        context: PluginContext
    ) -> ToolPostInvokeResult:
        """Method name doesn't match hook type, but @hook decorator registers it."""

        # Log tool execution
        self.logger.info(f"Tool executed: {payload.name}")

        # Filter sensitive data from results
        if isinstance(payload.result, dict) and "password" in payload.result:
            filtered_result = {**payload.result, "password": "[REDACTED]"}
            modified_payload = ToolPostInvokePayload(
                name=payload.name,
                result=filtered_result
            )
            return ToolPostInvokeResult(modified_payload=modified_payload)

        return ToolPostInvokeResult(continue_processing=True)

When to use: When you want descriptive method names that better match your plugin's purpose.

Pattern 3: Custom Hooks (Advanced)¶

Register completely new hook types with custom payload and result types:

from mcpgateway.plugins.framework import (
    Plugin,
    PluginContext,
    PluginPayload,
    PluginResult
)
from mcpgateway.plugins.framework.decorator import hook

# Define custom payload type
class EmailPayload(PluginPayload):
    recipient: str
    subject: str
    body: str

# Define custom result type
class EmailResult(PluginResult[EmailPayload]):
    pass

class EmailPlugin(Plugin):
    """Custom hook with new hook type."""

    @hook("email_pre_send", EmailPayload, EmailResult)
    async def validate_email(
        self,
        payload: EmailPayload,
        context: PluginContext
    ) -> EmailResult:
        """Completely new hook type: 'email_pre_send'"""

        # Validate email address
        if "@" not in payload.recipient:
            modified_payload = EmailPayload(
                recipient=f"{payload.recipient}@example.com",
                subject=payload.subject,
                body=payload.body
            )
            return EmailResult(
                modified_payload=modified_payload,
                metadata={"fixed_email": True}
            )

        return EmailResult(continue_processing=True)

When to use: When extending the framework with domain-specific hook points not covered by standard hooks.

Hook Method Signature Requirements¶

All hook methods must follow these rules:

Must be async: All hooks are asynchronous
Three parameters: self, payload, context
Type hints required: Payload and result types must be properly typed for validation
Return appropriate result type: Each hook returns a PluginResult typed with the hook's payload type

async def hook_name(
    self,
    payload: PayloadType,           # Specific to the hook (e.g., ToolPreInvokePayload)
    context: PluginContext          # Always PluginContext
) -> PluginResult[PayloadType]:     # PluginResult parameterized by payload type
    """Hook implementation."""
    pass

Understanding Result Types:

Each hook has a corresponding result type that is actually a type alias for PluginResult[PayloadType]:

# These are type aliases defined in the framework
ToolPreInvokeResult = PluginResult[ToolPreInvokePayload]
ToolPostInvokeResult = PluginResult[ToolPostInvokePayload]
PromptPrehookResult = PluginResult[PromptPrehookPayload]
HttpAuthResolveUserResult = PluginResult[dict]  # Special case for user dict
# ... and so on for each hook type

This means when you return a result, you're returning a PluginResult instance:

# All of these are valid ways to construct results:
return ToolPreInvokeResult(continue_processing=True)
return ToolPreInvokeResult(modified_payload=new_payload)
return ToolPreInvokeResult(
    modified_payload=new_payload,
    metadata={"processed": True}
)

Plugin Lifecycle Methods¶

Plugins can implement optional lifecycle methods:

class MyPlugin(Plugin):
    async def initialize(self):
        """Called when plugin is loaded."""
        # Set up resources, connections, etc.
        self._session = aiohttp.ClientSession()

    async def shutdown(self):
        """Called when plugin manager shuts down."""
        # Cleanup resources
        if hasattr(self, '_session'):
            await self._session.close()

Plugin Context and State¶

Each hook function has a context object of type PluginContext which is designed to allow plugins to pass state between one another (across pre/post hook pairs) or for a plugin to pass state information to itself across pre/post hook pairs. The plugin context looks as follows:

class GlobalContext(BaseModel):
    """The global context, which shared across all plugins.

    Attributes:
            request_id (str): ID of the HTTP request.
            user (str): user ID associated with the request.
            tenant_id (str): tenant ID.
            server_id (str): server ID.
            metadata (Optional[dict[str,Any]]): a global shared metadata across plugins (Read-only from plugin's perspective.).
            state (Optional[dict[str,Any]]): a global shared state across plugins.
    """

    request_id: str
    user: Optional[str] = None
    tenant_id: Optional[str] = None
    server_id: Optional[str] = None
    state: dict[str, Any] = Field(default_factory=dict)
    metadata: dict[str, Any] = Field(default_factory=dict)


class PluginContext(BaseModel):
    """The plugin's context, which lasts a request lifecycle.

    Attributes:
       state:  the inmemory state of the request.
       global_context: the context that is shared across plugins.
       metadata: plugin meta data.
    """

    state: dict[str, Any] = Field(default_factory=dict)
    global_context: GlobalContext
    metadata: dict[str, Any] = Field(default_factory=dict)

As can be seen, the PluginContext has both a state dictionary and a global_context object that also has a state dictionary. A single plugin can share state between pre/post hook pairs by using the the PluginContext state dictionary. It can share state with other plugins using the context.global_context.state dictionary. Metadata for the specific hook site is passed in through the metadata dictionaries in the context.global_context.metadata. It is meant to be read-only. The context.metadata is plugin specific metadata and can be used to store metadata information such as timing information.

The following shows how plugins can maintain state between pre/post hooks:

async def prompt_pre_fetch(self, payload, context):
    # Store state for later use
    context.set_state("request_time", time.time())
    context.set_state("original_args", payload.args.copy())

    return PromptPrehookResult()

async def prompt_post_fetch(self, payload, context):
    # Retrieve state from pre-hook
    elapsed = time.time() - context.get_state("request_time", 0)
    original = context.get_state("original_args", {})

    # Add timing metadata
    context.metadata["processing_time_ms"] = elapsed * 1000

    return PromptPosthookResult()

Tool and Gateway Metadata¶

Currently, the tool pre/post hooks have access to tool and gateway metadata through the global context metadata dictionary. They are accessible as follows:

It can be accessed inside of the tool hooks through:

from mcpgateway.plugins.framework.constants import GATEWAY_METADATA, TOOL_METADATA

tool_meta = context.global_context.metadata[TOOL_METADATA]
assert tool_meta.original_name == "test_tool"
assert tool_meta.url.host == "example.com"
assert tool_meta.integration_type == "REST" or tool_meta.integration_type == "MCP"

Note, if the integration type is MCP the gateway information may also be available as follows.

gateway_meta = context.global_context.metadata[GATEWAY_METADATA]
assert gateway_meta.name == "test_gateway"
assert gateway_meta.transport == "sse"
assert gateway_meta.url.host == "example.com"

Metadata for other entities such as prompts and resources will be added in future versions of the gateway.

External Service Plugin Example¶

class LLMGuardPlugin(Plugin):
    """Example external service integration."""

    def __init__(self, config: PluginConfig):
        super().__init__(config)
        self.service_url = config.config.get("service_url")
        self.api_key = config.config.get("api_key")
        self.timeout = config.config.get("timeout", 30)

    async def prompt_pre_fetch(self, payload, context):
        # Call external service
        async with httpx.AsyncClient() as client:
            try:
                response = await client.post(
                    f"{self.service_url}/analyze",
                    json={
                        "text": str(payload.args),
                        "policy": "strict"
                    },
                    headers={
                        "Authorization": f"Bearer {self.api_key}"
                    },
                    timeout=self.timeout
                )

                result = response.json()

                if result.get("blocked", False):
                    return PromptPrehookResult(
                        continue_processing=False,
                        violation=PluginViolation(
                            reason="External service blocked",
                            description=result.get("reason", "Content blocked"),
                            code="LLMGUARD_BLOCKED",
                            details=result
                        )
                    )

            except Exception as e:
                # Handle errors based on plugin settings
                if self.config.mode == PluginMode.ENFORCE:
                    return PromptPrehookResult(
                        continue_processing=False,
                        violation=PluginViolation(
                            reason="Service error",
                            description=f"Service error: {str(e)}",
                            code="SERVICE_ERROR",
                            details={"error": str(e)}
                        )
                    )

        return PromptPrehookResult()

Plugin Development Guide¶

1. Create Plugin Directory¶

mkdir -p plugins/my_plugin
touch plugins/my_plugin/__init__.py
touch plugins/my_plugin/plugin.py
touch plugins/my_plugin/plugin-manifest.yaml

2. Write Plugin Manifest¶

# plugins/my_plugin/plugin-manifest.yaml
description: "My custom plugin for X"
author: "Your Name"
version: "1.0.0"
tags: ["custom", "filter"]
available_hooks:

  - "prompt_pre_fetch"
  - "prompt_post_fetch"
default_config:
  setting_one: "default_value"
  setting_two: 123

3. Implement Plugin Class¶

# plugins/my_plugin/plugin.py
from mcpgateway.plugins.framework import Plugin

class MyPlugin(Plugin):
    # Implementation here
    pass

4. Register in Configuration¶

# plugins/config.yaml
plugins:

  - name: "MyCustomPlugin"
    kind: "plugins.my_plugin.plugin.MyPlugin"
    hooks: ["prompt_pre_fetch"]
    # ... other configuration

5. Test Your Plugin¶

# tests/test_my_plugin.py
import pytest
from plugins.my_plugin.plugin import MyPlugin
from mcpgateway.plugins.framework import PluginConfig

@pytest.mark.asyncio
async def test_my_plugin():
    config = PluginConfig(
        name="test",
        kind="plugins.my_plugin.plugin.MyPlugin",
        hooks=["prompt_pre_fetch"],
        config={"setting_one": "test_value"}
    )

    plugin = MyPlugin(config)

    # Test your plugin logic
    result = await plugin.prompt_pre_fetch(payload, context)
    assert result.continue_processing

Best Practices¶

1. Error Handling¶

Errors inside a plugin should be raised as exceptions. The plugin manager will catch the error, and its behavior depends on both the gateway's and plugin's configuration as follows:

if plugin_settings.fail_on_plugin_error in the plugin config.yaml is set to true the exception is bubbled up as a PluginError and the error is passed to the client of the MCP Context Forge regardless of the plugin mode.
if plugin_settings.fail_on_plugin_error is set to false the error is handled based off of the plugin mode in the plugin's config as follows:
if mode is enforce, both violations and errors are bubbled up as exceptions and the execution is blocked.
if mode is enforce_ignore_error, violations are bubbled up as exceptions and execution is blocked, but errors are logged and execution continues.
if mode is permissive, execution is allowed to proceed whether there are errors or violations. Both are logged.

2. Performance Considerations¶

Keep plugin operations lightweight
Use caching for expensive operations
Respect the configured timeout
Consider async operations for I/O

class CachedPlugin(Plugin):
    def __init__(self, config):
        super().__init__(config)
        self._cache = {}
        self._cache_ttl = config.config.get("cache_ttl", 300)

    async def expensive_operation(self, key):
        # Check cache first
        if key in self._cache:
            cached_value, timestamp = self._cache[key]
            if time.time() - timestamp < self._cache_ttl:
                return cached_value

        # Perform expensive operation
        result = await self._do_expensive_work(key)

        # Cache result
        self._cache[key] = (result, time.time())
        return result

3. Conditional Execution¶

Use conditions to limit plugin scope:

conditions:

  - prompts: ["sensitive_prompt"]
    server_ids: ["prod-server-1", "prod-server-2"]
    tenant_ids: ["enterprise-tenant"]
    user_patterns: ["admin-*", "support-*"]

4. Logging and Monitoring¶

Use appropriate log levels:

logger.debug(f"Plugin {self.name} processing prompt: {payload.name}")
logger.info(f"Plugin {self.name} blocked request: {violation.code}")
logger.warning(f"Plugin {self.name} timeout approaching")
logger.error(f"Plugin {self.name} failed: {error}")

API Reference¶

Plugin management endpoints are not exposed in the gateway at this time.

Troubleshooting¶

Plugin Not Loading¶

Check server logs for initialization errors
Verify plugin class path in configuration
Ensure all dependencies are installed
Check Python import path includes plugin directory

Plugin Not Executing¶

Verify plugin is enabled (mode != "disabled")
Check conditions match your request
Review priority ordering
Enable debug logging to see execution flow

Performance Issues¶

Monitor plugin execution time in logs
Check for blocking I/O operations
Review timeout settings
Consider caching expensive operations

Production Deployment Examples¶

Enterprise AI Safety Pipeline¶

# Production-grade AI safety configuration
plugins:
  # Step 1: PII Detection and Masking (Highest Priority)
  - name: "PIIFilter"
    kind: "plugins.pii_filter.pii_filter.PIIFilterPlugin"
    hooks: ["prompt_pre_fetch", "prompt_post_fetch", "tool_pre_invoke", "tool_post_invoke"]
    mode: "enforce"
    priority: 10
    config:
      detect_ssn: true
      detect_credit_card: true
      detect_email: true
      mask_strategy: "partial"
      block_on_detection: false

  # Step 2: External AI Safety Service (LlamaGuard)
  - name: "LlamaGuardSafety"
    kind: "external"
    hooks: ["prompt_pre_fetch", "tool_pre_invoke"]
    mode: "enforce"
    priority: 20
    mcp:
      proto: STREAMABLEHTTP
      url: "https://ai-safety.internal.corp/llamaguard/v1"
    conditions:

      - server_ids: ["production-chat", "customer-support"]

  # Step 3: OpenAI Moderation for Final Check
  - name: "OpenAIMod"
    kind: "external"
    hooks: ["prompt_post_fetch", "tool_post_invoke"]
    mode: "permissive"  # Log violations but don't block
    priority: 30
    mcp:
      proto: STREAMABLEHTTP
      url: "https://api.openai.com/v1/moderations"

  # Step 4: Audit Logging (Lowest Priority)
  - name: "AuditLogger"
    kind: "plugins.audit.audit_logger.AuditLoggerPlugin"
    hooks: ["prompt_pre_fetch", "tool_pre_invoke", "tool_post_invoke"]
    mode: "permissive"
    priority: 100
    config:
      log_level: "INFO"
      include_payloads: false  # For privacy
      audit_endpoints: ["https://audit.internal.corp/api/v1/logs"]

Multi-Tenant Security Configuration¶

plugins:
  # Enterprise tenant gets strict filtering
  - name: "EnterpriseSecurityFilter"
    kind: "plugins.security.enterprise_filter.EnterpriseFilterPlugin"
    hooks: ["prompt_pre_fetch", "tool_pre_invoke"]
    mode: "enforce"
    priority: 50
    conditions:

      - tenant_ids: ["enterprise-corp", "banking-client"]
        tools: ["database-query", "file-access", "system-command"]
    config:
      sql_injection_protection: true
      command_injection_protection: true
      file_system_restrictions: true

  # Free tier gets basic content filtering
  - name: "BasicContentFilter"
    kind: "plugins.content.basic_filter.BasicFilterPlugin"
    hooks: ["prompt_pre_fetch", "prompt_post_fetch"]
    mode: "permissive"
    priority: 75
    conditions:

      - tenant_ids: ["free-tier"]
    config:
      profanity_filter: true
      spam_detection: true
      rate_limit_warnings: true

Development vs Production Configurations¶

# Development Environment
plugins:

  - name: "DevPIIFilter"
    kind: "plugins.pii_filter.pii_filter.PIIFilterPlugin"
    hooks: ["prompt_pre_fetch", "tool_pre_invoke"]
    mode: "permissive"  # Don't block in dev
    priority: 50
    config:
      detect_ssn: true
      log_detections: true
      mask_strategy: "partial"
      whitelist_patterns:

        - "test@example.com"
        - "555-555-5555"
        - "123-45-6789"  # Test SSN

# Production Environment
plugins:

  - name: "ProdPIIFilter"
    kind: "plugins.pii_filter.pii_filter.PIIFilterPlugin"
    hooks: ["prompt_pre_fetch", "prompt_post_fetch", "tool_pre_invoke", "tool_post_invoke"]
    mode: "enforce"  # Block in production
    priority: 10
    config:
      detect_ssn: true
      detect_credit_card: true
      detect_phone: true
      detect_email: true
      detect_api_keys: true
      block_on_detection: true
      audit_detections: true
      compliance_mode: "strict"

Performance and Scalability¶

Benchmark Results¶

Native Plugins: <1ms latency overhead per hook
External Service Plugins: 10-100ms depending on service (cached responses: <5ms)
Memory Usage: ~5MB base overhead + ~1MB per active plugin
Throughput: Tested to 1,000+ req/s with 5 active plugins

Performance Optimization Tips¶

# Optimize plugin configuration for high-throughput environments
plugin_settings:
  plugin_timeout: 5000  # 5 second timeout for external services
  parallel_execution_within_band: true  # Enable when available
  fail_on_plugin_error: false  # Continue processing on plugin failures

plugins:

  - name: "CachedAIService"
    kind: "external"
    priority: 50
    config:
      cache_ttl_seconds: 300  # Cache responses for 5 minutes
      cache_max_entries: 10000  # LRU cache with 10K entries
      timeout_ms: 2000  # Fast timeout for high-throughput
      retry_attempts: 1  # Single retry only

Monitoring and Observability¶

General observability guidance:

Emit structured logs at appropriate levels (debug/info/warn/error)
Track plugin execution time in logs where useful
Use external APM/logging stacks for end‑to‑end tracing if needed

Security Considerations¶

Plugin Isolation and Security¶

Input Validation: Plugin configurations validated with Pydantic models
Timeout Protection: Configurable timeouts prevent plugin hangs
Payload Limits: Payload size guards (~1MB) prevent resource exhaustion
Error Isolation: Plugin failures don't affect gateway stability
Audit Logging: Log plugin executions and violations

External Plugin Security¶

Secure external plugin servers as you would any service (authentication, TLS). The gateway's external plugin client communicates over MCP (STDIO or Streamable HTTP).

Future Roadmap¶

Near‑term Enhancements¶

Server Attestation Hooks: server_pre_register (TPM/TEE)
Authentication Hooks: auth_pre_check/auth_post_check
Admin UI: Visual plugin management and monitoring dashboard
Hot Configuration Reload: Update plugin configs without restart
Advanced Caching: Redis-backed caching for external service calls

Long-term Vision (v0.9.0+)¶

Plugin Marketplace: Community plugin sharing and discovery
Advanced Analytics: Plugin performance analytics and optimization recommendations
A/B Testing Framework: Split traffic between plugin configurations
Machine Learning Pipeline: Built-in support for custom ML model deployment

Contributing¶

To contribute a plugin:

Follow the plugin structure guidelines
Include comprehensive tests
Document configuration options
Submit a pull request with examples

For framework improvements, please open an issue to discuss proposed changes.