Design Doc: Multi-Protocol Adapter Architecture
Author: vLLM Semantic Router Team
Status: To be Implemented
Created: February 2026
Last Updated: February 2026
Overview
This document describes the design and implementation of the multi-protocol adapter architecture for vLLM Semantic Router, which abstracts the API layer to support multiple front-end protocols beyond Envoy ExtProc.
Background
The Semantic Router was tightly coupled to Envoy's External Processor (ExtProc) protocol via gRPC. While this provides powerful integration with Envoy, it created barriers for users who:
- Want to use the router without deploying Envoy
- Prefer direct HTTP/REST API integration
- Use Nginx or other reverse proxies
- Need simpler deployment architectures for development or testing
Motivation
- Flexibility: Users need direct HTTP API access without requiring Envoy infrastructure
- Testing: Developers need lightweight testing without full Envoy deployment
- Extensibility: Support for nginx, native gRPC, and custom protocols
- Reusability: Single routing engine shared across all protocols
- Deployment Options: Enable serverless, edge, and simplified deployment scenarios