Install with AgentGateway
This guide provides step-by-step instructions for integrating the vLLM Semantic Router with AgentGateway on Kubernetes. AgentGateway acts as the Gateway API data plane for OpenAI-compatible traffic, and vLLM Semantic Router runs as an Envoy ExtProc server that classifies each request and mutates the request body before AgentGateway forwards it to vLLM.
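The classify-then-mutate step described above can be illustrated with a minimal sketch. This is not the router's actual implementation: the keyword classifier, the category names, the model names, and the choice to rewrite the `model` field are all assumptions made for illustration. It only shows the general shape of an ExtProc-style body mutation on an OpenAI-compatible chat request.

```python
import json

# Hypothetical category -> model mapping; these names are illustrative only.
CATEGORY_TO_MODEL = {
    "math": "example-math-model",
    "code": "example-code-model",
}
DEFAULT_MODEL = "example-general-model"

# Toy keyword lists standing in for a real classifier (assumption).
KEYWORDS = {
    "math": ("integral", "equation", "solve"),
    "code": ("python", "function", "compile"),
}


def classify(prompt: str) -> str:
    """Return the first category whose keywords appear in the prompt."""
    text = prompt.lower()
    for category, words in KEYWORDS.items():
        if any(word in text for word in words):
            return category
    return "general"


def mutate_request_body(raw_body: bytes) -> bytes:
    """Classify an OpenAI-compatible chat request and rewrite its model field."""
    body = json.loads(raw_body)
    messages = body.get("messages", [])
    # Only plain-string user messages are considered in this sketch.
    user_text = " ".join(
        m["content"]
        for m in messages
        if m.get("role") == "user" and isinstance(m.get("content"), str)
    )
    category = classify(user_text)
    body["model"] = CATEGORY_TO_MODEL.get(category, DEFAULT_MODEL)
    return json.dumps(body).encode("utf-8")
```

In a deployment like the one this guide describes, the gateway would hand the request body to the external processor and forward the returned, mutated body upstream. The sketch leaves out that transport and keeps only the body transformation.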