跳到主要内容

1 篇博文 含有标签「architecture」

查看所有标签

Signal-Decision Driven Architecture: Reshaping Semantic Routing at Scale

· 阅读需 1 分钟
Xunzhuo Liu
Intelligent Routing @vLLM

The earlier versions of vLLM Semantic Router relied on classification-based routing, a straightforward approach where user queries are classified into one of 14 MMLU domain categories, and then routed to corresponding models. While this worked for basic scenarios, we quickly discovered its limitations when building production AI systems for enterprises.

Synced from official vLLM Blog: Signal-Decision Driven Architecture: Reshaping Semantic Routing at Scale

banner