基于编码器模型
编码器驱动的智能
专用编码器模型从每个请求中提取语义 — 理解意图、排序相关性、跨模态实时分类内容。
多模态
检测并路由文本、图像和音频输入到合适的模态模型。
Bi-Encoder 嵌入
独立编码查询和候选项为稠密向量,用于相似度搜索和语义缓存。
Cross-Encoder 学习
联合交叉注意力评分查询-候选对,实现高精度重排序。
分类
基于自研 BERT 的领域、越狱、PII 和事实核查的分类器,覆盖多个 signal
全注意力
跨 token 和句子的双向注意力 — 双向完整上下文,非因果掩码。
2DMSE
推理时自适应调整嵌入层数和维度,按需平衡计算量与精度。
MRL
无需重训即可截断嵌入向量到任意维度 — 按请求平衡精度与速度。
🏗️ 架构

🎯 我们的目标
为混合模型(MoM)构建系统级智能,将全 局智能引入 LLM 系统

📍 它的位置
它位于现实世界和模型之间

👥 认识我们的团队
vLLM Semantic Router 背后的优秀成员
维护者Huamin Chen
Distinguished Engineer @Red Hat
维护者Chen Wang
Senior Staff Research Scientist @IBM
维护者Yue Zhu
Staff Research Scientist @IBM
维护者Xunzhuo Liu
Intelligent Routing @vLLM
提交者Senan Zedan
R&D Manager @Red Hat
提交者samzong
AI Infrastructure / Cloud-Native PM @DaoCloud
Liav Weiss
Software Engineer @Red Hat
Asaad Balum
Senior Software Engineer @Red Hat
Yehudit
Software Engineer @Red Hat
Noa Limoy
Software Engineer @Red Hat
提交者JaredforReal
Software Engineer @Z.ai
Srinivas A
Software Engineer @Yokogawa
carlory
Open Source Engineer @DaoCloud
提交者Yossi Ovadia
Senior Principal Engineer @Red Hat
提交者Jintao Zhang
Senior Software Engineer @Kong
提交者yuluo-yx
Individual Contributor
提交者cryo-zd
Individual Contributor
提交者OneZero-Y
Individual Contributor
提交者aeft
Individual Contributor
维护者Huamin Chen
Distinguished Engineer @Red Hat
维护者Chen Wang
Senior Staff Research Scientist @IBM
维护者Yue Zhu
Staff Research Scientist @IBM
维护者Xunzhuo Liu
Intelligent Routing @vLLM
提交者Senan Zedan
R&D Manager @Red Hat
提交者samzong
AI Infrastructure / Cloud-Native PM @DaoCloud
Liav Weiss
Software Engineer @Red Hat
Asaad Balum
Senior Software Engineer @Red Hat
Yehudit
Software Engineer @Red Hat
Noa Limoy
Software Engineer @Red Hat
提交者JaredforReal
Software Engineer @Z.ai
Srinivas A
Software Engineer @Yokogawa
carlory
Open Source Engineer @DaoCloud
提交者Yossi Ovadia
Senior Principal Engineer @Red Hat
提交者Jintao Zhang
Senior Software Engineer @Kong
提交者yuluo-yx
Individual Contributor
提交者cryo-zd
Individual Contributor
提交者OneZero-Y
Individual Contributor
提交者aeft
Individual Contributor
维护者Huamin Chen
Distinguished Engineer @Red Hat
维护者Chen Wang
Senior Staff Research Scientist @IBM
维护者Yue Zhu
Staff Research Scientist @IBM
维护者Xunzhuo Liu
Intelligent Routing @vLLM
提交者Senan Zedan
R&D Manager @Red Hat
提交者samzong
AI Infrastructure / Cloud-Native PM @DaoCloud
Liav Weiss
Software Engineer @Red Hat
Asaad Balum
Senior Software Engineer @Red Hat
Yehudit
Software Engineer @Red Hat
Noa Limoy
Software Engineer @Red Hat
提交者JaredforReal
Software Engineer @Z.ai
Srinivas A
Software Engineer @Yokogawa
carlory
Open Source Engineer @DaoCloud
提交者Yossi Ovadia
Senior Principal Engineer @Red Hat
提交者Jintao Zhang
Senior Software Engineer @Kong
提交者yuluo-yx
Individual Contributor
提交者cryo-zd
Individual Contributor
提交者OneZero-Y
Individual Contributor
提交者aeft
Individual Contributor
致谢
vLLM Semantic Router 诞生于开源,构建于开源 ❤️







