Skip to main content
vLLM Logo

AI-Powered vLLM Semantic Router

System Level Intelligent Router for Mixture-of-Models🧠🧬Neural NetworksLLM Routing♻️Per-token Unit Economics

Terminal

🧠 Neural Processing Architecture

Powered by cutting-edge AI technologies including Encoder Only Models, SLMs and LLMs, and advanced semantic understanding for intelligent model routing and selection.

🤖Small Language Models
🧬Neural Network Processing
Real-time Inference
🎯Semantic Understanding
AIMLNNLLM
Neural Processing UnitEmbedding • Classify • Similarity

🏗️ Architecture

Architecture

🎯 Our Goals

Building the System Level Intelligence for Mixture-of-Models (MoM), bringing Collective Intelligence into LLM systems

vLLM Semantic Router Banner
1
How to capture the missing signals in request, response and context?
2
How to combine the signals to make better decisions?
3
How to collaborate more efficiently between different models?
4
How to secure the real world and LLM system from jailbreaks, pii leaks, hallucinations?
5
How to collect the valuable signals and build a self-learning system?

📍 Where it lives

It lives between the real world and models

Where vLLM Semantic Router Lives

👥 Meet Our Team

The amazing people behind vLLM Semantic Router

Huamin ChenMaintainer

Huamin Chen

Distinguished Engineer @Red Hat

Chen WangMaintainer

Chen Wang

Senior Staff Research Scientist @IBM

Yue ZhuMaintainer

Yue Zhu

Staff Research Scientist @IBM

Xunzhuo LiuMaintainer

Xunzhuo Liu

AI Networking @Tencent

Senan ZedanCommitter

Senan Zedan

R&D Manager @Red Hat

samzongCommitter

samzong

AI Infrastructure / Cloud-Native PM @DaoCloud

Liav WeissCommitter

Liav Weiss

Software Engineer @Red Hat

Asaad BalumCommitter

Asaad Balum

Senior Software Engineer @Red Hat

YehuditCommitter

Yehudit

Software Engineer @Red Hat

Noa LimoyCommitter

Noa Limoy

Software Engineer @Red Hat

JaredforRealCommitter

JaredforReal

Software Engineer @Z.ai

Srinivas ACommitter

Srinivas A

Software Engineer @Yokogawa

carloryCommitter

carlory

Open Source Engineer @DaoCloud

Yossi OvadiaCommitter

Yossi Ovadia

Senior Principal Engineer @Red Hat

Jintao ZhangCommitter

Jintao Zhang

Senior Software Engineer @Kong

yuluo-yxCommitter

yuluo-yx

Individual Contributor

cryo-zdCommitter

cryo-zd

Individual Contributor

OneZero-YCommitter

OneZero-Y

Individual Contributor

aeftCommitter

aeft

Individual Contributor

Huamin ChenMaintainer

Huamin Chen

Distinguished Engineer @Red Hat

Chen WangMaintainer

Chen Wang

Senior Staff Research Scientist @IBM

Yue ZhuMaintainer

Yue Zhu

Staff Research Scientist @IBM

Xunzhuo LiuMaintainer

Xunzhuo Liu

AI Networking @Tencent

Senan ZedanCommitter

Senan Zedan

R&D Manager @Red Hat

samzongCommitter

samzong

AI Infrastructure / Cloud-Native PM @DaoCloud

Liav WeissCommitter

Liav Weiss

Software Engineer @Red Hat

Asaad BalumCommitter

Asaad Balum

Senior Software Engineer @Red Hat

YehuditCommitter

Yehudit

Software Engineer @Red Hat

Noa LimoyCommitter

Noa Limoy

Software Engineer @Red Hat

JaredforRealCommitter

JaredforReal

Software Engineer @Z.ai

Srinivas ACommitter

Srinivas A

Software Engineer @Yokogawa

carloryCommitter

carlory

Open Source Engineer @DaoCloud

Yossi OvadiaCommitter

Yossi Ovadia

Senior Principal Engineer @Red Hat

Jintao ZhangCommitter

Jintao Zhang

Senior Software Engineer @Kong

yuluo-yxCommitter

yuluo-yx

Individual Contributor

cryo-zdCommitter

cryo-zd

Individual Contributor

OneZero-YCommitter

OneZero-Y

Individual Contributor

aeftCommitter

aeft

Individual Contributor

Huamin ChenMaintainer

Huamin Chen

Distinguished Engineer @Red Hat

Chen WangMaintainer

Chen Wang

Senior Staff Research Scientist @IBM

Yue ZhuMaintainer

Yue Zhu

Staff Research Scientist @IBM

Xunzhuo LiuMaintainer

Xunzhuo Liu

AI Networking @Tencent

Senan ZedanCommitter

Senan Zedan

R&D Manager @Red Hat

samzongCommitter

samzong

AI Infrastructure / Cloud-Native PM @DaoCloud

Liav WeissCommitter

Liav Weiss

Software Engineer @Red Hat

Asaad BalumCommitter

Asaad Balum

Senior Software Engineer @Red Hat

YehuditCommitter

Yehudit

Software Engineer @Red Hat

Noa LimoyCommitter

Noa Limoy

Software Engineer @Red Hat

JaredforRealCommitter

JaredforReal

Software Engineer @Z.ai

Srinivas ACommitter

Srinivas A

Software Engineer @Yokogawa

carloryCommitter

carlory

Open Source Engineer @DaoCloud

Yossi OvadiaCommitter

Yossi Ovadia

Senior Principal Engineer @Red Hat

Jintao ZhangCommitter

Jintao Zhang

Senior Software Engineer @Kong

yuluo-yxCommitter

yuluo-yx

Individual Contributor

cryo-zdCommitter

cryo-zd

Individual Contributor

OneZero-YCommitter

OneZero-Y

Individual Contributor

aeftCommitter

aeft

Individual Contributor

Acknowledgements

vLLM Semantic Router is born in open source and built on open source ❤️