Open Source: #serving

Catalog projects marked with #serving. Tags work as dedicated landing pages, so related tools are easier to find and connect.

Filters

Reset
Languages

Repositories

Found: 1
vllm-project/vllm
vLLM

vLLM is a high-performance engine for LLM inference and serving with an OpenAI-compatible API, batching, and efficient memory management.

Stars 82,414 Forks 17,884 Author vllm-project Language Python License Apache-2.0
Open