Easy, fast, and cheap LLM serving for everyone.
vLLM is a fast and easy-to-use library for LLM inference and serving.
Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry.
Related content: