Welcome to Happy_vLLM
Happy_vLLM is a production-ready REST API. It is based on the popular library vLLM and provides an API for it.
Installation
You can install happy_vLLM using pip, or build it from source.
Quickstart
Just use the entrypoint happy-vllm (see arguments for a list of all possible arguments).
It will launch the API, which you can then query directly, for example to get various information on the application or to generate your first LLM response using happy_vLLM. See endpoints for more details on all the endpoints provided by happy_vLLM.
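As a rough illustration of the two queries mentioned above, here is a minimal Python sketch using only the standard library. The base address 127.0.0.1:5000 is the default given in this README; the paths /v1/info and /v1/completions, the payload fields, and the model name my_model are assumptions made for the example, so check the endpoints documentation for the actual routes and parameters.

```python
import json
import urllib.request

# Default address mentioned in this README; adjust if you launched the API elsewhere.
BASE_URL = "http://127.0.0.1:5000"

# ASSUMPTION: these paths are illustrative and not confirmed by this README.
INFO_PATH = "/v1/info"
COMPLETIONS_PATH = "/v1/completions"


def build_info_request(base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a GET request asking for general information on the application."""
    return urllib.request.Request(base_url + INFO_PATH, method="GET")


def build_completion_request(
    prompt: str, model: str, base_url: str = BASE_URL
) -> urllib.request.Request:
    """Build a POST request asking the model to complete `prompt`.

    ASSUMPTION: the payload fields ("prompt", "model") are hypothetical here.
    """
    payload = json.dumps({"prompt": prompt, "model": model}).encode("utf-8")
    return urllib.request.Request(
        base_url + COMPLETIONS_PATH,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send a prepared request and decode the JSON body of the response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the API running, send(build_info_request()) would return the application information, and send(build_completion_request("Hey,", "my_model")) would return a generated response. Building the request separately from sending it makes it easy to inspect the URL and payload before anything goes over the network.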
Deploy with Docker image
A Docker image is available from the GitHub Container Registry.
See deploying_with_docker for more details on how to serve happy_vLLM with Docker.
Swagger
You can reach the Swagger UI at the /docs endpoint (by default, at 127.0.0.1:5000/docs). It lists all the endpoints, with examples of how to use them.
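If you would rather list the endpoints from a script than from the browser, something like the sketch below may help. It assumes the API also serves a machine-readable OpenAPI document at /openapi.json, which is a common companion to a Swagger UI but is not stated in this README; the helper names are hypothetical.

```python
import json
import urllib.request

# Default address mentioned in this README.
BASE_URL = "http://127.0.0.1:5000"

# ASSUMPTION: the OpenAPI document is served at this path; verify on your deployment.
OPENAPI_PATH = "/openapi.json"

# Keys under an OpenAPI path item that denote HTTP operations.
HTTP_METHODS = {"get", "post", "put", "patch", "delete", "head", "options"}


def list_endpoints(schema: dict) -> list[tuple[str, str]]:
    """Return (METHOD, path) pairs from an OpenAPI document, sorted by path then method."""
    endpoints = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            # Skip non-operation keys such as "parameters" or "summary".
            if method.lower() in HTTP_METHODS:
                endpoints.append((method.upper(), path))
    return sorted(endpoints, key=lambda entry: (entry[1], entry[0]))


def fetch_schema(base_url: str = BASE_URL, timeout: float = 10.0) -> dict:
    """Download and decode the OpenAPI document from a running API."""
    with urllib.request.urlopen(base_url + OPENAPI_PATH, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the API running, list_endpoints(fetch_schema()) would give the same list of routes that the Swagger UI displays, provided the assumption about /openapi.json holds.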