Real-time webcam demo using SmolVLM with a vLLM backend.
The app captures webcam frames in the browser, sends them together with a text prompt to a local vLLM server via an OpenAI-compatible API, and displays the model's vision-language response.
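For reference, each frame is sent as a standard OpenAI-style chat completion request with the image embedded as a base64 data URL. A minimal sketch with `curl`, assuming the server is on vLLM's default port 8000 (the exact prompt and parameters used by the app may differ):

```bash
# Send one webcam frame plus a text prompt to the vLLM server.
# <BASE64_JPEG> is a placeholder for the base64-encoded JPEG bytes.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "HuggingFaceTB/SmolVLM-500M-Instruct",
    "messages": [{
      "role": "user",
      "content": [
        {"type": "text", "text": "What do you see?"},
        {"type": "image_url",
         "image_url": {"url": "data:image/jpeg;base64,<BASE64_JPEG>"}}
      ]
    }],
    "max_tokens": 100
  }'
```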
Follow these steps to get the real-time SmolVLM webcam demo running with a local vLLM server:
```bash
git clone https://github.com/yourusername/smolvlm-realtime-webcam-vllm.git
cd smolvlm-realtime-webcam-vllm
```

```bash
# (Recommended) Create a new conda environment.
conda create -n vllm python=3.12 -y
conda activate vllm

# Install vLLM
pip install vllm
```
Or see the full installation instructions in the vLLM documentation.
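To confirm the install succeeded before starting the server, a quick check (not part of the repo, just a convenience):

```bash
# Print the installed vLLM version; an ImportError here means the
# installation did not complete.
python -c "import vllm; print(vllm.__version__)"
```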
Run the vLLM server using the provided shell script:
```bash
bash vllm_backend.sh
```
Default model: `HuggingFaceTB/SmolVLM-500M-Instruct`

Tested models:
- `HuggingFaceTB/SmolVLM-500M-Instruct`
- `HuggingFaceTB/SmolVLM-256M-Instruct`
- `HuggingFaceTB/SmolVLM-Instruct`
If you want to use a different model, pass the model name as an argument:
```bash
bash vllm_backend.sh your-org/your-model-name
```
ℹ️ Note: Make sure the model is compatible with vLLM and supports the OpenAI Chat API format. If no argument is provided, the script automatically falls back to the default model.
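The script presumably wraps vLLM's `vllm serve` CLI. A minimal sketch of the equivalent command under that assumption (the actual vllm_backend.sh may set additional flags, such as port or memory limits):

```bash
# Serve the model given as the first argument, falling back to the
# default SmolVLM checkpoint when none is provided.
MODEL="${1:-HuggingFaceTB/SmolVLM-500M-Instruct}"
vllm serve "$MODEL"
```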
Open `index.html` in a browser and allow webcam access. Done!
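If your browser refuses webcam access from a `file://` URL, serving the page over localhost usually resolves it (Python's built-in server; the port is arbitrary):

```bash
# Serve the repo directory over HTTP, then open
# http://localhost:8080/index.html in the browser.
python -m http.server 8080
```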
Tested on an RTX 4090.
- Inspired by the http://github.com/ngxson/smolvlm-realtime-webcam repository.