smolvlm
Here are 13 public repositories matching this topic...
Real-time webcam demo using SmolVLM with vLLM backend
-
Updated
May 15, 2025 - HTML
This repository contains the implementation of AlignVLM paper, which proposes a novel method for vision language alignment
-
Updated
May 23, 2025 - Python
A small VLM that sees everything
-
Updated
Jun 2, 2025 - HTML
🎭 Real-time voice-controlled 3D avatar with multimodal AI - speak naturally and watch your AI companion respond with perfect lip-sync
-
Updated
Jul 5, 2025 - TypeScript
Scripts for combining SmolVLM and LLM
-
Updated
May 15, 2025 - Python
Real-time vision demo using SmolVLM with llama.cpp backend
-
Updated
Aug 29, 2025 - HTML
A simple web application for real-time AI vision analysis using SmolVLM-500M-Instruct with live camera feed processing and text-to-speech.
-
Updated
Jun 30, 2025 - JavaScript
This blog post introduces SmolVLM, a 2B VLM, SOTA for its memory footprint. SmolVLM is small, fast, memory-efficient, and fully open-source. All model checkpoints, VLM datasets, training recipes and tools are released under the Apache 2.0 license.
-
Updated
May 17, 2025 - HTML
A Flask-based web app for managing multimodal datasets text and images with CRUD operations via SQLite, and seamless export as a structured Parquet dataset to Hugging Face Hub.
-
Updated
Jul 23, 2025 - HTML
Improve this page
Add a description, image, and links to the smolvlm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the smolvlm topic, visit your repo's landing page and select "manage topics."