whisper_ros

This repository provides a set of ROS 2 packages to integrate whisper.cpp into ROS 2 using audio_common 4.0.7. Besides, silero-vad is used to perform VAD (Voice Activity Detection).

ROS 2 Distro	Branch	Build status	Docker Image	Documentation
Humble	`main`
Iron	`main`
Jazzy	`main`
Kilted	`main`
Rolling	`main`

Related Projects

chatbot_ros → This chatbot, integrated into ROS 2, uses whisper_ros, to listen to people speech; and llama_ros, to generate responses. The chatbot is controlled by a state machine created with YASMIN.

Installation

To run whisper_ros with CUDA, first, you must install the CUDA Toolkit. To run SileroVAD with ONNX and CUDA, you must install the cuDNN.

cd ~/ros2_ws/src
git clone https://github.com/mgonzs13/audio_common.git
git clone https://github.com/mgonzs13/whisper_ros.git
cd ~/ros2_ws
rosdep install --from-paths src --ignore-src -r -y
colcon build --cmake-args -DGGML_CUDA=ON -DONNX_GPU=ON # To use CUDA on Whisper and on Silero, respectively

Docker

Build the whisper_ros docker. Additionally, you can choose to build whisper_ros with CUDA (USE_CUDA) and choose the CUDA version (CUDA_VERSION). Remember that you have to use DOCKER_BUILDKIT=0 to compile whisper_ros with CUDA when building the image.

DOCKER_BUILDKIT=0 docker build -t whisper_ros --build-arg USE_CUDA=1 --build-arg CUDA_VERSION=12-6 .

Run the docker container. If you want to use CUDA, you have to install the NVIDIA Container Tollkit and add --gpus all.

docker run -it --rm --gpus all whisper_ros

Usage

Run Silero for VAD and Whisper for STT:

ros2 launch whisper_bringup whisper.launch.py

Add the parameter silero_vad_use_cuda:=True to use Silero with CUDA.

Demos

Send a goal action to listen:

ros2 action send_goal /whisper/listen whisper_msgs/action/STT "{}"

Or try the example of a whisper client:

ros2 run whisper_demos whisper_demo_node

Name		Name	Last commit message	Last commit date
Latest commit History 263 Commits
.github		.github
whisper_bringup		whisper_bringup
whisper_cpp_vendor		whisper_cpp_vendor
whisper_demos		whisper_demos
whisper_hfhub_vendor		whisper_hfhub_vendor
whisper_msgs		whisper_msgs
whisper_onnxruntime_vendor		whisper_onnxruntime_vendor
whisper_ros		whisper_ros
.gitignore		.gitignore
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

whisper_ros

Table of Contents

Related Projects

Installation

Docker

Usage

Demos

About

Uh oh!

Releases 50

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

License

mgonzs13/whisper_ros

Folders and files

Latest commit

History

Repository files navigation

whisper_ros

Table of Contents

Related Projects

Installation

Docker

Usage

Demos

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 50

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages