OpenWakeWord WASM (browser)

Inspired by Miro Hristov’s Deep Core Labs write-up, this package brings the same browser-only wake-word pipeline into a reusable npm module. A full React sandbox lives in this repo as well: openwakeword_wasm_react_demo.

A small browser-first wrapper around the OpenWakeWord models, built on onnxruntime-web. It exposes a WakeWordEngine class you can drop into a React app to listen for wake words such as hey_jarvis directly in Chrome, with no native layer required.

Agents should read AGENTS.md to get details and onboarding instructions.

Installation

npm install file:../openwakeword_wasm
# or after publishing: npm install openwakeword-wasm-browser

Make sure the ONNX model files in models/ are hosted somewhere the browser can fetch them (for CRA/Vite you can copy the folder into public/openwakeword/models). If you self-host the ORT wasm files, pass ortWasmPath (e.g. /openwakeword/ort/).

Basic React usage

import { useEffect, useMemo, useState } from 'react';
import WakeWordEngine from 'openwakeword-wasm-browser';

export default function WakeWordDemo() {
  const [detected, setDetected] = useState(null);
  const engine = useMemo(() => new WakeWordEngine({
    baseAssetUrl: '/openwakeword/models', // where you host the .onnx files
    keywords: ['hey_jarvis'],             // or any of: alexa, hey_mycroft, hey_rhasspy, timer, weather
    detectionThreshold: 0.5,
    cooldownMs: 2000
  }), []);

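  useEffect(() => {
    let cancelled = false; // set when the component unmounts mid-load
    let unsub;
    engine.load().then(() => {
      if (cancelled) return; // do not open the mic after unmount
      unsub = engine.on('detect', ({ keyword, score }) => {
        setDetected(`${keyword} (${score.toFixed(2)})`);
      });
      return engine.start(); // prompts for mic
    }).catch((err) => console.error('Wake word engine failed:', err));
    return () => { cancelled = true; unsub?.(); engine.stop(); };
  }, [engine]);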

  return (
    <div>
      <p>Listening for hey_jarvis…</p>
      {detected && <p>Detected: {detected}</p>}
    </div>
  );
}

Vanilla example

import WakeWordEngine from 'openwakeword-wasm-browser';

const engine = new WakeWordEngine({
  baseAssetUrl: '/openwakeword/models',
  ortWasmPath: '/openwakeword/ort/',
  keywords: ['hey_jarvis', 'alexa'],
  detectionThreshold: 0.55,
});

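// statusEl (and its #status id), playTone, and preferredMicId are app-specific placeholders.
const statusEl = document.querySelector('#status');

await engine.load();
engine.on('speech-start', () => { statusEl.textContent = 'Speech detected'; });
engine.on('speech-end', () => { statusEl.textContent = 'Silence'; });
engine.on('detect', ({ keyword }) => playTone(keyword));
await engine.start({ deviceId: preferredMicId, gain: 1.3 });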

document.querySelector('#stop').addEventListener('click', () => engine.stop());
document.querySelector('#keyword').addEventListener('change', (evt) => {
  engine.setActiveKeywords([evt.target.value]);
});
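
The vanilla example passes a deviceId to engine.start() without showing where it comes from. One possible way to choose it, sketched here, is to filter the list returned by navigator.mediaDevices.enumerateDevices(). The pickMicId helper and its preferredLabel argument are hypothetical and not part of this package.

```javascript
// Hypothetical helper: choose a deviceId for engine.start({ deviceId }).
// `devices` is the array resolved by navigator.mediaDevices.enumerateDevices().
function pickMicId(devices, preferredLabel) {
  const mics = devices.filter((d) => d.kind === 'audioinput');
  if (mics.length === 0) return undefined; // no mic listed: let the browser decide
  const wanted = (preferredLabel || '').toLowerCase();
  const match = wanted
    ? mics.find((d) => d.label.toLowerCase().includes(wanted))
    : undefined;
  return (match || mics[0]).deviceId; // fall back to the first microphone
}

// Browser usage (device labels are usually empty until mic permission is granted):
// const devices = await navigator.mediaDevices.enumerateDevices();
// await engine.start({ deviceId: pickMicId(devices, 'usb'), gain: 1.3 });
```

Keeping the selection logic separate from the browser call makes it easy to unit test with a plain array of device objects.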

API reference

await engine.load() downloads ONNX models (mel, embedding, VAD, keyword heads) and infers keyword window sizes.
await engine.start({ deviceId?, gain? }) starts microphone streaming and posts 1280-sample chunks through the AudioWorklet.
await engine.stop() tears down the graph, stops tracks, and clears cooldowns.
engine.setGain(value) updates the GainNode while running.
await engine.runWav(arrayBuffer) runs the entire pipeline offline and returns the highest score seen.
engine.setActiveKeywords(name[]) gates which keywords are allowed to emit detect.

Events

ready fires once models finish loading.
detect surfaces { keyword, score, at } when score > threshold, VAD hangover is active, and cooldown is clear.
speech-start / speech-end mirror the VAD state transitions.
error emits any pipeline failures (getUserMedia, onnxruntime, decoding issues).
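
The three conditions for detect can be read as a single boolean gate. The sketch below is illustrative only and is not the engine's actual code; the function and field names (shouldEmitDetect, hangoverFramesLeft, lastDetectAt) are made up for the example.

```javascript
// Illustrative gate for the three `detect` conditions described above.
// All names here are hypothetical; the real engine may structure this differently.
function shouldEmitDetect({ score, threshold, hangoverFramesLeft, lastDetectAt, now, cooldownMs }) {
  const aboveThreshold = score > threshold;     // strictly greater than
  const speechOpen = hangoverFramesLeft > 0;    // VAD hangover still active
  const cooledDown = lastDetectAt == null || now - lastDetectAt >= cooldownMs;
  return aboveThreshold && speechOpen && cooledDown;
}

// Example: a strong score during speech, 2.5 s after the previous detection.
const fire = shouldEmitDetect({
  score: 0.82, threshold: 0.5, hangoverFramesLeft: 7,
  lastDetectAt: 10000, now: 12500, cooldownMs: 2000,
});
console.log(fire);
```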

Asset layout

Example with Vite/CRA:

public/
  openwakeword/
    models/
      melspectrogram.onnx
      embedding_model.onnx
      silero_vad.onnx
      hey_jarvis_v0.1.onnx
      ...
    ort/
      ort-wasm.wasm
      ort-wasm-simd.wasm

Then instantiate with baseAssetUrl: '/openwakeword/models' and, if you host the wasm yourself, ortWasmPath: '/openwakeword/ort/' (with the trailing slash, as in the examples above). If ortWasmPath is omitted, onnxruntime-web uses its default CDN.
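
Slash handling is an easy place to get a 404. The sketch below normalizes both paths on the caller's side, on the assumption (not confirmed by this README) that the engine concatenates baseAssetUrl with a file name and hands ortWasmPath to onnxruntime-web as a path prefix. Both helper names are hypothetical.

```javascript
// Hypothetical helpers; the engine may already normalize these itself.

// Join a base URL and a file name with exactly one slash between them.
function modelUrl(baseAssetUrl, fileName) {
  return `${baseAssetUrl.replace(/\/+$/, '')}/${fileName.replace(/^\/+/, '')}`;
}

// Make sure a directory path ends with a slash so it works as a prefix.
function asPathPrefix(path) {
  return path.endsWith('/') ? path : `${path}/`;
}

console.log(modelUrl('/openwakeword/models/', 'silero_vad.onnx'));
console.log(asPathPrefix('/openwakeword/ort'));
```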

Notes

The engine runs at 16 kHz with 80 ms frames (1280 samples per frame), mirroring the reference demo in main.js.
VAD hangover is tuned to 12 frames (roughly 0.96 s if each VAD frame is one 80 ms chunk) to keep speech open long enough for the wake word score to peak.
Cooldown (cooldownMs) prevents multiple detections per utterance; lower it if you want rapid-fire triggers.
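
The frame arithmetic and the hangover behaviour can be sketched in a few lines. This is an illustration under the numbers given in these notes, not the engine's implementation; samplesPerFrame and createHangover are hypothetical names, and the exact hangover semantics in the engine may differ by a frame.

```javascript
// Samples in one frame: sampleRate (Hz) * frame length (ms) / 1000.
function samplesPerFrame(sampleRateHz, frameMs) {
  return Math.round((sampleRateHz * frameMs) / 1000);
}

// Hangover counter: after the last speech frame, keep reporting "speech open"
// for `hangoverFrames` further non-speech frames, then close.
function createHangover(hangoverFrames) {
  let left = 0;
  return (isSpeech) => {
    if (isSpeech) { left = hangoverFrames; return true; }
    if (left > 0) { left -= 1; return true; }
    return false;
  };
}

const frameSize = samplesPerFrame(16000, 80); // 1280 samples
const hangoverMs = 12 * 80;                   // 960 ms at 80 ms per frame
console.log(frameSize, hangoverMs);
```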

Publishing / packaging tips

npm pack (or npm publish) includes src/, models/, and README.md via the files list so consumers get the engine and bundled assets.
Ship the ONNX assets alongside the package or document the public hosting location (baseAssetUrl). The React demo copies them into public/openwakeword/models.
Consider running engine.runWav() against hey_jarvis_11-2.wav before publishing to verify the scoring path still peaks near 1.0.
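
A pre-publish check along those lines might look like the sketch below. It assumes that runWav resolves to a plain numeric score and that load() must be called first; neither is spelled out in this README. The smokeTest name and the 0.9 floor (a stand-in for "near 1.0") are arbitrary choices for the example.

```javascript
// Hypothetical smoke test: run a known-positive WAV through the engine
// and check that the peak score clears a floor.
// Assumes engine.runWav(arrayBuffer) resolves to a number.
async function smokeTest(engine, wavBuffer, minScore = 0.9) {
  await engine.load();                       // models must be available first (assumed)
  const score = await engine.runWav(wavBuffer);
  return { score, ok: typeof score === 'number' && score >= minScore };
}

// Browser usage, with the WAV hosted next to the page:
// const wav = await (await fetch('/hey_jarvis_11-2.wav')).arrayBuffer();
// const { score, ok } = await smokeTest(engine, wav);
// if (!ok) throw new Error(`hey_jarvis peaked at ${score}`);
```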

