Skip to content

Releases: FluidInference/FluidAudio

v0.7.10

29 Nov 08:22
d540f00

Choose a tag to compare

What's Changed

Full Changelog: v0.7.9...v0.7.10

What's Changed

  • Update FluidAudio version to 0.7.9 by @BrandonWeng in #192
  • Add word-level timestamps support to CLI transcribe command by @Alex-Wengg in #193
  • optionalize TTS via FluidAudioTTS target by @Alex-Wengg in #186
  • Expose streaming chunk API and package ESpeakNG xcframework with dSYMs by @Alex-Wengg in #201
  • Streaming Diarization Improvements by @SGD2718 in #191
  • Fix: Move ESpeakNG.xcframework to top-level Frameworks directory by @Alex-Wengg in #205

Full Changelog: v0.7.9...v0.7.10

v0.7.9

18 Nov 18:27
a6a0e2c

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.7.8...v0.7.9

v0.7.8

04 Nov 14:53
8136bd0

Choose a tag to compare

Impact:

  • Remove shared buffers for diarization pipeline that was causing concurrency crashes, < 3% impact to latency.
  • Reduced missing words by 10% when running ASR on long audio files
  • Slightly improved WER for v2 and v3 (~0.5% on benchmarks) and ~5% faster!
  • Programmatically override the default registry to download from. (i.e hf-mirror.com), useful for Chinese developers

What's Changed

Full Changelog: v0.7.7...v0.7.8

v0.7.7

29 Oct 23:07
2dd0bd1

Choose a tag to compare

What's Changed

  • Add Intel ESpeakNG support so its easier for others to build by @BrandonWeng in #167

Full Changelog: v0.7.6...v0.7.7

v0.7.6

28 Oct 16:16
b2248c4

Choose a tag to compare

What's Changed

This should fix SPM issues

#164

Full Changelog: v0.7.5...v0.7.6

v0.7.5

27 Oct 02:00
f47209a

Choose a tag to compare

What's Changed

Core ASR & Diarization

Speech Synthesis (ESpeakNG)

Dataset & Pipeline Improvements

  • Expanded FLEURS dataset coverage to all 25 languages
  • Added Hugging Face download retries for robustness (#158) — @BrandonWeng

Internal Cleanups

  • General refactors and organization improvements across diarization and data pipelines.

New Contributors


Full Changelog: v0.7.4...v0.7.5

v0.7.4

20 Oct 23:21
06ab3d5

Choose a tag to compare

What's Changed

  • Fix Kokoro File phonic not found issue with xFramework by @BrandonWeng in #151

Should build and run properly now, verified with another developer

Full Changelog: v0.7.2...v0.7.4

v0.7.2

19 Oct 18:53
2df5261

Choose a tag to compare

What's Changed

  • Update ESpeak and hard fail if missing by @BrandonWeng in #148
  • Include header for import files by @BrandonWeng in #149
  • Bump min versions to macos 14 and iOS17. This shouldn't matter for most as our models are built against MacOS14 and iOS17..

Full Changelog: v0.7.1...v0.7.2

v0.7.1

18 Oct 21:31

Choose a tag to compare

What's Changed

  • Adding support for phonetic and alias replacement by @smdesai in #140
  • Fix ML cache race condition for streaming ASR by @BrandonWeng in #147

New Contributors

Full Changelog: v0.7.0...v0.7.1