Skip to content

Add google text-to-speech support #1

@avimar

Description

@avimar

Many "neural" voices, they call it wavenet. Same low pricing.

Returns native LINEAR16 - Uncompressed 16-bit signed little-endian samples (Linear PCM). Audio content returned as LINEAR16 also contains a WAV header.

I'm not sure where to find the native hertz rate per voice, to know if 48000 hz is supported.

Not sure if it's allowed for re-use. I asked here: https://stackoverflow.com/questions/61609222/can-i-distribute-googles-text-to-speech-files

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions