Skip to content
View ming024's full-sized avatar

Highlights

  • Pro

Block or report ming024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. FastSpeech2 FastSpeech2 Public

    An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

    Python 2.2k 612

  2. kyutai-labs/moshi-rag kyutai-labs/moshi-rag Public

    MoshiRAG is a compact full-duplex speech language model augmented with asynchronous knowledge retrieval to improve factuality without sacrificing real-time interactivity.

    Rust 72 4

  3. yistLin/FragmentVC yistLin/FragmentVC Public

    Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention

    Python 204 37

  4. ankitapasad/layerwise-analysis ankitapasad/layerwise-analysis Public

    Layer-wise analysis of self-supervised pre-trained speech representations

    Python 133 22

  5. hierarchical_prosody_modeling hierarchical_prosody_modeling Public

    HTML 8

  6. voicebox_adapter voicebox_adapter Public

    HTML 2