Ph.D. Student at TTIC | Speech & NLP | NTU Speech Lab.
- Chicago, IL
- https://cmchien.ttic.edu
Highlights
- Pro
Pinned Loading
-
FastSpeech2
FastSpeech2 PublicAn implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
-
kyutai-labs/moshi-rag
kyutai-labs/moshi-rag PublicMoshiRAG is a compact full-duplex speech language model augmented with asynchronous knowledge retrieval to improve factuality without sacrificing real-time interactivity.
-
yistLin/FragmentVC
yistLin/FragmentVC PublicAny-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
-
ankitapasad/layerwise-analysis
ankitapasad/layerwise-analysis PublicLayer-wise analysis of self-supervised pre-trained speech representations
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
