Skip to content
Change the repository type filter

All

    Repositories list

    • Code of VPC 2024 with extended privacy evaluation options to find overestimated results
      Python
      11100Updated Jul 28, 2025Jul 28, 2025
    • Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
      Shell
      108810Updated Jul 4, 2025Jul 4, 2025
    • Controllable and fast Text-to-Speech for over 7000 languages!
      Python
      1951.7k50Updated Jun 30, 2025Jun 30, 2025
    • diagraph

      Public
      DIAGRAPH: An open-source graphic interface for dialog flow design
      JavaScript
      1700Updated Mar 14, 2025Mar 14, 2025
    • Jupyter Notebook
      2710Updated Feb 19, 2025Feb 19, 2025
    • Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.
      Python
      0810Updated Feb 5, 2025Feb 5, 2025
    • Predicting a subgraph alongside the answer in a graph based VQA model
      Python
      1910Updated Jan 21, 2025Jan 21, 2025
    • Python
      0500Updated Jul 3, 2024Jul 3, 2024
    • bloomzmms

      Public
      Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"
      Python
      0200Updated Jun 16, 2024Jun 16, 2024
    • Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"
      Python
      0200Updated Jun 16, 2024Jun 16, 2024
    • VoicePAT

      Public
      VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
      Shell
      55220Updated May 14, 2024May 14, 2024
    • adviser

      Public
      ADvISER is a flexible framework to encourage task-oriented dialog system research & development
      Python
      356036Updated Aug 14, 2023Aug 14, 2023
    • Code accompanying our paper on finetuning self-supervised general speech representations with a combination of contrastive and non-contrastive methods.
      Python
      0210Updated Oct 5, 2022Oct 5, 2022
    • IMS-Speech is a tool for German, English and Russian speech transcription aiming to facilitate research in various disciplines. We are willing to provide a speech transcription service with an intuitive web interface accessible with a wide range of computing devices and to people with various backgrounds. Our service is available here: https://7…
      Go
      2510Updated May 13, 2022May 13, 2022
    • Our_Fault

      Public
      A collaborative dialog game playable by a human and an AI system, designed to better understand how users view such an AI partner. The repository contains code for the game as well as dialog logs, survey responses, and annotations from a user study conducted with this scenario.
      Python
      0000Updated Nov 10, 2021Nov 10, 2021
    • A project exploring ethical implications of chatbot design, in particular affective language style. The repository contains code, survey responses, and annotated data for the experiment conducted using this implementation.
      Python
      0000Updated Nov 9, 2021Nov 9, 2021
    • CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition
      Python
      11210Updated Oct 7, 2019Oct 7, 2019
    • nlg-eval

      Public
      Code accompanying the INLG 2018 paper Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity
      Python
      0600Updated Aug 30, 2019Aug 30, 2019
    • Comparing attention-based convolutional and recurrent neural networks under adversarial attacks to investigate their success and limitations in machine reading comprehension
      Python
      31000Updated Aug 24, 2018Aug 24, 2018