In this project, we fine-tuned a wav2vec2 model on multiple tasks: speech recognition, speaker identification, and emotion recognition.

asadullah797/ssl-semi-multitask

Wav2Vec2-MultiTask

This is a fine-tuned Wav2Vec2.0 model for multi-task learning:

  • Phoneme recognition
  • Emotion classification
  • Speaker identification
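
As a rough illustration of how one encoder can serve the three tasks above, the sketch below puts a small linear head per task on top of shared encoder features and selects a head with a `task` argument. It is not taken from this repository: the class name `MultiTaskHeads`, the head sizes, the mean pooling, and the task strings `"emotion"` and `"speaker"` are all assumptions made for illustration.

```python
import torch
from torch import nn


class MultiTaskHeads(nn.Module):
    """Hypothetical task heads on top of a shared speech encoder."""

    def __init__(self, hidden_size=768, num_phonemes=40, num_emotions=4, num_speakers=10):
        super().__init__()
        # Frame-level head: one phoneme prediction per encoder frame.
        self.phoneme = nn.Linear(hidden_size, num_phonemes)
        # Utterance-level heads: one prediction per clip.
        self.emotion = nn.Linear(hidden_size, num_emotions)
        self.speaker = nn.Linear(hidden_size, num_speakers)

    def forward(self, hidden_states, task):
        # hidden_states: (batch, frames, hidden_size) from the shared encoder
        if task == "phoneme":
            return self.phoneme(hidden_states)
        pooled = hidden_states.mean(dim=1)  # average over the time axis
        if task == "emotion":
            return self.emotion(pooled)
        if task == "speaker":
            return self.speaker(pooled)
        raise ValueError(f"unknown task: {task!r}")


heads = MultiTaskHeads()
features = torch.randn(2, 49, 768)  # random stand-in for encoder output
phoneme_logits = heads(features, task="phoneme")  # one row of scores per frame
emotion_logits = heads(features, task="emotion")  # one row of scores per clip
```

Phoneme recognition keeps the time axis because it labels every frame, while emotion and speaker predictions are made once per utterance, which is why the sketch pools over time before those two heads.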

Usage

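import numpy as np
from transformers import AutoModel, AutoConfig, AutoProcessor

# trust_remote_code=True lets transformers run the custom model code shipped with the checkpoint
model = AutoModel.from_pretrained(
    "username/my-wav2vec2-multitask",
    trust_remote_code=True
)

config = AutoConfig.from_pretrained(
    "username/my-wav2vec2-multitask",
    trust_remote_code=True
)

processor = AutoProcessor.from_pretrained("facebook/wav2vec2-base")

# The processor expects a raw mono waveform sampled at 16 kHz, not text.
# One second of silence stands in for real audio here.
waveform = np.zeros(16000, dtype=np.float32)
inputs = processor(waveform, return_tensors="pt", sampling_rate=16000)

# phoneme recognition
logits = model(**inputs, task="phoneme")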
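
The phoneme logits still need to be turned into a phoneme sequence. The README does not say how the phoneme head was trained, so the sketch below assumes CTC training with blank id 0 and applies greedy CTC decoding: collapse repeated frame predictions, then drop blanks. The function name `ctc_greedy_decode`, the toy vocabulary, and the frame ids are hypothetical. In practice the frame ids would typically come from an argmax over the last dimension of the logits.

```python
def ctc_greedy_decode(frame_ids, id_to_phoneme, blank_id=0):
    """Collapse repeated frame predictions and drop blanks (greedy CTC decoding)."""
    decoded = []
    previous = None
    for idx in frame_ids:
        # Emit a symbol only when it differs from the previous frame and is not blank.
        if idx != previous and idx != blank_id:
            decoded.append(id_to_phoneme[idx])
        previous = idx
    return decoded


# Toy vocabulary and per-frame argmax ids, both made up for illustration.
vocab = {0: "<blank>", 1: "HH", 2: "AH", 3: "L", 4: "OW"}
frame_ids = [0, 1, 1, 0, 2, 2, 0, 3, 3, 3, 0, 4, 0]

phonemes = ctc_greedy_decode(frame_ids, vocab)
print(phonemes)
```

A blank between two identical ids keeps them as separate symbols, which is how CTC represents a genuinely repeated phoneme.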
