A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
Updated Apr 29, 2025 - Python
This repository focuses on practical ways to obtain and work with AudioSet audio data. You can use it to download and preprocess AudioSet WAV files for running the recipes of Audio Spectrogram Transformer (AST) and Masked Autoencoders that Listen (Audio-MAE).
This project leverages Convolutional Neural Networks (CNNs) to perform real-time audio classification in rainforest environments, detecting sounds such as chainsaws, truck engines, and storms. Audio data, sourced from UrbanSound8K and rainforest recordings, is preprocessed into spectrograms using Librosa, while synthetic soundscapes are generated w
This repository contains code/papers/research on Speech or Audio Classification
BirdCLEF 2025 soundscape classification project using deep learning (SeresNeXt26t) with PyTorch, pseudo-labeling, and audio preprocessing for bird species identification.
A Convolutional Neural Network trained to detect COVID-19, even in asymptomatic patients, using only cough recordings.
Dataset for CABA: Clasificador Automatico de Botellas por Acustica (Automatic Acoustic Bottle Classifier)
A deep learning model that can detect the presence of capuchin bird calls in audio clips
This is a space where I share my personal portfolio :)
Speech Emotion Recognition (SER) is the task of recognizing human emotion and affective states from speech. It capitalizes on the fact that voice often reflects underlying emotion through tone and pitch. Animals such as dogs and horses rely on the same cues to understand human emotion.