You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
KP
ee9972a0d3
|
3 years ago | |
---|---|---|
.. | ||
examples | 3 years ago | |
paddleaudio | 3 years ago | |
test | 3 years ago | |
.gitignore | 3 years ago | |
.pre-commit-config.yaml | 3 years ago | |
.style.yapf | 3 years ago | |
LICENSE | 3 years ago | |
README.md | 3 years ago | |
requirements.txt | 3 years ago | |
setup.py | 3 years ago |
README.md
PaddleAudio: The audio library for PaddlePaddle
Introduction
PaddleAudio is the audio toolkit to speed up your audio research and development loop in PaddlePaddle. It currently provides a collection of audio datasets, feature-extraction functions, audio transforms,state-of-the-art pre-trained models in sound tagging/classification and anomaly sound detection. More models and features are on the roadmap.
Features
- Spectrogram and related features are compatible with librosa.
- State-of-the-art models in sound tagging on Audioset, sound classification on esc50, and more to come.
- Ready-to-use audio embedding with a line of code, includes sound embedding and more on the roadmap.
- Data loading supports for common open source audio in multiple languages including English, Mandarin and so on.
Install
git clone https://github.com/PaddlePaddle/models
cd models/PaddleAudio
pip install .
Quick start
Audio loading and feature extraction
import paddleaudio as pa
s,r = pa.load(f)
mel_spect = pa.melspectrogram(s,sr=r)
Examples
We provide a set of examples to help you get started in using PaddleAudio quickly.
- PANNs: acoustic scene and events analysis using pre-trained models
- Environmental Sound classification on ESC-50 dataset
- Training a audio-tagging network on Audioset
Please refer to example directory for more details.