You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
38 lines
1.5 KiB
38 lines
1.5 KiB
# PaddleAudio: The audio library for PaddlePaddle
|
|
|
|
## Introduction
|
|
PaddleAudio is the audio toolkit to speed up your audio research and development loop in PaddlePaddle. It currently provides a collection of audio datasets, feature-extraction functions, audio transforms,state-of-the-art pre-trained models in sound tagging/classification and anomaly sound detection. More models and features are on the roadmap.
|
|
|
|
|
|
|
|
## Features
|
|
- Spectrogram and related features are compatible with librosa.
|
|
- State-of-the-art models in sound tagging on Audioset, sound classification on esc50, and more to come.
|
|
- Ready-to-use audio embedding with a line of code, includes sound embedding and more on the roadmap.
|
|
- Data loading supports for common open source audio in multiple languages including English, Mandarin and so on.
|
|
|
|
|
|
## Install
|
|
```
|
|
git clone https://github.com/PaddlePaddle/models
|
|
cd models/PaddleAudio
|
|
pip install .
|
|
|
|
```
|
|
|
|
## Quick start
|
|
### Audio loading and feature extraction
|
|
```
|
|
import paddleaudio as pa
|
|
s,r = pa.load(f)
|
|
mel_spect = pa.melspectrogram(s,sr=r)
|
|
```
|
|
|
|
### Examples
|
|
We provide a set of examples to help you get started in using PaddleAudio quickly.
|
|
- [PANNs: acoustic scene and events analysis using pre-trained models](./examples/panns)
|
|
- [Environmental Sound classification on ESC-50 dataset](./examples/sound_classification)
|
|
- [Training a audio-tagging network on Audioset](./examples/audioset_training)
|
|
|
|
Please refer to [example directory](./examples) for more details.
|