You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/examples/ami/sd0
ccrrong 995436c6f1
delete unused file ami_dataset.py, compute_der.py, test=doc
3 years ago
..
conf code format, test=doc 3 years ago
local delete unused file ami_dataset.py, compute_der.py, test=doc 3 years ago
.gitignore [vector] add AMI data preparation scripts 3 years ago
README.md [vector] add AMI data preparation scripts 3 years ago
path.sh [vector] add AMI data preparation scripts 3 years ago
run.sh convert dataset format to paddlespeech, test=doc 3 years ago
utils [vector] add AMI data preparation scripts 3 years ago

README.md

Speaker Diarization on AMI corpus

About the AMI corpus:

"The AMI Meeting Corpus consists of 100 hours of meeting recordings. The recordings use a range of signals synchronized to a common timeline. These include close-talking and far-field microphones, individual and room-view video cameras, and output from a slide projector and an electronic whiteboard. During the meetings, the participants also have unsynchronized pens available to them that record what is written. The meetings were recorded in English using three different rooms with different acoustic properties, and include mostly non-native speakers." See ami overview for more details.

About the example

The script performs diarization using x-vectors(TDNN,ECAPA-TDNN) on the AMI mix-headset data. We demonstrate the use of different clustering methods: AHC, spectral.

How to Run

Use the following command to run diarization on AMI corpus. bash ./run.sh

Results (DER) coming soon! :)