PaddleSpeech/dataset/ami
qingen 2775235c8d
[vector] add AMI data preparation scripts
3 years ago
.gitignore
README.md
ami_prepare.py
ami_splits.py
dataio.py

README.md

AMI

The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. For a gentle introduction to the corpus, see the corpus overview, which also gives directions for accessing the data. Around two-thirds of the data was elicited using a scenario in which the participants play different roles in a design team, taking a design project from kick-off to completion over the course of a day. The rest consists of naturally occurring meetings in a range of domains.

Detailed information can be found in the documentation section.