You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
PaddleSpeech/utils
Hui Zhang b472a148dc
format
2 years ago
..
fst format 3 years ago
parallel
DER.py format 3 years ago
README.md rm ds2_ol test dir 3 years ago
__init__.py more utils to support kaldi/espnet data preocess 3 years ago
addjson.py format code, test=doc 3 years ago
apply-cmvn.py format code, test=doc 3 years ago
avg.sh
avg_model.py transform; librispeech/s2 data process ok 3 years ago
build_kenlm_model_from_arpa.sh
build_vocab.py update data process 3 years ago
caculate_rtf.py lm embed and format code 3 years ago
compute-cmvn-stats.py merge deepspeech, parakeet and text_processing into paddlespeech 3 years ago
compute-wer.py more detail of copyright 3 years ago
compute_mean_std.py filter example; cmvn stride and window int; libri/s1 conf 3 years ago
compute_statistics.py update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419) 3 years ago
copy-feats.py format code, test=doc 3 years ago
data2json.sh lm embed and format code 3 years ago
dump.sh more utils to support kaldi/espnet data preocess 3 years ago
dump_manifest.py format code 3 years ago
duration_from_maniefst.sh
espnet_json_to_manifest.py fix utils for ngram and wfst 3 years ago
feat-to-shape.py merge deepspeech, parakeet and text_processing into paddlespeech 3 years ago
feat_to_shape.sh more utils to support kaldi/espnet data preocess 3 years ago
filter.py
filter_scp.pl
format_data.py format code 3 years ago
format_rsl.py fix param path name; ws client 3 years ago
format_triplet_data.py format code 3 years ago
gen_duration_from_textgrid.py [TTS]fix praatio version, test=tts (#1158) 3 years ago
generate_infer_yaml.py prefect the packing scripts, test=doc 3 years ago
json2trn.py
link_wav.py fix utils for ngram and wfst 3 years ago
log.sh
manifest_key_value.py text process for lm 3 years ago
md-eval.pl [vector] add AMI data preparation scripts 3 years ago
merge_scp2json.py format code, test=doc 3 years ago
ngram_train.sh
pack_model.sh fix tiny conf 3 years ago
parse_options.sh
pd_env_collect.sh
profile.sh
reduce_data_dir.sh more utils to support kaldi/espnet data preocess 3 years ago
remove_longshortdata.py format code 3 years ago
remove_longshortdata.sh more utils to support kaldi/espnet data preocess 3 years ago
run.pl librispeech s1 support multi process decode and sclite 3 years ago
score_sclite.sh
scp2json.py update format 3 years ago
show_results.sh pack model 3 years ago
spk2utt_to_utt2spk.pl
split_data.sh
split_json.sh
split_scp.pl
spm_decode
spm_encode
spm_train
tarball.sh
text2token.py lm embed and format code 3 years ago
text_to_lexicon.py mv text_to_lexicon.py to utils 3 years ago
tokenizer.perl add utils 3 years ago
train_arpa_with_kenlm.sh
update_json.sh add utils 3 years ago
utility.py format code 3 years ago
utility.sh fix set_device; more utils; args.opts support multi same name 3 years ago
utt2spk_to_spk2utt.pl
zh_tn.py format 2 years ago

README.md