Commit Graph

1067 Commits (03022f2170ce76d2ca8385a92aa8df3519e2366b)

Author SHA1 Message Date
TianYuan 378d25aef8 update vits ckpt, test=tts
2 years ago
Jackwaterveg e37bcdd5a8
test=doc
2 years ago
Jackwaterveg 69f399f8cd
test=doc
2 years ago
Jackwaterveg e1f686abd9
test=doc
2 years ago
Jackwaterveg c167e128c5
fix doc,test=doc
2 years ago
TianYuan c1395e3a05 add synthesize for ernie_sat aishell3 and aishell3_vctk, test=tts
2 years ago
Hui Zhang 6699d760e4
Update README.md
2 years ago
TianYuan 5503c8bd6b add ernie_sat synthesize script for metadata.jsonl, test=tts
2 years ago
TianYuan e129bc736b fix am name, test=tts
2 years ago
TianYuan 028742b69a update lr scheduler
2 years ago
TianYuan 94688264c7 add ernie sat model file and config
2 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
2 years ago
huangyuxin 92d1d08b9a fix scripts
2 years ago
Jackwaterveg 7fc81fe9d9
test=doc
2 years ago
Jackwaterveg 32e8c6f16c
test=doc
2 years ago
Jackwaterveg 1b0cda961f
test=doc
2 years ago
TianYuan e0a87ea914
Merge pull request #2090 from KPatr1ck/sv
2 years ago
TianYuan 60c1a1e575
Merge pull request #2087 from yt605155624/add_blank
2 years ago
TianYuan b2b05a0bc7 add vits ckpt, test=doc
2 years ago
TianYuan e3075e7917 install CPython version monotonic_align before train, test=tts
2 years ago
huangyuxin 429221dc03 adopt multi machine traiing
2 years ago
KP 19fd46f57b Fix rir download. test=doc
2 years ago
huangyuxin ac1b301657 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
huangyuxin 81934d7191 fix run.sh
2 years ago
Jackwaterveg 6598216b2f
Merge branch 'develop' into webdataset
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
TianYuan e233c4e449
Merge pull request #2079 from Honei/develop
2 years ago
xiongxinlei d15883e3dc add convert.sh
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
TianYuan 6a45c5c3f5 add tts static/onnx models' link in released_model.md, test=doc
2 years ago
TianYuan 7743c6a1ff add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
2 years ago
TianYuan d1aa83a239
Merge pull request #2052 from yt605155624/ernie_sat
2 years ago
TianYuan 79658a5f20 add ernie sat inference, test=tts
2 years ago
TianYuan 02734141ce
Merge pull request #2040 from yt605155624/add_blank
2 years ago
TianYuan 1731976e4e add blank between characters for vits, test=tts
2 years ago
Jackwaterveg bca014fd92
Merge pull request #2032 from PaddlePaddle/audio_refactoring
2 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
2 years ago
TianYuan a37a5266f5
Merge pull request #2031 from Jackwaterveg/develop_fix
2 years ago
huangyuxin 865d075831 fix local/test.sh of librispeech asr1
2 years ago
Jackwaterveg b9d35c9b2b
Merge pull request #2028 from Jackwaterveg/develop_dev
2 years ago
huangyuxin 9aa868d14d support distrbuted training
2 years ago
Jackwaterveg 4432190fa8
test=doc
2 years ago
Jackwaterveg 6fe4cc1e47
test=doc
2 years ago
Jackwaterveg 681151a8c8
test=doc
2 years ago
Hui Zhang dfdf450b22 fix #2013; and format
2 years ago
huangyuxin 61e565182a add preprocess.yaml
2 years ago
huangyuxin e48e1d5e81 fix tiny and local script, test=asr
3 years ago
huangyuxin 47dd61e5b2 refactor ds2, cli, server
3 years ago
TianYuan 6c7ed42712 fix ljspeech readme, test=doc
3 years ago
TianYuan 9a253bc091 gen lexicon with tone in mfa, test=tts
3 years ago
TianYuan e6e5d86a5a
Merge pull request #1984 from Jackwaterveg/develop
3 years ago
huangyuxin 62c50e0060 deprecate the 1.8x model, test=doc
3 years ago
Hui Zhang 42fba661c9 more detail of copyright
3 years ago
TianYuan 7bc54cbbe6
Merge pull request #1957 from yt605155624/vits_doc
3 years ago
TianYuan f9f014d159 add VITS readme, test=tts
3 years ago
Hui Zhang 8f8239ad3b
Merge pull request #1954 from Honei/acs_server
3 years ago
xiongxinlei 07c0d7d7cc remove old vector model info, test=doc
3 years ago
xiongxinlei 7afbdbefad update the vector model, test=doc
3 years ago
qingen a7037dc029 [vec][doc] update der result, test=doc
3 years ago
TianYuan 7a88e3f4e4 update readme, test=doc
3 years ago
TianYuan df3f975ea5 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vits
3 years ago
TianYuan 8db06444c5 add vits trainer and synthesize
3 years ago
Jackwaterveg 6b6c6cc1eb
test=doc
3 years ago
Jackwaterveg 689e3bfc60 test=doc
3 years ago
Jackwaterveg a55ec2f69f
test=doc
3 years ago
Jackwaterveg cab0394954
test=doc
3 years ago
Jackwaterveg b5437a293d
test=doc
3 years ago
root 8a3c88d42e add launch for st, test=asr
3 years ago
root 9f389a7a33 support cpu, test=asr
3 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
3 years ago
TianYuan 4b7786f2ed add vits network scripts, test=tts
3 years ago
root 9947380898 fix the doc, test=doc
3 years ago
root 4d7046d244 updata released model info, test=doc
3 years ago
root 0309a4d032 Add doc for wenetspeech model, test=doc
3 years ago
Hui Zhang 624ab2c57a update asr1 config
3 years ago
lizi 10b00b4da7 fix the reorganize_aishell3 trouble now it can generate lab files of audio files under training classification
3 years ago
Honei ff7dbcc2de
Merge branch 'develop' into v0.3
3 years ago
xiongxinlei c5fe181405 update the paddlespeech_client asr_online cli, test=doc
3 years ago
Hui Zhang 6132457b7b
Merge pull request #1808 from Jackwaterveg/develop_dev
3 years ago
Hui Zhang b66838faa9
Merge pull request #1811 from Honei/v0.3
3 years ago
xiongxinlei 4c56e4d42c update the voxceleb readme.md, test=doc
3 years ago
huangyuxin e5fbd8ce75 renew ds2 online doc, test=doc
3 years ago
xiongxinlei cb9beabace fix the sv ecapa-tdnn cpu training, test=doc
3 years ago
Hui Zhang 5f62c84cb0
Merge pull request #1791 from qingen/cluster
3 years ago
qingen 648cc5823b [vec] update readme, test=doc
3 years ago
Hui Zhang 42a81f453b
Merge pull request #1781 from PaddlePaddle/Jackwaterveg-patch-2
3 years ago
KP abb15ac6e8 Update KWS example.
3 years ago
Jackwaterveg 5ecdf3d3cd
Update RESULTS.md
3 years ago
KP caa8eb4d0d Add KWS example.
3 years ago
KP 43659b9882 Add KWS example.
3 years ago
KP f9761d532c Add KWS example.
3 years ago
KP b60b1dadde Add KWS example.
3 years ago
TianYuan 0b1b573a3f
Merge pull request #1767 from Jackwaterveg/cli
3 years ago
huangyuxin ad4e04fc82 add conformer_online_aishell, test=doc
3 years ago
Hui Zhang 91e24b0480 format code
3 years ago
TianYuan a7402203ec Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into update_paddle2onnx
3 years ago
TianYuan b55865b228 update version of paddle2onnx, test=tts
3 years ago
TianYuan 9121dfc046
Merge pull request #1752 from yt605155624/fix_wavernn
3 years ago
TianYuan 08a4673355 fix wavernn bug, test=tts
3 years ago
Jackwaterveg be42b1322a
Updata released model info, test=doc
3 years ago
Hui Zhang 21b740f3cf
Merge pull request #1746 from PaddlePaddle/Jackwaterveg-patch-1
3 years ago
Hui Zhang 2974931196
Merge pull request #1747 from Jackwaterveg/debug
3 years ago
huangyuxin 9c77d9e880 fix, test=doc
3 years ago
Jackwaterveg 132c8916c3
Update RESULTS.md
3 years ago
xiongxinlei df503e97c1 update the voxceleb readme.md, test=doc
3 years ago
Jackwaterveg b7f62ba82f
Update RESULTS.md
3 years ago
Jackwaterveg 8d1ee8262e
Merge branch 'develop' into CER
3 years ago
TianYuan c74fa9ada8 restructure syn_utils.py, test=tts
3 years ago
huangyuxin 4e431ae269 resum librispeech
3 years ago
huangyuxin 7c3c1b440a change librispeech
3 years ago
huangyuxin 6e80618e3d add ds2
3 years ago
Hui Zhang 7220b11b58
Merge pull request #1715 from zh794390558/spx_egs
3 years ago
Hui Zhang b78bc6375b
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
3 years ago
Hui Zhang 0ede6c2ee7 train lm
3 years ago
TianYuan da93f944e6 update, test=doc
3 years ago
TianYuan dafe7c3657 add fastspeech2 cnndecoder onnx model, test=tts
3 years ago
Hui Zhang a054d1c452 text process for lm
3 years ago
huangyuxin 4f23caa238 fix bug
3 years ago
TianYuan 7dcfc4aa95 [doc]add pwgan onnx model, test=doc
3 years ago
TianYuan 98f67870ea
Merge pull request #1693 from yt605155624/fix_ss_NHWC
3 years ago
buchongyu 48358055d0 修改hack 单词拼写错误
3 years ago
buchongyu 607a20a54c 修复 example 目录中speech单词拼写错误问题
3 years ago
TianYuan 8b801ca18b change NLC to NCL in speedyspeech, test=tts
3 years ago
xiongxinlei d1935d8552 add vector necessary note, test=doc
3 years ago
Honei 48e0177767
Merge pull request #1630 from Honei/vox12
3 years ago
qingen fc72295334
Merge pull request #1651 from ccrrong/ami
3 years ago
ccrrong 995436c6f1 delete unused file ami_dataset.py, compute_der.py, test=doc
3 years ago
Hui Zhang 44ee5cd805
Merge pull request #1677 from PaddlePaddle/Jackwaterveg-patch-1
3 years ago
ccrrong bc53f726fe convert dataset format to paddlespeech, test=doc
3 years ago
TianYuan c674e59b91 update readme, test=doc
3 years ago
TianYuan 0282d45c62 remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
3 years ago
TianYuan 30628f6832 update readme, test=doc
3 years ago
TianYuan c765fca6b4 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
3 years ago
TianYuan 21c75684ac add paddle2onnx, test=tts
3 years ago
Jackwaterveg 75c9dc773b
test=doc
3 years ago
Jackwaterveg 3c93953550
test=doc
3 years ago
Jackwaterveg f71b9b915d
test=doc
3 years ago
Jackwaterveg 1a67038616
test=doc
3 years ago
Jackwaterveg 88f5595bd7
test=doc
3 years ago
Jackwaterveg ee96fb40f0
test=doc
3 years ago
Jackwaterveg a22f29ba10
test=doc
3 years ago
Jackwaterveg ae1b22273f
[Doc] update readem for aishell/asr0, test=doc
3 years ago
xiongxinlei a8244dc5b0 update the note, test=doc
3 years ago
KP 80b1fb9839 Update RESULTS.md. test=doc
3 years ago
KP 34b77a9db1 Update RESULTS.md. test=doc
3 years ago
huangyuxin fd7a50d5a0 add new cer tools, test=asr
3 years ago
Hui Zhang 2b7ca6f261
Update RESULTS.md
3 years ago
Hui Zhang 7ca40ff008
Merge pull request #1668 from PaddlePaddle/Jackwaterveg-patch-1
3 years ago
Honei 89791d7aca
Merge pull request #1663 from Honei/model
3 years ago
Jackwaterveg 82cd7015d7
test=doc
3 years ago
Jackwaterveg 5bb36472e8
test=doc
3 years ago
Jackwaterveg eeae00cc04
test=doc
3 years ago
TianYuan d592f25279 add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
TianYuan 7aecb2c4bb add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei d064c8196e update the speaker verification model, test=doc
3 years ago
KP 079ac5caa0
Update README.md
3 years ago
xiongxinlei 38e4e9c893 refactor voxceleb2 data download, test=doc
3 years ago
ccrrong 7a03f36548 code format, test=doc
3 years ago
ccrrong 378fe5909f add ami diarization pipeline, test=doc
3 years ago
xiongxinlei acebfad7b7 change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei 57c11dcab0 add some annotations, test=doc
3 years ago
xiongxinlei 30b5b3cb9e add vector csv dataset format, test=doc
3 years ago
xiongxinlei 5b05300e53 train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei 9944fec3d4 convert rirs noise to csv file
3 years ago
TianYuan 78219cef7b add cnndecoder pretrained model, test=doc
3 years ago
TianYuan 4d7cd0e063 add streaming synthesize, test=tts
3 years ago
xiongxinlei ec24a169ee convert jsonfile to csv file
3 years ago
TianYuan 005aa4066c Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan 0fc79f474d add CNNDecoder, test=tts
3 years ago
TianYuan 318edec303
Merge pull request #1613 from yt605155624/restructure_expand
3 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
3 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
3 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
Honei d60856b1ed
Merge pull request #1614 from Honei/vox12
3 years ago
xiongxinlei ed7113f320 change the vector output to numpy.array
3 years ago
Jackwaterveg 5db7e6382a
test=doc
3 years ago
TianYuan e52fc08c58 update readme, test=doc
3 years ago
TianYuan bc5ae43d3a restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
Jackwaterveg 64e12e949a
Update RESULTS.md
3 years ago
Jackwaterveg 1e35007925
test=doc
3 years ago
xiongxinlei ef1bc5e815 vector cli output dim info, test=doc
3 years ago
xiongxinlei 1fdb36f757 add mode emb dim info, test=doc
3 years ago
xiongxinlei ad2caf2ccb add speaker verification demo and doc, test=doc
3 years ago
Honei 305bacdcf2
Merge branch 'develop' into vox12
3 years ago
TianYuan 8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
3 years ago
TianYuan 342b487383 update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang 4051e7b762 fix compliance test bug, and format
3 years ago
xiongxinlei e2684e71f2 refactor the data prepare process
3 years ago
Jackwaterveg 5c1283289e
[Doc] Updata doc
3 years ago
Jackwaterveg c07d248afd
test=doc
3 years ago
Jackwaterveg 13ac21b705
Update RESULTS.md
3 years ago
xiongxinlei 5221c2797f add voxceleb dataset and trial info, test=doc
3 years ago
Jackwaterveg 5d3c760eae
Update RESULTS.md
3 years ago
Jackwaterveg fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
Hui Zhang b5315657ff
Merge pull request #1509 from qingen/cluster
3 years ago
TianYuan 490300f84f Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_ljspeech_hifigan
3 years ago
TianYuan c36039ce32 update readme for ljspeech hifigan, test=tts
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
TianYuan 6469568d2a update readme for vctk hifigan, test=tts
3 years ago
TianYuan 9497c93fb0 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vctk_hifigan
3 years ago
TianYuan d9127601b6 update readme for vctk hifigan, test=tts
3 years ago
huangyuxin e991d82ae7 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
xiongxinlei d28ccfa96b add vector cli component, test=doc
3 years ago
TianYuan 5ab2601759 update readme for aishell3 hifigan, test=tts
3 years ago
TianYuan c4035f8c43 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_aishell3_hifigan
3 years ago
TianYuan 8d9197817a add hifigan in syn of aishell3, test=tts
3 years ago
TianYuan 13242d015e
update run.sh, test=doc
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
TianYuan 4c517fa8a6
update preprocess.sh in aishell3 vc0, test=doc
3 years ago
TianYuan bf587ba879
update synthesize_e2e.sh, test=tts
3 years ago
qingen 0f7ede11ef Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen d16ce21d47 [wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei 506d26a957 change the code style to s2t code style, test=doc
3 years ago
xiongxinlei 7eb8fa72a1 convert save_freq to save_interval, test=doc
3 years ago
xiongxinlei 311fa87a11 add some comments to the code
3 years ago
xiongxinlei 8ed5c287a3 add vox2 data into VoxCeleb class
3 years ago
xiongxinlei 584a2c0e39 add ecapa-tdnn config yaml file
3 years ago
Hui Zhang 67fc073b01
Merge pull request #1550 from yt605155624/fix_ss_dump_bug
3 years ago
TianYuan 589f780850 fix synthesize bug for speedyspeech, test=tts
3 years ago
xiongxinlei 993d6783d7 remove unused code, test=doc
3 years ago
xiongxinlei 0e87037f2c refactor to compilance paddleaudio
3 years ago
xiongxinlei 4473405f82 merge develop to vox12, test=doc
3 years ago
Honei 0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei 60d73bb7bd add state 0 to prepare the voxcele data and augment data
3 years ago
TianYuan a151935eaf add ljspeech hifigan, test=tts
3 years ago
xiongxinlei 14efbf5b15 check extract embedding result, test=doc
3 years ago
Hui Zhang 67dcff2f3f
Merge pull request #1545 from yt605155624/add_aishell3_hifigan
3 years ago
TianYuan 81d964f0a0 add vctk hifigan, test=tts
3 years ago
TianYuan 1410a84054 add aishell3 hifigan, test=tts
3 years ago
xiongxinlei 7db7eb8993 add extract audio embedding api, test=doc
3 years ago
xiongxinlei 2d89c80e6f add waveform augment pipeline, test=doc
3 years ago
qingen ff47ab1779
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
xiongxinlei 016ed6d69c repair the code according to the part comment, test=doc
3 years ago
xiongxinlei 97ec01260b add speaker verification using cosine score, test=doc
3 years ago
xiongxinlei 1f74af110b add training log info and comment, test=doc
3 years ago
xiongxinlei 4648059b5f add training process for sid, test=doc
3 years ago
xiongxinlei 7668f61422 add sid dataloader for training, test=doc
3 years ago
xiongxinlei 6af2bc3d5b add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei 57c4f4a68c add sid learning rate and training model
3 years ago
xiongxinlei 3a943ca95b repair the variable name bug
3 years ago
xiongxinlei 0780d181d2 remove personal code test=doc
3 years ago
xiongxinlei 7ef60ebae2 add voxceleb1 data prepare
3 years ago
Honei 1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan 175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
3 years ago
TianYuan 641984ae30 add code annotation, test=tts
3 years ago
TianYuan cb07bd2a94 add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
qingen d01d7fedce [wip][vec] add clustering of vectors #1304
3 years ago
qingen c962eec51d [wip][vec] add clustering of vectors #1304
3 years ago