Commit Graph

878 Commits (de0f99150a653e8184df3dca02b139f5ff6198d4)

Author SHA1 Message Date
Jackwaterveg ee96fb40f0
test=doc
3 years ago
Jackwaterveg a22f29ba10
test=doc
3 years ago
Jackwaterveg ae1b22273f
[Doc] update readem for aishell/asr0, test=doc
3 years ago
xiongxinlei a8244dc5b0 update the note, test=doc
3 years ago
KP 80b1fb9839 Update RESULTS.md. test=doc
3 years ago
KP 34b77a9db1 Update RESULTS.md. test=doc
3 years ago
huangyuxin fd7a50d5a0 add new cer tools, test=asr
3 years ago
Hui Zhang 2b7ca6f261
Update RESULTS.md
3 years ago
Hui Zhang 7ca40ff008
Merge pull request #1668 from PaddlePaddle/Jackwaterveg-patch-1
3 years ago
Honei 89791d7aca
Merge pull request #1663 from Honei/model
3 years ago
Jackwaterveg 82cd7015d7
test=doc
3 years ago
Jackwaterveg 5bb36472e8
test=doc
3 years ago
Jackwaterveg eeae00cc04
test=doc
3 years ago
TianYuan d592f25279 add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
TianYuan 7aecb2c4bb add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei d064c8196e update the speaker verification model, test=doc
3 years ago
KP 079ac5caa0
Update README.md
3 years ago
xiongxinlei 38e4e9c893 refactor voxceleb2 data download, test=doc
3 years ago
ccrrong 7a03f36548 code format, test=doc
3 years ago
ccrrong 378fe5909f add ami diarization pipeline, test=doc
3 years ago
xiongxinlei acebfad7b7 change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei 57c11dcab0 add some annotations, test=doc
3 years ago
xiongxinlei 30b5b3cb9e add vector csv dataset format, test=doc
3 years ago
xiongxinlei 5b05300e53 train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei 9944fec3d4 convert rirs noise to csv file
3 years ago
TianYuan 78219cef7b add cnndecoder pretrained model, test=doc
3 years ago
TianYuan 4d7cd0e063 add streaming synthesize, test=tts
3 years ago
xiongxinlei ec24a169ee convert jsonfile to csv file
3 years ago
TianYuan 005aa4066c Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan 0fc79f474d add CNNDecoder, test=tts
3 years ago
TianYuan 318edec303
Merge pull request #1613 from yt605155624/restructure_expand
3 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
3 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
3 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
Honei d60856b1ed
Merge pull request #1614 from Honei/vox12
3 years ago
xiongxinlei ed7113f320 change the vector output to numpy.array
3 years ago
Jackwaterveg 5db7e6382a
test=doc
3 years ago
TianYuan e52fc08c58 update readme, test=doc
3 years ago
TianYuan bc5ae43d3a restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
Jackwaterveg 64e12e949a
Update RESULTS.md
3 years ago
Jackwaterveg 1e35007925
test=doc
3 years ago
xiongxinlei ef1bc5e815 vector cli output dim info, test=doc
3 years ago
xiongxinlei 1fdb36f757 add mode emb dim info, test=doc
3 years ago
xiongxinlei ad2caf2ccb add speaker verification demo and doc, test=doc
3 years ago
Honei 305bacdcf2
Merge branch 'develop' into vox12
3 years ago
TianYuan 8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
3 years ago
TianYuan 342b487383 update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang 4051e7b762 fix compliance test bug, and format
3 years ago
xiongxinlei e2684e71f2 refactor the data prepare process
3 years ago
Jackwaterveg 5c1283289e
[Doc] Updata doc
3 years ago
Jackwaterveg c07d248afd
test=doc
3 years ago
Jackwaterveg 13ac21b705
Update RESULTS.md
3 years ago
xiongxinlei 5221c2797f add voxceleb dataset and trial info, test=doc
3 years ago
Jackwaterveg 5d3c760eae
Update RESULTS.md
3 years ago
Jackwaterveg fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
Hui Zhang b5315657ff
Merge pull request #1509 from qingen/cluster
3 years ago
TianYuan 490300f84f Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_ljspeech_hifigan
3 years ago
TianYuan c36039ce32 update readme for ljspeech hifigan, test=tts
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
TianYuan 6469568d2a update readme for vctk hifigan, test=tts
3 years ago
TianYuan 9497c93fb0 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vctk_hifigan
3 years ago
TianYuan d9127601b6 update readme for vctk hifigan, test=tts
3 years ago
huangyuxin e991d82ae7 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
xiongxinlei d28ccfa96b add vector cli component, test=doc
3 years ago
TianYuan 5ab2601759 update readme for aishell3 hifigan, test=tts
3 years ago
TianYuan c4035f8c43 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_aishell3_hifigan
3 years ago
TianYuan 8d9197817a add hifigan in syn of aishell3, test=tts
3 years ago
TianYuan 13242d015e
update run.sh, test=doc
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
TianYuan 4c517fa8a6
update preprocess.sh in aishell3 vc0, test=doc
3 years ago
TianYuan bf587ba879
update synthesize_e2e.sh, test=tts
3 years ago
qingen 0f7ede11ef Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen d16ce21d47 [wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei 506d26a957 change the code style to s2t code style, test=doc
3 years ago
xiongxinlei 7eb8fa72a1 convert save_freq to save_interval, test=doc
3 years ago
xiongxinlei 311fa87a11 add some comments to the code
3 years ago
xiongxinlei 8ed5c287a3 add vox2 data into VoxCeleb class
3 years ago
xiongxinlei 584a2c0e39 add ecapa-tdnn config yaml file
3 years ago
Hui Zhang 67fc073b01
Merge pull request #1550 from yt605155624/fix_ss_dump_bug
3 years ago
TianYuan 589f780850 fix synthesize bug for speedyspeech, test=tts
3 years ago
xiongxinlei 993d6783d7 remove unused code, test=doc
3 years ago
xiongxinlei 0e87037f2c refactor to compilance paddleaudio
3 years ago
xiongxinlei 4473405f82 merge develop to vox12, test=doc
3 years ago
Honei 0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei 60d73bb7bd add state 0 to prepare the voxcele data and augment data
3 years ago
TianYuan a151935eaf add ljspeech hifigan, test=tts
3 years ago
xiongxinlei 14efbf5b15 check extract embedding result, test=doc
3 years ago
Hui Zhang 67dcff2f3f
Merge pull request #1545 from yt605155624/add_aishell3_hifigan
3 years ago
TianYuan 81d964f0a0 add vctk hifigan, test=tts
3 years ago
TianYuan 1410a84054 add aishell3 hifigan, test=tts
3 years ago
xiongxinlei 7db7eb8993 add extract audio embedding api, test=doc
3 years ago
xiongxinlei 2d89c80e6f add waveform augment pipeline, test=doc
3 years ago
qingen ff47ab1779
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
xiongxinlei 016ed6d69c repair the code according to the part comment, test=doc
3 years ago
xiongxinlei 97ec01260b add speaker verification using cosine score, test=doc
3 years ago
xiongxinlei 1f74af110b add training log info and comment, test=doc
3 years ago
xiongxinlei 4648059b5f add training process for sid, test=doc
3 years ago
xiongxinlei 7668f61422 add sid dataloader for training, test=doc
3 years ago
xiongxinlei 6af2bc3d5b add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei 57c4f4a68c add sid learning rate and training model
3 years ago
xiongxinlei 3a943ca95b repair the variable name bug
3 years ago
xiongxinlei 0780d181d2 remove personal code test=doc
3 years ago
xiongxinlei 7ef60ebae2 add voxceleb1 data prepare
3 years ago
Honei 1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan 175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
3 years ago
TianYuan 641984ae30 add code annotation, test=tts
3 years ago
TianYuan cb07bd2a94 add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
qingen d01d7fedce [wip][vec] add clustering of vectors #1304
3 years ago
qingen c962eec51d [wip][vec] add clustering of vectors #1304
3 years ago
TianYuan 66a8beb27f update text frontend, test=tts
3 years ago
xiongxinlei 35b7968ed1 remove invalid directory
3 years ago
xiongxinlei 16108de71e add voxceleb1 dataset prepare process
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
Junkun 97e2015242 update run sh
3 years ago
Junkun 1169ffa480 add config files
3 years ago
Junkun 70166c2026 mv json_to_manifest to utils
3 years ago
Junkun af2b20650e update mustc v1
3 years ago
Junkun 0165c450ad update script
3 years ago
TianYuan b5a7c2d080 update readme for aishell3_vc0, test=doc
3 years ago
TianYuan ebef10efcd
Update README.md
3 years ago
TianYuan a8cac30bd6
Create README.md
3 years ago
Hui Zhang d07fd5bd43 upadte data info
3 years ago
TianYuan b6fbacdd9b
Merge pull request #1436 from yt605155624/rename_tacotron2
3 years ago
TianYuan e7adc854d4
Update README.md
3 years ago
TianYuan 6a19de44ff
Update README.md
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
Hui Zhang 8eb708e754
Merge pull request #1417 from Honei/develop
3 years ago
TianYuan 25347bb6a3 rename tacotron2, test=tts
3 years ago
xiongxinlei d7a09ff71c repair the annotation of make voxceleb trial script
3 years ago
TianYuan ea29275acd fix dead links, test=doc
3 years ago
TianYuan 270fe4fdfc
Merge pull request #1430 from kslz/develop
3 years ago
Hui Zhang dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
3 years ago
TianYuan 0747600c95
[TTS]add ljspeech new tacotron2 (#1416)
3 years ago
TianYuan 348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419)
3 years ago
lizi be2fc2cc11 Modify typesetting, test=doc
3 years ago
lizi 5e34cdbd6e Modify typesetting, test=doc
3 years ago
lizi 06e8bdf0d7 add Chinese doc for "FastSpeech2 with CSMSC", test=doc
3 years ago
Honei 940602adbe convert voxceleb trial to kaldi format trial
3 years ago
TianYuan 1b0c034134 update wavernn, test=tts
3 years ago
TianYuan 89e69ee10e
[TTS]fix tacotron2 dygraph to static (#1414)
3 years ago
TianYuan 001afee644 fix wavernn dygraph to static , test=tts
3 years ago
TianYuan 2844f388dc
[doc ]add tacotron2 readme (#1385)
3 years ago
TianYuan 2071774d81 add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan 4c3e57a23c align preprocess of wavernn, test=tts
3 years ago
qingen 7413c9e48a
Merge pull request #1335 from qingen/test-pr
3 years ago
qingen 9c2a23e15e [vector] add AMI data preparation scripts
3 years ago
TianYuan fb0acd40a2 add wavernn, test=tts
3 years ago
Jackwaterveg d7222c0453
[ASR] Support CTC decoder online (#821)
3 years ago
qingen 1899200cae [vector] add AMI data preparation scripts
3 years ago
qingen 9d32f62f48 [vector] add AMI data preparation scripts
3 years ago
TianYuan 49fd55dc16
Merge pull request #1366 from Jackwaterveg/fix
3 years ago
TianYuan 9764535d3d
Update run.sh
3 years ago
Jackwaterveg 89a5c4ec5b
Update run.sh
3 years ago
huangyuxin baccedee54 fix g2p, test=doc
3 years ago
Hui Zhang b4f621b9d5
add esc50 reference
3 years ago
Jackwaterveg 2082b89d12
Update chunk_decode.yaml
3 years ago
TianYuan 96323816e9 fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan 1bf1a876ae Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan 41d24337cb fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan 1a9e59612a fix fastspeech2 multi speaker to static, test=tts
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
TianYuan d50d195145 update frontend readme, test=doc
3 years ago
TianYuan 8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
3 years ago
TianYuan 89e988a69e add csmsc tacotron2, test=tts
3 years ago
TianYuan c088b9a304 add csmsc tacotron2
3 years ago
Jackwaterveg e7189b216c
Update chunk_decode.yaml
3 years ago
Jerryuhoo 75c2bd5faf fix link_wav.py path, test=tts
3 years ago
TianYuan fb238d83f4
update vctk voc1, test=tts (#1294)
3 years ago
Jackwaterveg 9c1e098693
[Asr][Config] fix config (#1293)
3 years ago
Jackwaterveg 494d6f8b6b
[ASR][Config]fix config (#1290)
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago
Jerryuhoo d6e9b76e76 change link_wav.py path, test=tts
3 years ago
Jerryuhoo ea8977555f Simplify link_wav.py path
3 years ago
Jerryuhoo c94f346207 move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
3 years ago
Jerryuhoo 76f98c6f69 add --dataset and --rootdir to voc3 finetune script
3 years ago
Jerryuhoo e239ee1cd2 add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
TianYuan 680eac02b9
[tts]Update mb melgan (#1272)
3 years ago
Jackwaterveg 66a615555d
revise aishell_asr0 Result, test=doc_fix
3 years ago
Hui Zhang 2316e5cb8a
Update README.md
3 years ago
TianYuan 98ce69d0aa
Merge pull request #1259 from jerryuhoo/develop
3 years ago
huangyuxin 455bf477a4 fix some bug, test=asr
3 years ago
huangyuxin ffadbe22a7 merge the develop, test=asr
3 years ago
huangyuxin d5f05edc2e fix some bug, test=asr
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
Jerryuhoo 1323242e2d Merge branch 'develop' of https://github.com/jerryuhoo/PaddleSpeech into develop
3 years ago
Jerryuhoo 6327949790 add speaker dict path
3 years ago
limingshu 50752f8bc4
first commit (#1261)
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo f191d0b022 change speaker embedding position
3 years ago
Jerryuhoo 11991b6d35 add multi-speaker support for speedyspeech
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
TianYuan 326fcd520a fix config, test=tts
3 years ago
Junkun Chen 420709e5ce
[st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin fb6d1e2c11 merge the develop
3 years ago