Commit Graph

884 Commits (83e10fadd0d88a55e60b3ab94a34e034d1873a1f)

Author SHA1 Message Date
xiongxinlei ad2caf2ccb add speaker verification demo and doc, test=doc
3 years ago
Honei 305bacdcf2 Merge branch 'develop' into vox12
3 years ago
TianYuan 8938483529 Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
3 years ago
TianYuan 342b487383 update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang 4051e7b762 fix compliance test bug, and format
3 years ago
xiongxinlei e2684e71f2 refactor the data prepare process
3 years ago
Jackwaterveg 5c1283289e [Doc] Updata doc
3 years ago
Jackwaterveg c07d248afd test=doc
3 years ago
Jackwaterveg 13ac21b705 Update RESULTS.md
3 years ago
xiongxinlei 5221c2797f add voxceleb dataset and trial info, test=doc
3 years ago
Jackwaterveg 5d3c760eae Update RESULTS.md
3 years ago
Jackwaterveg fcc1762048 Merge pull request #1577 from Jackwaterveg/change_init
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
Hui Zhang b5315657ff Merge pull request #1509 from qingen/cluster
3 years ago
TianYuan 490300f84f Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_ljspeech_hifigan
3 years ago
TianYuan c36039ce32 update readme for ljspeech hifigan, test=tts
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
TianYuan 6469568d2a update readme for vctk hifigan, test=tts
3 years ago
TianYuan 9497c93fb0 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vctk_hifigan
3 years ago
TianYuan d9127601b6 update readme for vctk hifigan, test=tts
3 years ago
huangyuxin e991d82ae7 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
xiongxinlei d28ccfa96b add vector cli component, test=doc
3 years ago
TianYuan 5ab2601759 update readme for aishell3 hifigan, test=tts
3 years ago
TianYuan c4035f8c43 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_aishell3_hifigan
3 years ago
TianYuan 8d9197817a add hifigan in syn of aishell3, test=tts
3 years ago
TianYuan 13242d015e update run.sh, test=doc
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
TianYuan 4c517fa8a6 update preprocess.sh in aishell3 vc0, test=doc
3 years ago
TianYuan bf587ba879 update synthesize_e2e.sh, test=tts
3 years ago
qingen 0f7ede11ef Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen d16ce21d47 [wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei 506d26a957 change the code style to s2t code style, test=doc
3 years ago
xiongxinlei 7eb8fa72a1 convert save_freq to save_interval, test=doc
3 years ago
xiongxinlei 311fa87a11 add some comments to the code
3 years ago
xiongxinlei 8ed5c287a3 add vox2 data into VoxCeleb class
3 years ago
xiongxinlei 584a2c0e39 add ecapa-tdnn config yaml file
3 years ago
Hui Zhang 67fc073b01 Merge pull request #1550 from yt605155624/fix_ss_dump_bug
3 years ago
TianYuan 589f780850 fix synthesize bug for speedyspeech, test=tts
3 years ago
xiongxinlei 993d6783d7 remove unused code, test=doc
3 years ago
xiongxinlei 0e87037f2c refactor to compilance paddleaudio
3 years ago
xiongxinlei 4473405f82 merge develop to vox12, test=doc
3 years ago
Honei 0dee8f40e9 Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei 60d73bb7bd add state 0 to prepare the voxcele data and augment data
3 years ago
TianYuan a151935eaf add ljspeech hifigan, test=tts
3 years ago
xiongxinlei 14efbf5b15 check extract embedding result, test=doc
3 years ago
Hui Zhang 67dcff2f3f Merge pull request #1545 from yt605155624/add_aishell3_hifigan
3 years ago
TianYuan 81d964f0a0 add vctk hifigan, test=tts
3 years ago
TianYuan 1410a84054 add aishell3 hifigan, test=tts
3 years ago
xiongxinlei 7db7eb8993 add extract audio embedding api, test=doc
3 years ago
xiongxinlei 2d89c80e6f add waveform augment pipeline, test=doc
3 years ago
qingen ff47ab1779 Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
xiongxinlei 016ed6d69c repair the code according to the part comment, test=doc
3 years ago
xiongxinlei 97ec01260b add speaker verification using cosine score, test=doc
3 years ago
xiongxinlei 1f74af110b add training log info and comment, test=doc
3 years ago
xiongxinlei 4648059b5f add training process for sid, test=doc
3 years ago
xiongxinlei 7668f61422 add sid dataloader for training, test=doc
3 years ago
xiongxinlei 6af2bc3d5b add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei 57c4f4a68c add sid learning rate and training model
3 years ago
xiongxinlei 3a943ca95b repair the variable name bug
3 years ago
xiongxinlei 0780d181d2 remove personal code test=doc
3 years ago
xiongxinlei 7ef60ebae2 add voxceleb1 data prepare
3 years ago
Honei 1395b5f5fa Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan 175c39b4a4 Merge pull request #1511 from yt605155624/pre_fix_for_streaming
3 years ago
TianYuan 641984ae30 add code annotation, test=tts
3 years ago
TianYuan cb07bd2a94 add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
qingen d01d7fedce [wip][vec] add clustering of vectors #1304
3 years ago
qingen c962eec51d [wip][vec] add clustering of vectors #1304
3 years ago
TianYuan 66a8beb27f update text frontend, test=tts
3 years ago
xiongxinlei 35b7968ed1 remove invalid directory
3 years ago
xiongxinlei 16108de71e add voxceleb1 dataset prepare process
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
Junkun 97e2015242 update run sh
3 years ago
Junkun 1169ffa480 add config files
3 years ago
Junkun 70166c2026 mv json_to_manifest to utils
3 years ago
Junkun af2b20650e update mustc v1
3 years ago
Junkun 0165c450ad update script
3 years ago
TianYuan b5a7c2d080 update readme for aishell3_vc0, test=doc
3 years ago
TianYuan ebef10efcd Update README.md
3 years ago
TianYuan a8cac30bd6 Create README.md
3 years ago
Hui Zhang d07fd5bd43 upadte data info
3 years ago
TianYuan b6fbacdd9b Merge pull request #1436 from yt605155624/rename_tacotron2
3 years ago
TianYuan e7adc854d4 Update README.md
3 years ago
TianYuan 6a19de44ff Update README.md
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
Hui Zhang 8eb708e754 Merge pull request #1417 from Honei/develop
3 years ago
TianYuan 25347bb6a3 rename tacotron2, test=tts
3 years ago
xiongxinlei d7a09ff71c repair the annotation of make voxceleb trial script
3 years ago
TianYuan ea29275acd fix dead links, test=doc
3 years ago
TianYuan 270fe4fdfc Merge pull request #1430 from kslz/develop
3 years ago
Hui Zhang dcfc32f1ec Merge pull request #1379 from yt605155624/new_wavernn
3 years ago
TianYuan 0747600c95 [TTS]add ljspeech new tacotron2 (#1416)
3 years ago
TianYuan 348a1a33bf update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419)
3 years ago
lizi be2fc2cc11 Modify typesetting, test=doc
3 years ago
lizi 5e34cdbd6e Modify typesetting, test=doc
3 years ago
lizi 06e8bdf0d7 add Chinese doc for "FastSpeech2 with CSMSC", test=doc
3 years ago
Honei 940602adbe convert voxceleb trial to kaldi format trial
3 years ago
TianYuan 1b0c034134 update wavernn, test=tts
3 years ago
TianYuan 89e69ee10e [TTS]fix tacotron2 dygraph to static (#1414)
3 years ago
TianYuan 001afee644 fix wavernn dygraph to static , test=tts
3 years ago
TianYuan 2844f388dc [doc ]add tacotron2 readme (#1385)
3 years ago
TianYuan 2071774d81 add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan 4c3e57a23c align preprocess of wavernn, test=tts
3 years ago
qingen 7413c9e48a Merge pull request #1335 from qingen/test-pr
3 years ago
qingen 9c2a23e15e [vector] add AMI data preparation scripts
3 years ago
TianYuan fb0acd40a2 add wavernn, test=tts
3 years ago
Jackwaterveg d7222c0453 [ASR] Support CTC decoder online (#821)
3 years ago
qingen 1899200cae [vector] add AMI data preparation scripts
3 years ago
qingen 9d32f62f48 [vector] add AMI data preparation scripts
3 years ago
TianYuan 49fd55dc16 Merge pull request #1366 from Jackwaterveg/fix
3 years ago
TianYuan 9764535d3d Update run.sh
3 years ago
Jackwaterveg 89a5c4ec5b Update run.sh
3 years ago
huangyuxin baccedee54 fix g2p, test=doc
3 years ago
Hui Zhang b4f621b9d5 add esc50 reference
3 years ago
Jackwaterveg 2082b89d12 Update chunk_decode.yaml
3 years ago
TianYuan 96323816e9 fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan 1bf1a876ae Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan 41d24337cb fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan 1a9e59612a fix fastspeech2 multi speaker to static, test=tts
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
TianYuan d50d195145 update frontend readme, test=doc
3 years ago
TianYuan 8f507ba4ba Merge pull request #1302 from jerryuhoo/develop
3 years ago
TianYuan 89e988a69e add csmsc tacotron2, test=tts
3 years ago
TianYuan c088b9a304 add csmsc tacotron2
3 years ago
Jackwaterveg e7189b216c Update chunk_decode.yaml
3 years ago
Jerryuhoo 75c2bd5faf fix link_wav.py path, test=tts
3 years ago
TianYuan fb238d83f4 update vctk voc1, test=tts (#1294)
3 years ago
Jackwaterveg 9c1e098693 [Asr][Config] fix config (#1293)
3 years ago
Jackwaterveg 494d6f8b6b [ASR][Config]fix config (#1290)
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago
Jerryuhoo d6e9b76e76 change link_wav.py path, test=tts
3 years ago
Jerryuhoo ea8977555f Simplify link_wav.py path
3 years ago
Jerryuhoo c94f346207 move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
3 years ago
Jerryuhoo 76f98c6f69 add --dataset and --rootdir to voc3 finetune script
3 years ago
Jerryuhoo e239ee1cd2 add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
TianYuan 680eac02b9 [tts]Update mb melgan (#1272)
3 years ago
Jackwaterveg 66a615555d revise aishell_asr0 Result, test=doc_fix
3 years ago
Hui Zhang 2316e5cb8a Update README.md
3 years ago
TianYuan 98ce69d0aa Merge pull request #1259 from jerryuhoo/develop
3 years ago
huangyuxin 455bf477a4 fix some bug, test=asr
3 years ago
huangyuxin ffadbe22a7 merge the develop, test=asr
3 years ago
huangyuxin d5f05edc2e fix some bug, test=asr
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
Jerryuhoo 1323242e2d Merge branch 'develop' of https://github.com/jerryuhoo/PaddleSpeech into develop
3 years ago
Jerryuhoo 6327949790 add speaker dict path
3 years ago
limingshu 50752f8bc4 first commit (#1261)
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo f191d0b022 change speaker embedding position
3 years ago
Jerryuhoo 11991b6d35 add multi-speaker support for speedyspeech
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
TianYuan 326fcd520a fix config, test=tts
3 years ago
Junkun Chen 420709e5ce [st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin fb6d1e2c11 merge the develop
3 years ago
huangyuxin 960658f669 add the whole of aishell asr1
3 years ago
TianYuan 42c109216d [tts]add style melgan pretraied model (#1228)
3 years ago
Hui Zhang bb2a370b23 [asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago
TianYuan 5692b0ff04 fix log for t2s (#1219)
3 years ago
TianYuan bef481e010 Update README.md
3 years ago
TianYuan b031ee43c4 Merge pull request #1215 from yt605155624/refactor_punc
3 years ago
TianYuan e1798e1eeb update
3 years ago
TianYuan 8587384f9d update readme
3 years ago
TianYuan 15b8904fa1 refactor punc
3 years ago
KP 759e840d5d [Doc]Updata RESULTS.md. test=doc_fix (#1205)
3 years ago
KP 1632af7706 Update examples/esc50. (#1203)
3 years ago
Hui Zhang db121226b8 clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
3 years ago
KP 00ddeb2159 Updata README of punc example. test=doc_fix
3 years ago
TianYuan 42b2c013e2 Update README.md
3 years ago
Hui Zhang d852aee2ff [asr] logfbank with dither (#1179)
3 years ago
TianYuan 9be59e9cef update readme, test=doc_fix
3 years ago
TianYuan b71657f37e update hifigan readme, test=doc_fix
3 years ago
TianYuan d607629e1a update hifigan readme, test=doc_fix
3 years ago
Jerryuhoo 4871c48924 Fix README.md typo
3 years ago
TianYuan 19ef7210a0 [TTS]Add hifigan (#1097)
3 years ago
TianYuan 675cff258b [TTS]fix praatio version, test=tts (#1158)
3 years ago
TianYuan 69138a2c85 update readme, test=doc_fix (#1156)
3 years ago
Jackwaterveg 989a89f4a8 fix the test_wav,test=asr (#1148)
3 years ago
Jackwaterveg 2c4177051b test=asr (#1140)
3 years ago
Junkun Chen 9d28f86dc1 update timit result, test=doc_fix (#1147)
3 years ago
Jackwaterveg 9ff12d3ffc st1,test=doc_fix (#1145)
3 years ago
Jackwaterveg 6970ac726a [README] st0, test=doc_fix (#1144)
3 years ago
Jackwaterveg bf54bd629f [README]add for librispeech asr2 (#1141)
3 years ago
Jackwaterveg 2ace03030a fix the run.sh, test=doc_fix (#1139)
3 years ago
Jackwaterveg 6b606fc602 [READEME] tiny asr1 (#1138)
3 years ago
Jackwaterveg 96fa8889be [README] tiny asr0 (#1137)
3 years ago
Jackwaterveg 14d2cf9d74 [READEME] librispeech asr0 (#1136)
3 years ago
Jackwaterveg fba8186c1f [README]aishell_asr0 (#1135)
3 years ago
Jackwaterveg 68164dd39f [asr]rename test_hub to test_wav (#1132)
3 years ago
Hui Zhang 41704e1f90 Merge pull request #1130 from PaddlePaddle/Jackwaterveg-patch-3
3 years ago
Jackwaterveg 34fd26bd39 [Readme] librispeech asr1 (#1129)
3 years ago
Jackwaterveg d5f999f9de Update README.md
3 years ago
KP 074559fe90 [CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
3 years ago
Hui Zhang c4a79ccea4 [asr] update librispeech conformer result (#1116)
3 years ago
TianYuan 84025c5ffe Rename READEME.md to README.md
3 years ago
Hui Zhang b1c80c45e0 remove ctc grad norm type in config
3 years ago
Hui Zhang aa04e2652f rm uesless comment
3 years ago
TianYuan 963e906f56 Merge pull request #1068 from yt605155624/add_style_melgan
3 years ago
TianYuan 797e08343c Update README.md
3 years ago
TianYuan 9b6482cc2a Update README.md
3 years ago
Jackwaterveg 2827f040ec Merge pull request #1079 from zh794390558/rsl
3 years ago
Hui Zhang 7992aa6623 update librispeech asr1 transformer result
3 years ago
TianYuan 5d8446b17c rm big sources in demos
3 years ago
Hui Zhang 2bbc4db508 fix install
3 years ago
TianYuan 075aeee7f0 add style_melgan readme, test=tts
3 years ago
TianYuan a0f74ef63f add style_melgan readme, test=tts
3 years ago
TianYuan 7bfafc8310 add style_melgan readme, test=tts
3 years ago
TianYuan a070524d37 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
3 years ago
TianYuan dd36eafe34 add style_melgan
3 years ago
Hui Zhang 581a545c69 Update RESULTS.md
3 years ago
Hui Zhang 27087de5e9 update librispeech asr1 transformer result
3 years ago
Junkun 1f3357f2d2 minor
3 years ago
Junkun 72a8c9337c update data process
3 years ago
Jackwaterveg cfed8d0182 Merge pull request #1061 from LittleChenCc/develop
3 years ago
Hui Zhang ecbe785e47 remove ctc grad norm option
3 years ago
Hui Zhang 5d626aa6b4 fix tiny conf
3 years ago
Junkun f50a2ab4ca fix bugs
3 years ago
Hui Zhang 3e19978194 Merge pull request #1054 from zh794390558/visual
3 years ago
Jerryuhoo 13411d8a26 fix readme typo
3 years ago
Hui Zhang 39228864bb format code
3 years ago
Junkun aea1e92a3d update cmd.sh
3 years ago
Junkun 3e5fc3dd54 Merge branch 'develop' of https://github.com/LittleChenCc/DeepSpeech into develop
3 years ago
Junkun Chen 2301fed1b4 Merge branch 'PaddlePaddle:develop' into develop
3 years ago
Junkun f225b1d88e minor updates
3 years ago
TianYuan 2de7bc14b0 Update finetune.yaml
3 years ago
TianYuan 507c3b52ea Update default.yaml
3 years ago
Junkun 351e4e8e87 training script
3 years ago
Junkun 3c8e87344a update run scripts
3 years ago
Junkun e867f3bb41 minor
3 years ago
Junkun 48207c1410 process scripts and configs
3 years ago
Junkun 8f3280af8e fix data process
3 years ago
Junkun 6a50211c80 data process for ted-en-zh st1
3 years ago
huangyuxin b48bc4e046 fix the run.sh
3 years ago
huangyuxin dcc2390323 merge the develop branch and do the revising
3 years ago
huangyuxin 895a086fdd rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
Hui Zhang a1f5db8d7f Merge pull request #1037 from Jackwaterveg/dev
3 years ago
TianYuan 022f1ce8e9 Merge pull request #1040 from yt605155624/fix_frontend
3 years ago
huangyuxin b6a466ceea upload the demo audio_file
3 years ago
huangyuxin ef27a0e18a Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into dev
3 years ago
Hui Zhang 32afa23e50 Merge pull request #1041 from zh794390558/ctc
3 years ago
Hui Zhang 396db4a56a update librispeech asr1-2 result; add warpctc source link in ctc topic
3 years ago
TianYuan dad1cbbcd6 update text frontend
3 years ago
KP 6e1ac1cc15 Add paddlespeech.cls and esc50 example.
3 years ago
KP 33f0e7622c Add paddlespeech.cls and esc50 example.
3 years ago
KP dfdc19fb49 Add paddlespeech.cls and esc50 example.
3 years ago
KP 2c531d78ac Add paddlespeech.cls and esc50 example.
3 years ago
KP bdb3ce23ee Add paddlespeech.cls and esc50 example.
3 years ago
KP eb68b3d800 Add paddlespeech.cls and esc50 example.
3 years ago