Commit Graph

194 Commits (c94f346207c70fbb3c7eef2776e8d165a07eeba1)

Author SHA1 Message Date
Jerryuhoo c94f346207 move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
3 years ago
Jerryuhoo e239ee1cd2 add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
TianYuan 680eac02b9 [tts]Update mb melgan (#1272)
3 years ago
TianYuan 98ce69d0aa Merge pull request #1259 from jerryuhoo/develop
3 years ago
huangyuxin ffadbe22a7 merge the develop, test=asr
3 years ago
JiehangXie bdc48114a4 Update text_normlization.py
3 years ago
JiehangXie d88ceef7bc Fix punctuation bug
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
JiehangXie 6065b1b607 Fix punctuation bug
3 years ago
Jerry 0719698118 Merge branch 'develop' into develop
3 years ago
AdamBear 36c9eaa437 Cache the TextFeaturizer instance for infer speed improvement. (#1260)
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo 2dccd5315d remove useless "other" dataset
3 years ago
Jerryuhoo f191d0b022 change speaker embedding position
3 years ago
Jerryuhoo 11991b6d35 add multi-speaker support for speedyspeech
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
TianYuan b9a55262f1 Update fastspeech2.py
3 years ago
Hui Zhang c81a3f0f83 [s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
3 years ago
Junkun Chen 420709e5ce [st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
TianYuan fbe3c05137 add style_melgan and hifigan in tts cli, test=tts (#1241)
3 years ago
TianYuan a232cd8b12 Update fastspeech2.py
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin fb6d1e2c11 merge the develop
3 years ago
huangyuxin 2c5902d7c5 rename decoding to decode
3 years ago
TianYuan 42c109216d [tts]add style melgan pretraied model (#1228)
3 years ago
Hui Zhang bb2a370b23 [asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago
TianYuan 5692b0ff04 fix log for t2s (#1219)
3 years ago
TianYuan b031ee43c4 Merge pull request #1215 from yt605155624/refactor_punc
3 years ago
TianYuan e1798e1eeb update
3 years ago
KP d362d28d35 Remove logging file in cli api.
3 years ago
TianYuan 15b8904fa1 refactor punc
3 years ago
JiehangXie 927c9bbdb6 Fix a bug when sentence inputed contain English words
3 years ago
KP 1632af7706 Update examples/esc50. (#1203)
3 years ago
Jerryuhoo 3cbfd7bf35 Add speaker embedding and speaker id for style fastspeech2 inference
3 years ago
Hui Zhang db121226b8 clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
3 years ago
Hui Zhang d852aee2ff [asr] logfbank with dither (#1179)
3 years ago
KP 9ec2bc8e2e Update README. test=doc_fix
3 years ago
Jackwaterveg 879857332d [version]add paddlespeech.__version__ (#1166)
3 years ago
TianYuan 19ef7210a0 [TTS]Add hifigan (#1097)
3 years ago
TianYuan 675cff258b [TTS]fix praatio version, test=tts (#1158)
3 years ago
Jackwaterveg e9748faa71 [Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154)
3 years ago
Jackwaterveg 2bccde3def update the version of ctcdecoders and feat,test=doc_fix (#1155)
3 years ago
Jackwaterveg 0151f2463f fix bug of pad_sequence in u2,test=asr (#1153)
3 years ago
Jackwaterveg 68164dd39f [asr]rename test_hub to test_wav (#1132)
3 years ago
KP 16d6ed3842 Add automatic_video_subtitiles demo.
3 years ago
KP 7394a18732 Add default arguments in cls python api.
3 years ago
TianYuan f9efbf3063 Update generate_lexicon.py
3 years ago