TianYuan
|
b9a55262f1
|
Update fastspeech2.py
|
3 years ago |
Hui Zhang
|
c81a3f0f83
|
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
* batchsampler or distributebatchsampler
* format
|
3 years ago |
Junkun Chen
|
420709e5ce
|
[st] Distributed sampler and new dataloader with MIMO (#1239)
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
|
3 years ago |
TianYuan
|
fbe3c05137
|
add style_melgan and hifigan in tts cli, test=tts (#1241)
|
3 years ago |
TianYuan
|
a232cd8b12
|
Update fastspeech2.py
|
3 years ago |
TianYuan
|
42c109216d
|
[tts]add style melgan pretraied model (#1228)
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
Hui Zhang
|
bb2a370b23
|
[asr] remove useless conf of librispeech (#1227)
* remve useless conf
* format code
* update conf
* update conf
* update conf
|
3 years ago |
TianYuan
|
5692b0ff04
|
fix log for t2s (#1219)
|
3 years ago |
TianYuan
|
b031ee43c4
|
Merge pull request #1215 from yt605155624/refactor_punc
[text]Refactor punc
|
3 years ago |
TianYuan
|
e1798e1eeb
|
update
|
3 years ago |
KP
|
d362d28d35
|
Remove logging file in cli api.
|
3 years ago |
TianYuan
|
15b8904fa1
|
refactor punc
|
3 years ago |
JiehangXie
|
927c9bbdb6
|
Fix a bug when sentence inputed contain English words
|
3 years ago |
KP
|
1632af7706
|
Update examples/esc50. (#1203)
|
3 years ago |
Jerryuhoo
|
3cbfd7bf35
|
Add speaker embedding and speaker id for style fastspeech2 inference
|
3 years ago |
Hui Zhang
|
db121226b8
|
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
|
3 years ago |
Hui Zhang
|
d852aee2ff
|
[asr] logfbank with dither (#1179)
* fix logfbank dither
* format
|
3 years ago |
KP
|
9ec2bc8e2e
|
Update README. test=doc_fix
|
3 years ago |
Jackwaterveg
|
879857332d
|
[version]add paddlespeech.__version__ (#1166)
* add paddlespeech.__version__
* version 0.1.0 is ready
|
3 years ago |
TianYuan
|
19ef7210a0
|
[TTS]Add hifigan (#1097)
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
|
3 years ago |
TianYuan
|
675cff258b
|
[TTS]fix praatio version, test=tts (#1158)
* fix praatio version, test=tts
* fix praatio version, test=tts
|
3 years ago |
Jackwaterveg
|
e9748faa71
|
[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154)
* optimize the cli/asr,test=asr
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
2bccde3def
|
update the version of ctcdecoders and feat,test=doc_fix (#1155)
|
3 years ago |
Jackwaterveg
|
0151f2463f
|
fix bug of pad_sequence in u2,test=asr (#1153)
|
3 years ago |
Jackwaterveg
|
68164dd39f
|
[asr]rename test_hub to test_wav (#1132)
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
|
3 years ago |
KP
|
16d6ed3842
|
Add automatic_video_subtitiles demo.
|
3 years ago |
KP
|
7394a18732
|
Add default arguments in cls python api.
|
3 years ago |
TianYuan
|
f9efbf3063
|
Update generate_lexicon.py
|
3 years ago |
Jackwaterveg
|
5b446f6321
|
[Config]clear the u2 decode config for asr (#1107)
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
|
3 years ago |
KP
|
074559fe90
|
[CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
* Refactor punctuation_restoration.
* Add text cli and punc demo.
|
3 years ago |
Hui Zhang
|
51d7a07c6d
|
format and fix pre-commit (#1120)
|
3 years ago |
TianYuan
|
5f0f76f249
|
add eval() for inference model (#1114)
|
3 years ago |
TianYuan
|
59e4a34071
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli
|
3 years ago |
TianYuan
|
3de4130dfc
|
update am name
|
3 years ago |
TianYuan
|
9db1710ba7
|
add conformer demos (#1108)
|
3 years ago |
TianYuan
|
3fe75f833d
|
Merge pull request #1109 from yt605155624/tts_cli
[cli]update voc name
|
3 years ago |
TianYuan
|
ca12a83d5a
|
update voc name
|
3 years ago |
TianYuan
|
965a57ef0e
|
Update README.md
|
3 years ago |
Jackwaterveg
|
9e31a606d1
|
set default encoding utf8 for win (#1101)
Co-authored-by: KP <109694228@qq.com>
|
3 years ago |
Hui Zhang
|
764a5d4271
|
Merge branch 'develop' into ctc
|
3 years ago |
Hui Zhang
|
b1c80c45e0
|
remove ctc grad norm type in config
|
3 years ago |
huangyuxin
|
1d4002409f
|
separate the sox and soxbindings with the requirements
|
3 years ago |
TianYuan
|
df5fe035e5
|
Update README.md
|
3 years ago |
TianYuan
|
a6e0a69da8
|
Merge pull request #1095 from KPatr1ck/demo
[Demo]Add tts demo.
|
3 years ago |
TianYuan
|
963e906f56
|
Merge pull request #1068 from yt605155624/add_style_melgan
[TTS]add style_melgan
|
3 years ago |
KP
|
1909f2f620
|
Add tts demo.
|
3 years ago |
KP
|
3701fba0be
|
Update download logic and fix README typos.
|
3 years ago |
TianYuan
|
f701882b66
|
update add_style_melgan
|
3 years ago |
gongel
|
dc60aeb8c2
|
format
|
3 years ago |
gongel
|
31510d088c
|
refactor: rm kaldi_io
|
3 years ago |