huangyuxin
|
3845804cc9
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
|
3 years ago |
TianYuan
|
96323816e9
|
fix yamls, change labels to stop_labels, test=tts
|
3 years ago |
TianYuan
|
1bf1a876ae
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
|
3 years ago |
TianYuan
|
3fd7a7790b
|
add typehit for updater and evaluator, test=tts
|
3 years ago |
huangyuxin
|
4e31247633
|
refacto the code
|
3 years ago |
TianYuan
|
41d24337cb
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
TianYuan
|
1a9e59612a
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
huangyuxin
|
565a63c5ef
|
refactor the setup in paddleaudio
|
3 years ago |
huangyuxin
|
eb91ce84f9
|
refactor the version
|
3 years ago |
Hui Zhang
|
4a133619a1
|
Merge pull request #1356 from Jackwaterveg/CLI
[CLI] asr, Add Deepspeech2 online and offline model
|
3 years ago |
Hui Zhang
|
d4acf4704f
|
Merge pull request #1350 from LittleChenCc/develop
[ST] beam search with optimality guarantees
|
3 years ago |
huangyuxin
|
ab759b16de
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
|
3 years ago |
huangyuxin
|
38edfd1a89
|
Add Deepspeech2 online and offline in cli
|
3 years ago |
TianYuan
|
d368d57d67
|
fix low ips bug of speedyspeech and fastspeech2, test=tts (#1349)
|
3 years ago |
TianYuan
|
9c7f0762b0
|
update racotron2 and transformer tts, test=tts
|
3 years ago |
huangyuxin
|
8028f33b7f
|
synchronize the version
|
3 years ago |
Junkun
|
44408e5211
|
sync the variable name to others
|
3 years ago |
Junkun
|
f866059b74
|
config and formalize
|
3 years ago |
Junkun
|
43aad7a018
|
beam search with optimality guarantees
|
3 years ago |
Jackwaterveg
|
26524031d2
|
Merge pull request #1343 from Jackwaterveg/fix
[ASR] Fix some bugs
|
3 years ago |
huangyuxin
|
5e7e8a3e24
|
fix the u2 export, test=asr
|
3 years ago |
TianYuan
|
a1867c20c3
|
fix slice bug of speedyspeech expand, test=tts (#1337)
|
3 years ago |
Hui Zhang
|
ec1c88ae1a
|
[s2t] remove nltk (#1332)
|
3 years ago |
TianYuan
|
7ae4f7221e
|
Update length_regulator.py
|
3 years ago |
TianYuan
|
acfe2b9084
|
Update duration_predictor.py
|
3 years ago |
TianYuan
|
caa391f461
|
fix speedyspeech inference, test=tts (#1322)
|
3 years ago |
Jackwaterveg
|
0c4895cd0b
|
mv the ctcdecoders to third_part (#1313)
|
3 years ago |
TianYuan
|
8f507ba4ba
|
Merge pull request #1302 from jerryuhoo/develop
[TTS] Add support for finetuning speedyspeech
|
3 years ago |
Jerryuhoo
|
111a452378
|
Fix the code format, test=tts
|
3 years ago |
TianYuan
|
89e988a69e
|
add csmsc tacotron2, test=tts
|
3 years ago |
TianYuan
|
c088b9a304
|
add csmsc tacotron2
|
3 years ago |
huangyuxin
|
fe1dc9d211
|
refactor the cli/st, test=st
|
3 years ago |
TianYuan
|
27bb76bdb9
|
fix tone_sandhi of yi, test=tts
|
3 years ago |
Jerryuhoo
|
be99807d61
|
Add durations to gen_gta_mel.py inference
|
3 years ago |
KP
|
52a8b2f320
|
Add ECAPA_TDNN. (#1301)
|
3 years ago |
Jerryuhoo
|
fcc34e3e95
|
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
|
3 years ago |
Jackwaterveg
|
010aa65b2b
|
[cli] asr - support English, decode_metod and unified config (#1297)
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
|
3 years ago |
KP
|
c09466ebbe
|
Add ECAPA_TDNN. (#1295)
|
3 years ago |
TianYuan
|
fb238d83f4
|
update vctk voc1, test=tts (#1294)
|
3 years ago |
TianYuan
|
73dc0e2535
|
fix_ning
|
3 years ago |
billishyahao
|
ddf184be60
|
fix some typos
|
3 years ago |
TianYuan
|
318cc9e539
|
Merge branch 'develop' into develop
|
3 years ago |
Jackwaterveg
|
e69abc9265
|
Merge pull request #1273 from zh794390558/batch_sampler
[s2t] Fix Batch sampler set epoch
|
3 years ago |
KP
|
a810cd4e5c
|
Add cli logging. (#1274)
|
3 years ago |
Jerryuhoo
|
d6e9b76e76
|
change link_wav.py path, test=tts
|
3 years ago |
Jerryuhoo
|
c94f346207
|
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
|
3 years ago |
Jerryuhoo
|
e239ee1cd2
|
add multi-speaker support for finetuning hifigan vocoder
|
3 years ago |
huangyuxin
|
07d457859d
|
use pre-commit, test=doc_fix
|
3 years ago |
Hui Zhang
|
45832f6770
|
fix default dist_samlper to False
|
3 years ago |
Hui Zhang
|
3a2db414e6
|
format code
|
3 years ago |
Hui Zhang
|
6f651d762e
|
fix batch sampler set_epoch when epcoh start
|
3 years ago |
TianYuan
|
680eac02b9
|
[tts]Update mb melgan (#1272)
* update mb melgan
* update mb melgan, test=tts
|
3 years ago |
TianYuan
|
98ce69d0aa
|
Merge pull request #1259 from jerryuhoo/develop
[TTS]Add multi-speaker support for the SpeedySpeech model
|
3 years ago |
huangyuxin
|
ffadbe22a7
|
merge the develop, test=asr
|
3 years ago |
JiehangXie
|
bdc48114a4
|
Update text_normlization.py
|
3 years ago |
JiehangXie
|
d88ceef7bc
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
huangyuxin
|
8b63485ce3
|
fix some bug, test=asr
|
3 years ago |
JiehangXie
|
6065b1b607
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
Jerry
|
0719698118
|
Merge branch 'develop' into develop
|
3 years ago |
AdamBear
|
36c9eaa437
|
Cache the TextFeaturizer instance for infer speed improvement. (#1260)
|
3 years ago |
huangyuxin
|
3e2cc898cb
|
remove default cfg and fix some bugs,test=asr
|
3 years ago |
Jerryuhoo
|
2dccd5315d
|
remove useless "other" dataset
|
3 years ago |
Jerryuhoo
|
f191d0b022
|
change speaker embedding position
Change speaker embedding position into the encoder.
|
3 years ago |
Jerryuhoo
|
11991b6d35
|
add multi-speaker support for speedyspeech
|
3 years ago |
huangyuxin
|
a1d8ab0f99
|
merge the develop
|
3 years ago |
huangyuxin
|
c907a8deda
|
change all recipes
|
3 years ago |
TianYuan
|
b9a55262f1
|
Update fastspeech2.py
|
3 years ago |
Hui Zhang
|
c81a3f0f83
|
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
* batchsampler or distributebatchsampler
* format
|
3 years ago |
Junkun Chen
|
420709e5ce
|
[st] Distributed sampler and new dataloader with MIMO (#1239)
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
|
3 years ago |
TianYuan
|
fbe3c05137
|
add style_melgan and hifigan in tts cli, test=tts (#1241)
|
3 years ago |
TianYuan
|
a232cd8b12
|
Update fastspeech2.py
|
3 years ago |
huangyuxin
|
41eeed0450
|
add librispeech asr1
|
3 years ago |
huangyuxin
|
fb6d1e2c11
|
merge the develop
|
3 years ago |
huangyuxin
|
2c5902d7c5
|
rename decoding to decode
|
3 years ago |
TianYuan
|
42c109216d
|
[tts]add style melgan pretraied model (#1228)
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
Hui Zhang
|
bb2a370b23
|
[asr] remove useless conf of librispeech (#1227)
* remve useless conf
* format code
* update conf
* update conf
* update conf
|
3 years ago |
huangyuxin
|
c40b6f4062
|
refactor the train and test config,test=asr
|
3 years ago |
TianYuan
|
5692b0ff04
|
fix log for t2s (#1219)
|
3 years ago |
TianYuan
|
b031ee43c4
|
Merge pull request #1215 from yt605155624/refactor_punc
[text]Refactor punc
|
3 years ago |
TianYuan
|
e1798e1eeb
|
update
|
3 years ago |
KP
|
d362d28d35
|
Remove logging file in cli api.
|
3 years ago |
TianYuan
|
15b8904fa1
|
refactor punc
|
3 years ago |
JiehangXie
|
927c9bbdb6
|
Fix a bug when sentence inputed contain English words
|
3 years ago |
KP
|
1632af7706
|
Update examples/esc50. (#1203)
|
3 years ago |
Jerryuhoo
|
3cbfd7bf35
|
Add speaker embedding and speaker id for style fastspeech2 inference
|
3 years ago |
Hui Zhang
|
db121226b8
|
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
|
3 years ago |
Hui Zhang
|
d852aee2ff
|
[asr] logfbank with dither (#1179)
* fix logfbank dither
* format
|
3 years ago |
KP
|
9ec2bc8e2e
|
Update README. test=doc_fix
|
3 years ago |
Jackwaterveg
|
879857332d
|
[version]add paddlespeech.__version__ (#1166)
* add paddlespeech.__version__
* version 0.1.0 is ready
|
3 years ago |
TianYuan
|
19ef7210a0
|
[TTS]Add hifigan (#1097)
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
|
3 years ago |
TianYuan
|
675cff258b
|
[TTS]fix praatio version, test=tts (#1158)
* fix praatio version, test=tts
* fix praatio version, test=tts
|
3 years ago |
Jackwaterveg
|
e9748faa71
|
[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154)
* optimize the cli/asr,test=asr
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
2bccde3def
|
update the version of ctcdecoders and feat,test=doc_fix (#1155)
|
3 years ago |
Jackwaterveg
|
0151f2463f
|
fix bug of pad_sequence in u2,test=asr (#1153)
|
3 years ago |
Jackwaterveg
|
68164dd39f
|
[asr]rename test_hub to test_wav (#1132)
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
|
3 years ago |
KP
|
16d6ed3842
|
Add automatic_video_subtitiles demo.
|
3 years ago |
KP
|
7394a18732
|
Add default arguments in cls python api.
|
3 years ago |
TianYuan
|
f9efbf3063
|
Update generate_lexicon.py
|
3 years ago |
Jackwaterveg
|
5b446f6321
|
[Config]clear the u2 decode config for asr (#1107)
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
|
3 years ago |
KP
|
074559fe90
|
[CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
* Refactor punctuation_restoration.
* Add text cli and punc demo.
|
3 years ago |