TianYuan
|
9764535d3d
|
Update run.sh
|
3 years ago |
Jackwaterveg
|
89a5c4ec5b
|
Update run.sh
|
3 years ago |
huangyuxin
|
baccedee54
|
fix g2p, test=doc
|
3 years ago |
Hui Zhang
|
b4f621b9d5
|
add esc50 reference
|
3 years ago |
Jackwaterveg
|
2082b89d12
|
Update chunk_decode.yaml
|
3 years ago |
TianYuan
|
96323816e9
|
fix yamls, change labels to stop_labels, test=tts
|
3 years ago |
TianYuan
|
1bf1a876ae
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
|
3 years ago |
TianYuan
|
41d24337cb
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
TianYuan
|
1a9e59612a
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
Junkun
|
44408e5211
|
sync the variable name to others
|
3 years ago |
Junkun
|
f866059b74
|
config and formalize
|
3 years ago |
TianYuan
|
d50d195145
|
update frontend readme, test=doc
|
3 years ago |
TianYuan
|
8f507ba4ba
|
Merge pull request #1302 from jerryuhoo/develop
[TTS] Add support for finetuning speedyspeech
|
3 years ago |
TianYuan
|
89e988a69e
|
add csmsc tacotron2, test=tts
|
3 years ago |
TianYuan
|
c088b9a304
|
add csmsc tacotron2
|
3 years ago |
Jackwaterveg
|
e7189b216c
|
Update chunk_decode.yaml
|
3 years ago |
Jerryuhoo
|
75c2bd5faf
|
fix link_wav.py path, test=tts
|
3 years ago |
TianYuan
|
fb238d83f4
|
update vctk voc1, test=tts (#1294)
|
3 years ago |
Jackwaterveg
|
9c1e098693
|
[Asr][Config] fix config (#1293)
* fix config, test=asr
* fix config, test=doc_fix
|
3 years ago |
Jackwaterveg
|
494d6f8b6b
|
[ASR][Config]fix config (#1290)
* fix config, test=asr
* fix config, test=doc_fix
|
3 years ago |
billishyahao
|
ddf184be60
|
fix some typos
|
3 years ago |
Jerryuhoo
|
d6e9b76e76
|
change link_wav.py path, test=tts
|
3 years ago |
Jerryuhoo
|
ea8977555f
|
Simplify link_wav.py path
|
3 years ago |
Jerryuhoo
|
c94f346207
|
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
|
3 years ago |
Jerryuhoo
|
76f98c6f69
|
add --dataset and --rootdir to voc3 finetune script
add --dataset and --rootdir to voc3 finetune script, also remove --speaker-dict in voc5.
|
3 years ago |
Jerryuhoo
|
e239ee1cd2
|
add multi-speaker support for finetuning hifigan vocoder
|
3 years ago |
huangyuxin
|
07d457859d
|
use pre-commit, test=doc_fix
|
3 years ago |
TianYuan
|
680eac02b9
|
[tts]Update mb melgan (#1272)
* update mb melgan
* update mb melgan, test=tts
|
3 years ago |
Jackwaterveg
|
66a615555d
|
revise aishell_asr0 Result, test=doc_fix
|
3 years ago |
Hui Zhang
|
2316e5cb8a
|
Update README.md
|
3 years ago |
TianYuan
|
98ce69d0aa
|
Merge pull request #1259 from jerryuhoo/develop
[TTS]Add multi-speaker support for the SpeedySpeech model
|
3 years ago |
huangyuxin
|
455bf477a4
|
fix some bug, test=asr
|
3 years ago |
huangyuxin
|
ffadbe22a7
|
merge the develop, test=asr
|
3 years ago |
huangyuxin
|
d5f05edc2e
|
fix some bug, test=asr
|
3 years ago |
huangyuxin
|
8b63485ce3
|
fix some bug, test=asr
|
3 years ago |
Jerryuhoo
|
1323242e2d
|
Merge branch 'develop' of https://github.com/jerryuhoo/PaddleSpeech into develop
|
3 years ago |
Jerryuhoo
|
6327949790
|
add speaker dict path
|
3 years ago |
limingshu
|
50752f8bc4
|
first commit (#1261)
|
3 years ago |
huangyuxin
|
3e2cc898cb
|
remove default cfg and fix some bugs,test=asr
|
3 years ago |
Jerryuhoo
|
f191d0b022
|
change speaker embedding position
Change speaker embedding position into the encoder.
|
3 years ago |
Jerryuhoo
|
11991b6d35
|
add multi-speaker support for speedyspeech
|
3 years ago |
huangyuxin
|
a1d8ab0f99
|
merge the develop
|
3 years ago |
huangyuxin
|
c907a8deda
|
change all recipes
|
3 years ago |
TianYuan
|
326fcd520a
|
fix config, test=tts
|
3 years ago |
Junkun Chen
|
420709e5ce
|
[st] Distributed sampler and new dataloader with MIMO (#1239)
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
|
3 years ago |
huangyuxin
|
41eeed0450
|
add librispeech asr1
|
3 years ago |
huangyuxin
|
fb6d1e2c11
|
merge the develop
|
3 years ago |
huangyuxin
|
960658f669
|
add the whole of aishell asr1
|
3 years ago |
TianYuan
|
42c109216d
|
[tts]add style melgan pretrained model (#1228)
* add style melgan pretrained model
* add style melgan pretrained model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
Hui Zhang
|
bb2a370b23
|
[asr] remove useless conf of librispeech (#1227)
* remove useless conf
* format code
* update conf
* update conf
* update conf
|
3 years ago |
huangyuxin
|
c40b6f4062
|
refactor the train and test config,test=asr
|
3 years ago |
TianYuan
|
5692b0ff04
|
fix log for t2s (#1219)
|
3 years ago |
TianYuan
|
bef481e010
|
Update README.md
|
3 years ago |
TianYuan
|
b031ee43c4
|
Merge pull request #1215 from yt605155624/refactor_punc
[text]Refactor punc
|
3 years ago |
TianYuan
|
e1798e1eeb
|
update
|
3 years ago |
TianYuan
|
8587384f9d
|
update readme
|
3 years ago |
TianYuan
|
15b8904fa1
|
refactor punc
|
3 years ago |
KP
|
759e840d5d
|
[Doc]Update RESULTS.md. test=doc_fix (#1205)
* Update RESULTS.md. test=doc_fix
* Update RESULTS.md. test=doc_fix
|
3 years ago |
KP
|
1632af7706
|
Update examples/esc50. (#1203)
|
3 years ago |
Hui Zhang
|
db121226b8
|
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
|
3 years ago |
KP
|
00ddeb2159
|
Update README of punc example. test=doc_fix
|
3 years ago |
TianYuan
|
42b2c013e2
|
Update README.md
|
3 years ago |
Hui Zhang
|
d852aee2ff
|
[asr] logfbank with dither (#1179)
* fix logfbank dither
* format
|
3 years ago |
TianYuan
|
9be59e9cef
|
update readme, test=doc_fix
|
3 years ago |
TianYuan
|
b71657f37e
|
update hifigan readme, test=doc_fix
|
3 years ago |
TianYuan
|
d607629e1a
|
update hifigan readme, test=doc_fix
|
3 years ago |
Jerryuhoo
|
4871c48924
|
Fix README.md typo
|
3 years ago |
TianYuan
|
19ef7210a0
|
[TTS]Add hifigan (#1097)
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
|
3 years ago |
TianYuan
|
675cff258b
|
[TTS]fix praatio version, test=tts (#1158)
* fix praatio version, test=tts
* fix praatio version, test=tts
|
3 years ago |
TianYuan
|
69138a2c85
|
update readme, test=doc_fix (#1156)
|
3 years ago |
Jackwaterveg
|
989a89f4a8
|
fix the test_wav,test=asr (#1148)
|
3 years ago |
Jackwaterveg
|
2c4177051b
|
test=asr (#1140)
|
3 years ago |
Junkun Chen
|
9d28f86dc1
|
update timit result, test=doc_fix (#1147)
|
3 years ago |
Jackwaterveg
|
9ff12d3ffc
|
st1,test=doc_fix (#1145)
|
3 years ago |
Jackwaterveg
|
6970ac726a
|
[README] st0, test=doc_fix (#1144)
* test=doc_fix
* test=doc_fix
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
bf54bd629f
|
[README]add for librispeech asr2 (#1141)
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
2ace03030a
|
fix the run.sh, test=doc_fix (#1139)
|
3 years ago |
Jackwaterveg
|
6b606fc602
|
[README] tiny asr1 (#1138)
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
96fa8889be
|
[README] tiny asr0 (#1137)
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
14d2cf9d74
|
[README] librispeech asr0 (#1136)
* test=doc_fix
* test=doc_fix
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
fba8186c1f
|
[README]aishell_asr0 (#1135)
* test=doc_fix
* test=doc_fix
* test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
68164dd39f
|
[asr]rename test_hub to test_wav (#1132)
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
|
3 years ago |
Hui Zhang
|
41704e1f90
|
Merge pull request #1130 from PaddlePaddle/Jackwaterveg-patch-3
Update README.md
|
3 years ago |
Jackwaterveg
|
34fd26bd39
|
[Readme] librispeech asr1 (#1129)
* add the readme, librispeech_asr1
* Create READMe.md
test=doc_fix
* Update READMe.md
test=doc_fix
* Rename READMe.md to README.md
test=doc_fix
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
d5f999f9de
|
Update README.md
test=doc_fix
|
3 years ago |
KP
|
074559fe90
|
[CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
* Refactor punctuation_restoration.
* Add text cli and punc demo.
|
3 years ago |
Hui Zhang
|
c4a79ccea4
|
[asr] update librispeech conformer result (#1116)
* update librispeech result
* change conf order
|
3 years ago |
TianYuan
|
84025c5ffe
|
Rename READEME.md to README.md
|
3 years ago |
Hui Zhang
|
b1c80c45e0
|
remove ctc grad norm type in config
|
3 years ago |
Hui Zhang
|
aa04e2652f
|
rm useless comment
|
3 years ago |
TianYuan
|
963e906f56
|
Merge pull request #1068 from yt605155624/add_style_melgan
[TTS]add style_melgan
|
3 years ago |
TianYuan
|
797e08343c
|
Update README.md
|
3 years ago |
TianYuan
|
9b6482cc2a
|
Update README.md
|
3 years ago |
Jackwaterveg
|
2827f040ec
|
Merge pull request #1079 from zh794390558/rsl
[asr] update librispeech asr1 transformer result
|
3 years ago |
Hui Zhang
|
7992aa6623
|
update librispeech asr1 transformer result
|
3 years ago |
TianYuan
|
5d8446b17c
|
rm big sources in demos
|
3 years ago |
Hui Zhang
|
2bbc4db508
|
fix install
|
3 years ago |
TianYuan
|
075aeee7f0
|
add style_melgan readme, test=tts
|
3 years ago |
TianYuan
|
a0f74ef63f
|
add style_melgan readme, test=tts
|
3 years ago |
TianYuan
|
7bfafc8310
|
add style_melgan readme, test=tts
|
3 years ago |
TianYuan
|
a070524d37
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
|
3 years ago |
TianYuan
|
dd36eafe34
|
add style_melgan
|
3 years ago |
Hui Zhang
|
581a545c69
|
Update RESULTS.md
fix table header
|
3 years ago |
Hui Zhang
|
27087de5e9
|
update librispeech asr1 transformer result
|
3 years ago |
Junkun
|
1f3357f2d2
|
minor
|
3 years ago |
Junkun
|
72a8c9337c
|
update data process
|
3 years ago |
Jackwaterveg
|
cfed8d0182
|
Merge pull request #1061 from LittleChenCc/develop
[Bug Fix] fix bugs in the data reader
|
3 years ago |
Hui Zhang
|
ecbe785e47
|
remove ctc grad norm option
|
3 years ago |
Hui Zhang
|
5d626aa6b4
|
fix tiny conf
|
3 years ago |
Junkun
|
f50a2ab4ca
|
fix bugs
|
3 years ago |
Hui Zhang
|
3e19978194
|
Merge pull request #1054 from zh794390558/visual
[asr] using visualdl , jsonlines read manifest
|
3 years ago |
Jerryuhoo
|
13411d8a26
|
fix readme typo
|
3 years ago |
Hui Zhang
|
39228864bb
|
format code
|
3 years ago |
Junkun
|
aea1e92a3d
|
update cmd.sh
|
3 years ago |
Junkun
|
3e5fc3dd54
|
Merge branch 'develop' of https://github.com/LittleChenCc/DeepSpeech into develop
|
3 years ago |
Junkun Chen
|
2301fed1b4
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
Junkun
|
f225b1d88e
|
minor updates
|
3 years ago |
TianYuan
|
2de7bc14b0
|
Update finetune.yaml
|
3 years ago |
TianYuan
|
507c3b52ea
|
Update default.yaml
|
3 years ago |
Junkun
|
351e4e8e87
|
training script
|
3 years ago |
Junkun
|
3c8e87344a
|
update run scripts
|
3 years ago |
Junkun
|
e867f3bb41
|
minor
|
3 years ago |
Junkun
|
48207c1410
|
process scripts and configs
|
3 years ago |
Junkun
|
8f3280af8e
|
fix data process
|
3 years ago |
Junkun
|
6a50211c80
|
data process for ted-en-zh st1
|
3 years ago |
huangyuxin
|
b48bc4e046
|
fix the run.sh
|
3 years ago |
huangyuxin
|
dcc2390323
|
merge the develop branch and do the revising
|
3 years ago |
huangyuxin
|
895a086fdd
|
rename the config.feat_size and the config.vocab.size to input_size and output_size
|
3 years ago |
Hui Zhang
|
a1f5db8d7f
|
Merge pull request #1037 from Jackwaterveg/dev
[run.sh] fix the audio_file location in run.sh
|
3 years ago |
TianYuan
|
022f1ce8e9
|
Merge pull request #1040 from yt605155624/fix_frontend
[TTS]update text frontend
|
3 years ago |
huangyuxin
|
b6a466ceea
|
upload the demo audio_file
|
3 years ago |
huangyuxin
|
ef27a0e18a
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into dev
|
3 years ago |
Hui Zhang
|
32afa23e50
|
Merge pull request #1041 from zh794390558/ctc
[asr] update librispeech asr1-2 result; add warpctc source link in ctc topic
|
3 years ago |
Hui Zhang
|
396db4a56a
|
update librispeech asr1-2 result; add warpctc source link in ctc topic
|
3 years ago |
TianYuan
|
dad1cbbcd6
|
update text frontend
|
3 years ago |
KP
|
6e1ac1cc15
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
33f0e7622c
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
dfdc19fb49
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
2c531d78ac
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
bdb3ce23ee
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
eb68b3d800
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
1189117784
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
huangyuxin
|
5047e8786c
|
merge the develop
|
3 years ago |
TianYuan
|
b6ade97b32
|
Update README.md
|
3 years ago |
TianYuan
|
47434c1ac6
|
Update README.md
|
3 years ago |
Hui Zhang
|
2bbfdbae91
|
Merge pull request #1015 from yt605155624/fs2_conformer
[TTS]fastspeech2 conformer
|
3 years ago |
TianYuan
|
f9bd802eb0
|
Update README.md
|
3 years ago |
TianYuan
|
469329221b
|
refactor encoder, rm old code
|
3 years ago |
TianYuan
|
6a76ee00aa
|
Update README.md
|
3 years ago |
TianYuan
|
27b9a411f0
|
Update README.md
|
3 years ago |
TianYuan
|
14413f7464
|
Update README.md
|
3 years ago |
TianYuan
|
38f44ff736
|
Update README.md
|
3 years ago |
TianYuan
|
13d38942ec
|
Update README.md
|
3 years ago |
Hui Zhang
|
deffc958cf
|
support kaldi static
|
3 years ago |
Hui Zhang
|
712de751cb
|
Merge pull request #1036 from zh794390558/nproc
[asr] nproc to ngpu
|
3 years ago |
Hui Zhang
|
fd15d0daf8
|
Merge pull request #1035 from zh794390558/dataset
[asr] dataset to root dir
|
3 years ago |
huangyuxin
|
45ac9e0520
|
delete the unsupport
|
3 years ago |
huangyuxin
|
357a6723e0
|
fix the audio_file location in run.sh
|
3 years ago |
Hui Zhang
|
fe83adfbcb
|
nproc to ngpu
|
3 years ago |
Hui Zhang
|
6151800d04
|
fix dataset dir in data.sh
|
3 years ago |
Hui Zhang
|
cc7096dd27
|
examples/dataset to dataset
|
3 years ago |
Jackwaterveg
|
4d46cc9357
|
Merge pull request #1034 from zh794390558/rsl
[asr] rename to result.md
|
3 years ago |
Hui Zhang
|
733b0ce29a
|
rename to result.md
|
3 years ago |
Hui Zhang
|
789471bfca
|
test wav for u2
|
3 years ago |
huangyuxin
|
50cf88b7f1
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into doc
|
3 years ago |
Jackwaterveg
|
563568a2b8
|
Merge pull request #1031 from yt605155624/fix_docs
[TTS]update ipynb, add eval loss
|
3 years ago |
TianYuan
|
7d3985bff9
|
update table
|
3 years ago |
TianYuan
|
f3fbce005e
|
update ipynb, add eval loss
|
3 years ago |
Hui Zhang
|
042bbe5ed5
|
update ds2 offline result
|
3 years ago |
TianYuan
|
bc0dd51149
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
|
3 years ago |
Hui Zhang
|
b119cfe06d
|
fix preprocess of libri asr2
|
3 years ago |
huangyuxin
|
649fcc4c16
|
revise some programming mistakes
|
3 years ago |
huangyuxin
|
2274a07235
|
Merge branch 'develop' into doc
|
3 years ago |
Jackwaterveg
|
04cfcd96ca
|
Merge pull request #1023 from zh794390558/dict
[asr] put vocab into data/lang_char
|
3 years ago |
Jackwaterveg
|
88d4208430
|
Merge pull request #1022 from yt605155624/fix_tts_doc
[TTS]fix readme
|
3 years ago |
TianYuan
|
f5a3b21f45
|
fix readme
|
3 years ago |
Hui Zhang
|
cdeb5cf6b6
|
update librispeech transformer result
|
3 years ago |
Jackwaterveg
|
09931d2ccc
|
Merge pull request #1019 from zh794390558/feat
[bugfix] Kaldi Feature using dither in train
|
3 years ago |
huangyuxin
|
f765171111
|
add the readme for the run.sh in aishell asr1
|
3 years ago |
Hui Zhang
|
4f54e36294
|
vocab into data/lang_char
|
3 years ago |
gongel
|
3a31547516
|
refactor: rename t1 to st1
|
3 years ago |
gongel
|
d4ee5916b1
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into ted_en_zh_t0
|
3 years ago |
gongel
|
7cef93a6f4
|
refactor: update
|
3 years ago |
huangyuxin
|
8aebfeac81
|
fix the pre-commit
|
3 years ago |
Hui Zhang
|
56480e1033
|
fix format
|
3 years ago |
TianYuan
|
4537e900ef
|
Update README.md
|
3 years ago |
Jackwaterveg
|
524658a04f
|
Merge pull request #1018 from yt605155624/fix_url
[TTS]fix urls
|
3 years ago |
TianYuan
|
2d808a3c64
|
fix urls
|
3 years ago |
Hui Zhang
|
6750770e54
|
Merge pull request #1012 from zh794390558/datapipe
[asr] independent dataloader
|
3 years ago |
gongel
|
5b5c73f9bb
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into ted_en_zh_t0
|
3 years ago |
TianYuan
|
bdd2fb8f93
|
add aishell3/vc1 readme, add csmsc/voc1 readme
|
3 years ago |
Hui Zhang
|
2f4f744071
|
rename asr egs
|
3 years ago |
Hui Zhang
|
2ba3f00bbd
|
Merge branch 'develop' into datapipe
|
3 years ago |
Hui Zhang
|
b57b865989
|
rename egs
|
3 years ago |
Hui Zhang
|
b944418d6f
|
new format data support ds2/st
|
3 years ago |
Hui Zhang
|
02c7ef3198
|
format data support multi output
|
3 years ago |
Hui Zhang
|
e79e00a6b2
|
pack model
|
3 years ago |
Hui Zhang
|
0defc658e1
|
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
|
3 years ago |
TianYuan
|
4370c5cfa6
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
|
3 years ago |
Hui Zhang
|
a7858551b7
|
add utt2spk for all dataset
|
3 years ago |
Hui Zhang
|
638b96bf07
|
check if cmvn_file in config for u2
|
3 years ago |
TianYuan
|
ea81c772ce
|
Merge pull request #1010 from zh794390558/statis
[asr]disable export for u2
|
3 years ago |
Hui Zhang
|
a87ba13d93
|
disable export for u2
|
3 years ago |
Hui Zhang
|
c354e9154b
|
Merge pull request #1003 from yt605155624/fs2_ge2e
[TTS]add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
133ee7db0b
|
rename num_speakers
|
3 years ago |
TianYuan
|
3d5e078c91
|
add conformer
|
3 years ago |
TianYuan
|
a97c7b5206
|
rename spembs
|
3 years ago |
gongel
|
9f42ec4bc2
|
feat: add ted_en_zh t1
|
3 years ago |
Hui Zhang
|
b9790d03f2
|
add wenetspeech egs
|
3 years ago |
Hui Zhang
|
171fa353ee
|
refactor libri s2 conf
|
3 years ago |
Hui Zhang
|
26258949ab
|
Merge pull request #995 from yt605155624/mbmelgan_fine
[TTS]add multi-band melgan finetune scripts
|
3 years ago |
TianYuan
|
8d025451de
|
add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
c5c9f19091
|
rename to gen_gta_mel.py, remove stats compute when gen finetune data
|
3 years ago |
Zeyu Chen
|
4a28751df0
|
Formalize the terms in README
|
3 years ago |
Hui Zhang
|
3046a22719
|
aishell support utt2spk
|
3 years ago |
TianYuan
|
b9dc017011
|
Update synthesize_e2e.sh
|
3 years ago |
TianYuan
|
c4234b3ecd
|
Update synthesize.sh
|
3 years ago |
TianYuan
|
a6ac497f8e
|
add multi-band melgan finetune scripts
|
3 years ago |
TianYuan
|
39400e5ee8
|
Update synthesize.sh
|
3 years ago |
Hui Zhang
|
bc4e2e4ee2
|
Merge pull request #982 from Jackwaterveg/develop
Optimizer the hips while downloading the LM
|
3 years ago |
huangyuxin
|
754c0b560b
|
optimizer the hips of downloading LM
|
3 years ago |
TianYuan
|
30d09b411d
|
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
|
3 years ago |
Mingxue-Xu
|
f26db2e762
|
Update README.md
|
3 years ago |
Mingxue-Xu
|
6641b97d44
|
Update README.md
|
3 years ago |
TianYuan
|
0bc9450c51
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
Hui Zhang
|
81598e6ff0
|
default gpu 0 for scripts
|
3 years ago |
Junkun
|
7c8843448c
|
add word reward into beam search.
|
3 years ago |
Jackwaterveg
|
67551c6557
|
Add notes in example/aishell/s0/run.sh
|
3 years ago |
Hui Zhang
|
9a71c091c5
|
remove debug info and format code
|
3 years ago |
Hui Zhang
|
8b0e344c69
|
fix logfbank using PCM16
|
3 years ago |
Hui Zhang
|
d62092ac28
|
fix specaug param
|
3 years ago |
TianYuan
|
2931903add
|
Rename READEME.md to README.md
|
3 years ago |
huangyuxin
|
61ad2c87a7
|
update the ds2 online conf
|
3 years ago |
Hui Zhang
|
7b3a901b08
|
more conf with preprocess.yaml
|
3 years ago |
Hui Zhang
|
44743622d4
|
filter example; cmvn stride and window int; libri/s1 conf
|
3 years ago |
Hui Zhang
|
56d06f2aaf
|
Merge pull request #968 from yt605155624/merge_paddlespeech
[TTS] change nprocs to ngpu
|
3 years ago |
Hui Zhang
|
6a7e0265cd
|
add json global cmvn
|
3 years ago |
TianYuan
|
bacdf5756b
|
Merge remote-tracking branch 'origin/develop' into merge_paddlespeech
|
3 years ago |
Hui Zhang
|
69055698a2
|
transformer using batch data loader
|
3 years ago |
TianYuan
|
35c37ace17
|
change nprocs to ngpu, add aishell3/voc1
|
3 years ago |
huangyuxin
|
d647cde870
|
change the lm dataset dir
|
3 years ago |
Hui Zhang
|
3f3442b98a
|
remove useless third lib
|
3 years ago |
Hui Zhang
|
aba37810ff
|
update BZNSYP.rar link
|
3 years ago |
Hui Zhang
|
e2bcaee4f1
|
merge deepspeech, parakeet and text_processing into paddlespeech
|
3 years ago |
Jackwaterveg
|
782b0ddceb
|
Merge pull request #957 from PaddlePaddle/ds2_offline
revert ds2 offline rnn bw_cell to fw_cell which loss can be 5.73
|
3 years ago |
Hui Zhang
|
2fa681237f
|
Merge pull request #955 from Jackwaterveg/fix
fix the run_test in test_export
|
3 years ago |
Hui Zhang
|
4ce4e7926e
|
revert ds2 offline rnn bw_cell to fw_cell which loss can be 5.73, but is not the birnn
|
3 years ago |
huangyuxin
|
b966bb8a31
|
fix the run_test in test_export
|
3 years ago |
Hui Zhang
|
980944dab1
|
Merge pull request #952 from Jackwaterveg/dev_transformerLM
Add the feature: calculating the perplexity of transformerLM
|
3 years ago |
Hui Zhang
|
04d84a87ae
|
Merge pull request #948 from yt605155624/fs2_tostatic
fix fastspeech2 to static
|
3 years ago |
Hui Zhang
|
1372a08813
|
Merge pull request #953 from Jackwaterveg/fix_bug
[Bug fix] fix the bug of 'dev/null' and the test_export
|
3 years ago |
TianYuan
|
b68c9c05c4
|
fix fs2 inference bug
|
3 years ago |
huangyuxin
|
d64f6e9ea5
|
Add the feature: calculating the perplexity of transformerLM
|
3 years ago |
Jackwaterveg
|
8741da5a68
|
Update README.md
|
3 years ago |
huangyuxin
|
542ee3f070
|
add the model description in 1xt2x doc
|
3 years ago |
huangyuxin
|
02083cdbd6
|
fix the bug of 'dev/null' and the test_export
|
3 years ago |
TianYuan
|
fc8a7a152e
|
Merge pull request #951 from yt605155624/add_mbmelgan
[TTS] add global init for multi band melgan
|
3 years ago |
TianYuan
|
d3d9f83594
|
add global init for multi band melgan to avoid large output in the begin
|
3 years ago |
TianYuan
|
79e7a4d44e
|
align output of dygraph and static graph
|
3 years ago |
Hui Zhang
|
28519c1f44
|
Merge pull request #949 from Jackwaterveg/develop
fix the bug of choosing dataloader, remove the log of downloads lm, ch…
|
3 years ago |
huangyuxin
|
e66da76db9
|
fix the bug of choosing dataloader, remove the log of downloads lm, change the epoch in tiny
|
3 years ago |
TianYuan
|
9125d71a81
|
fix pwg inference
|
3 years ago |
TianYuan
|
36d60a717e
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into add_mbmelgan
|
3 years ago |
TianYuan
|
88668513b1
|
fix mv writer to visualdl in train
|
3 years ago |
TianYuan
|
670a68ad95
|
fix textfrontend readme, fix imgs link
|
3 years ago |
TianYuan
|
950d17cbcf
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into add_mbmelgan
|
3 years ago |
TianYuan
|
41526ca1b8
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
TianYuan
|
3f9e30c9b3
|
refactor docs
|
3 years ago |
TianYuan
|
304d71747a
|
Merge pull request #939 from Jackwaterveg/doc
fix the doc
|
3 years ago |
huangyuxin
|
cef36521f9
|
fix the doc
|
3 years ago |
Hui Zhang
|
0812a3df20
|
add more join ctc decode conf
|
3 years ago |
Hui Zhang
|
8370604084
|
Merge pull request #936 from PaddlePaddle/fix_lm
[asr] fix egs of transformer lm usage
|
3 years ago |
Hui Zhang
|
e4852e3bf9
|
Merge pull request #934 from yt605155624/fix_readme
[TTS]fix link in readme
|
3 years ago |
Hui Zhang
|
c89820e7b2
|
fix egs of transformer lm usage
|
3 years ago |
TianYuan
|
6dbcd7720d
|
add csmsc mb melgan example
|
3 years ago |
TianYuan
|
02055eb26a
|
fix link in readme
|
3 years ago |
Hui Zhang
|
b878027c9a
|
format code
|
3 years ago |
Hui Zhang
|
8cda812857
|
Merge branch 'develop' into join_ctc
|
3 years ago |
Hui Zhang
|
b7bdaf6f8f
|
add lm conf and load
|
3 years ago |
TianYuan
|
20226b4fdd
|
fix benchmark and chain, add parse_options in run.sh, move tacotron2_ge2e into voice_cloning
|
3 years ago |
Hui Zhang
|
8f869b4c1f
|
update gitignore
|
3 years ago |
Hui Zhang
|
a107b75bac
|
transform; librispeech/s2 data process ok
|
3 years ago |
TianYuan
|
2e9d9dc9a7
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into merge_parakeet
|
3 years ago |
TianYuan
|
3ce5dff460
|
refactor parakeet examples
|
3 years ago |
Hui Zhang
|
614a004c37
|
update librispeech/s2 result
|
3 years ago |
Hui Zhang
|
a37cfbfb96
|
add fbank/pitch conf
|
3 years ago |
Hui Zhang
|
7509dc4056
|
update path and flac
|
3 years ago |
Hui Zhang
|
871fc5b70d
|
more utils to support kaldi/espnet data process
|
3 years ago |
Hui Zhang
|
c5f6692191
|
update libri s2 result
|
3 years ago |
Hui Zhang
|
12f788dd0e
|
Merge branch 'develop' into join_ctc
|
3 years ago |
Hui Zhang
|
7cfb3334e3
|
Merge pull request #927 from PaddlePaddle/nn_ctc
[asr] not change ctc grad norm manually
|
3 years ago |
Hui Zhang
|
dfd80b3aa2
|
recog into decoders, format code
|
3 years ago |
Hui Zhang
|
a4e27da64b
|
decoder with ctc prefix score
|
3 years ago |
Hui Zhang
|
7d54ee4d1d
|
ctc_grad_norm_type by null
|
3 years ago |
Hui Zhang
|
30499a7654
|
not change ctc grad manual
|
3 years ago |
Hui Zhang
|
190f4cc4bc
|
update u2 result; fix test.sh
|
3 years ago |
huangyuxin
|
b1a90d4d7a
|
add hub for s1 in aishell and librispeech
|
3 years ago |
Hui Zhang
|
8539689b15
|
u2 kaldi wer4p0
|
3 years ago |
Hui Zhang
|
f55267f2b3
|
fix img link; rsl format;
|
3 years ago |
huangyuxin
|
bfda49bf40
|
fix the bug of benchmark after merge the parakeet, add the condition of using kaldi in aishell s1
|
3 years ago |
Hui Zhang
|
fa5531c03e
|
Merge pull request #908 from PaddlePaddle/speech
[TTS] merge parakeet repo into deepspeech
|
3 years ago |
Hui Zhang
|
b079577e08
|
merge parakeet repo into deepspeech
|
3 years ago |
Hui Zhang
|
50b2114b3b
|
fix error condition
|
3 years ago |
Hui Zhang
|
feaf71d468
|
u2 kaldi multi process test with batchsize one
|
3 years ago |
Jackwaterveg
|
aaa87698c4
|
Merge pull request #906 from PaddlePaddle/rsl
update librispeech conformer result
|
3 years ago |
Hui Zhang
|
b34da366ee
|
update librispeech conformer result
|
3 years ago |
Hui Zhang
|
302afed42a
|
update librispeech conformer transformer config
|
3 years ago |
Jackwaterveg
|
20488c56bc
|
Merge pull request #885 from PaddlePaddle/exp
compute cmvn before build vocab
|
3 years ago |
Hui Zhang
|
8ebd4245d7
|
fix detoken for char
|
3 years ago |
Hui Zhang
|
b10af1688c
|
update librispeech transformer test w/o length filter of test clean
|
3 years ago |
Hui Zhang
|
13a4bee8be
|
using simple test for multi decode type, and gpu
|
3 years ago |
Junkun
|
75bb1c0444
|
update timit result
|
3 years ago |
Hui Zhang
|
eef8847a82
|
compute cmvn before build vocab
|
3 years ago |
Hui Zhang
|
f5ec6e34c6
|
disable __pycache__
|
3 years ago |
Hui Zhang
|
37563d975e
|
ds2 model_type more info
|
3 years ago |
Hui Zhang
|
81f89c53e6
|
Merge pull request #872 from Jackwaterveg/Hub
change test chain and g2p
|
3 years ago |
Hui Zhang
|
d05baeb6b0
|
update ted zh en
|
3 years ago |
huangyuxin
|
f5159ba6bc
|
g2p
|
3 years ago |
Hui Zhang
|
251d32a609
|
fix timit scripts; reader filtype case;
|
3 years ago |
Junkun
|
46df01151f
|
Merge branch 'develop' of https://github.com/LittleChenCc/DeepSpeech into develop
|
3 years ago |
Junkun
|
a0c94209e2
|
update the result of timit
|
3 years ago |
Hui Zhang
|
4745e15ece
|
tiny run w cpu
|
3 years ago |
Hui Zhang
|
3e37cef8e1
|
fix test.sh opts
|
3 years ago |
Hui Zhang
|
b7b1bda34f
|
test refactor collator
|
3 years ago |
Junkun
|
c32cb734a6
|
update the result of TED-EN-ZH
|
3 years ago |
huangyuxin
|
1a46125175
|
add bin for hub
|
3 years ago |
Jackwaterveg
|
4b225b7602
|
Merge pull request #858 from PaddlePaddle/ctc
ctc using nn.Dropout; ds2 libri vald batch_size / 4
|
3 years ago |
Hui Zhang
|
9abf03bb6b
|
fix libri s1 transformer config
|
3 years ago |
Hui Zhang
|
88a198972f
|
Merge pull request #851 from Jackwaterveg/release_model
Optimize the 1xt2x code
|
3 years ago |
huangyuxin
|
d9a9126496
|
fix the run.sh in g2p/zh
|
3 years ago |
huangyuxin
|
30b3e237e2
|
optimize the 1xt2x
|
3 years ago |
huangyuxin
|
285e0c9cad
|
merge the change
|
3 years ago |
Hui Zhang
|
8e16315ada
|
librispeech s1 support multi process decode and sclite
|
3 years ago |
Hui Zhang
|
20178e0e09
|
librispeech s1 support sclite and multi process decode
|
3 years ago |
Hui Zhang
|
f29caf8dee
|
refactor ds 1.x exp
|
3 years ago |
Hui Zhang
|
9abe33b4bd
|
add score_sclite
|
3 years ago |
Hui Zhang
|
c6e8a33b73
|
fix set_device; more utils; args.opts support multi same name
|
3 years ago |
huangyuxin
|
264bba760b
|
fix the bug: read space as unk
|
3 years ago |
Hui Zhang
|
913b2300c3
|
nprocs 0 for cpu, other for gpu
|
3 years ago |
Hui Zhang
|
80eb6b7f01
|
fix espnet kaldi libri s2 config
|
3 years ago |
Hui Zhang
|
45a75acee1
|
Delete nohup_test.out
|
3 years ago |
huangyuxin
|
7e96942c58
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into release_model
|
3 years ago |
huangyuxin
|
f0184352f5
|
change the code format to 2.x style
|
3 years ago |
Hui Zhang
|
b9beea5ab1
|
fix bench
|
3 years ago |
Hui Zhang
|
15d26cc4ad
|
update u2 transformer config
|
3 years ago |
Hui Zhang
|
b381f5b447
|
fix profiler options config
|
3 years ago |
Hui Zhang
|
0e91d26ae3
|
fix log; add report to trainer
|
3 years ago |
huangyuxin
|
4c7fefd4e3
|
add transformed v1.8 model
|
3 years ago |
Hui Zhang
|
cda6ca8323
|
add benchmark flags, and logic
|
3 years ago |
Hui Zhang
|
7907319288
|
fix profiler
|
3 years ago |
Hui Zhang
|
5fdda953b9
|
add op profiling
|
3 years ago |
Hui Zhang
|
ec76df6cbc
|
do not set seed since break model convergence, aishell s0 seed 10086 test ok
|
3 years ago |
Hui Zhang
|
256e9c1b9c
|
more doc for egs
|
3 years ago |
Hui Zhang
|
3843372958
|
u2 with chainer updater
|
3 years ago |
Hui Zhang
|
28a0a64153
|
fix train.sh
|
3 years ago |
Hui Zhang
|
890a28f9bf
|
add more ctc conf
|
3 years ago |
Hui Zhang
|
41ed7a184c
|
add ctc conf
|
3 years ago |
Hui Zhang
|
1a8c5278a1
|
export ctc grad norm config
|
3 years ago |
Hui Zhang
|
7e136d0893
|
support no_sync for backward; ds support accum grad
|
3 years ago |
Hui Zhang
|
184d30dd9c
|
relase librispeech audio max len to 30 second
|
3 years ago |
Hui Zhang
|
d028c8416d
|
fix recipe train and avg shell
|
3 years ago |
huangyuxin
|
04d9db199f
|
add blank_id parameter
|
3 years ago |
Hui Zhang
|
f54dc983b6
|
using bw rnn in ds2
|
3 years ago |
Hui Zhang
|
7181e427af
|
Merge pull request #786 from Jackwaterveg/ds2_online
[Static model test] Add the test process for export model
|
3 years ago |
Hui Zhang
|
341038b626
|
ds2 offline cer 6p4287
|
3 years ago |
huangyuxin
|
7ab022e1cc
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
|
3 years ago |
Hui Zhang
|
673cc4a081
|
seed all with log; and format
|
3 years ago |
huangyuxin
|
92617f0802
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online_export
|
3 years ago |
Hui Zhang
|
d1db859657
|
fix dataloader pickle bugs
|
3 years ago |
huangyuxin
|
564b6b6824
|
fix conflict
|
3 years ago |
huangyuxin
|
40466ef669
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online
|
3 years ago |
Hui Zhang
|
715e90a9df
|
fix librispeech s0 specaug
|
3 years ago |
huangyuxin
|
b3d27e4bbb
|
merge the develop
|
3 years ago |
huangyuxin
|
b585684bf4
|
add function: test export
|
3 years ago |
Hui Zhang
|
3d9aebfaa3
|
fix specaug; add data static
|
3 years ago |
Hui Zhang
|
b56f899b76
|
Merge pull request #782 from PaddlePaddle/espnet
refactor component for test, align
|
3 years ago |
huangyuxin
|
2d3b2aed05
|
add seed in argparse
|
3 years ago |
Hui Zhang
|
561d5cf085
|
refactor feature, dict and argument for new config format
|
3 years ago |
TianYuan
|
2c75c923b9
|
fix_mfa
|
3 years ago |
Hui Zhang
|
aab02997f9
|
fix specaug config
|
3 years ago |
Hui Zhang
|
50f10f37ae
|
support replace with mean by aug
|
3 years ago |
Hui Zhang
|
86d08f994b
|
Merge pull request #768 from PaddlePaddle/espnet
support kaldi data pipeline
|
3 years ago |
Hui Zhang
|
f0c33a3081
|
Merge pull request #769 from Jackwaterveg/ds2_online
Ds2 online
|
3 years ago |
Hui Zhang
|
c09b0e8940
|
fix specaug
|
3 years ago |
Hui Zhang
|
9dace62581
|
fix augmentation
|
3 years ago |
Jackwaterveg
|
5e8dc5c17f
|
update the deepspech_online.conf
set lr_decay from 0.9 to 0.91
|
3 years ago |
huangyuxin
|
08b68e4b8f
|
change the deepspeech2_online.yaml
|
3 years ago |
Hui Zhang
|
ab23eb5710
|
fix for kaldi
|
3 years ago |
Hui Zhang
|
f05f367cc5
|
Merge pull request #756 from PaddlePaddle/filter
test w/ all example & fix ctc api & add new io
|
3 years ago |
Hui Zhang
|
7d133368e5
|
fix bugs
|
3 years ago |
Hui Zhang
|
7b649af8d7
|
add batchfy
|
3 years ago |
Hui Zhang
|
ee605b49ec
|
Merge pull request #757 from PaddlePaddle/punc
add punc egs
|
3 years ago |
Hui Zhang
|
7c3880b718
|
add punc egs
|
3 years ago |
Hui Zhang
|
433c1dc41a
|
Merge pull request #755 from PaddlePaddle/ted
fix ted egs
|
3 years ago |
Hui Zhang
|
e00567bb84
|
more result
|
3 years ago |
Hui Zhang
|
417a8b7996
|
fix ted egs
|
3 years ago |
Hui Zhang
|
99dfe04515
|
test w/ all example
|
3 years ago |
huangyuxin
|
61fe292c47
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into ds2_online
|
3 years ago |
huangyuxin
|
718ae52e3f
|
add from_config function to ds2_oneline and ds2
|
3 years ago |
Hui Zhang
|
4c0ee8d354
|
fix conf and readme
|
3 years ago |