Hui Zhang
|
581a545c69
|
Update RESULTS.md
fix table header
|
3 years ago |
Hui Zhang
|
27087de5e9
|
update librispeech asr1 transformer result
|
3 years ago |
Junkun
|
1f3357f2d2
|
minor
|
3 years ago |
Junkun
|
72a8c9337c
|
update data process
|
3 years ago |
Jackwaterveg
|
cfed8d0182
|
Merge pull request #1061 from LittleChenCc/develop
[Bug Fix] fix bugs in the data reader
|
3 years ago |
Hui Zhang
|
ecbe785e47
|
remove ctc grad norm option
|
3 years ago |
Hui Zhang
|
5d626aa6b4
|
fix tiny conf
|
3 years ago |
Junkun
|
f50a2ab4ca
|
fix bugs
|
3 years ago |
Hui Zhang
|
3e19978194
|
Merge pull request #1054 from zh794390558/visual
[asr] using visualdl , jsonlines read manifest
|
3 years ago |
Jerryuhoo
|
13411d8a26
|
fix readme typo
|
3 years ago |
Hui Zhang
|
39228864bb
|
format code
|
3 years ago |
Junkun
|
aea1e92a3d
|
update cmd.sh
|
3 years ago |
Junkun
|
3e5fc3dd54
|
Merge branch 'develop' of https://github.com/LittleChenCc/DeepSpeech into develop
|
3 years ago |
Junkun Chen
|
2301fed1b4
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
Junkun
|
f225b1d88e
|
minor updates
|
3 years ago |
TianYuan
|
2de7bc14b0
|
Update finetune.yaml
|
3 years ago |
TianYuan
|
507c3b52ea
|
Update default.yaml
|
3 years ago |
Junkun
|
351e4e8e87
|
training script
|
3 years ago |
Junkun
|
3c8e87344a
|
update run scripts
|
3 years ago |
Junkun
|
e867f3bb41
|
minor
|
3 years ago |
Junkun
|
48207c1410
|
process scripts and configs
|
3 years ago |
Junkun
|
8f3280af8e
|
fix data process
|
3 years ago |
Junkun
|
6a50211c80
|
data process for ted-en-zh st1
|
3 years ago |
huangyuxin
|
b48bc4e046
|
fix the run.sh
|
3 years ago |
huangyuxin
|
dcc2390323
|
merge the develop branch and do the revising
|
3 years ago |
huangyuxin
|
895a086fdd
|
rename the config.feat_size and the config.vocab.size to input_size and output_size
|
3 years ago |
Hui Zhang
|
a1f5db8d7f
|
Merge pull request #1037 from Jackwaterveg/dev
[run.sh] fix the audio_file location in run.sh
|
3 years ago |
TianYuan
|
022f1ce8e9
|
Merge pull request #1040 from yt605155624/fix_frontend
[TTS]update text frontend
|
3 years ago |
huangyuxin
|
b6a466ceea
|
upload the demo audio_file
|
3 years ago |
huangyuxin
|
ef27a0e18a
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into dev
|
3 years ago |
Hui Zhang
|
32afa23e50
|
Merge pull request #1041 from zh794390558/ctc
[asr] update librispeech asr1-2 result; add warpctc source link in ctc topic
|
3 years ago |
Hui Zhang
|
396db4a56a
|
update librispeech asr1-2 result; add warpctc source link in ctc topic
|
3 years ago |
TianYuan
|
dad1cbbcd6
|
update text frontend
|
3 years ago |
KP
|
6e1ac1cc15
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
33f0e7622c
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
dfdc19fb49
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
2c531d78ac
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
bdb3ce23ee
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
eb68b3d800
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
1189117784
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
huangyuxin
|
5047e8786c
|
merge the develop
|
3 years ago |
TianYuan
|
b6ade97b32
|
Update README.md
|
3 years ago |
TianYuan
|
47434c1ac6
|
Update README.md
|
3 years ago |
Hui Zhang
|
2bbfdbae91
|
Merge pull request #1015 from yt605155624/fs2_conformer
[TTS]fastspeech2 conformer
|
3 years ago |
TianYuan
|
f9bd802eb0
|
Update README.md
|
3 years ago |
TianYuan
|
469329221b
|
refactor encoder, rm old code
|
3 years ago |
TianYuan
|
6a76ee00aa
|
Update README.md
|
3 years ago |
TianYuan
|
27b9a411f0
|
Update README.md
|
3 years ago |
TianYuan
|
14413f7464
|
Update README.md
|
3 years ago |
TianYuan
|
38f44ff736
|
Update README.md
|
3 years ago |
TianYuan
|
13d38942ec
|
Update README.md
|
3 years ago |
Hui Zhang
|
deffc958cf
|
support kaldi static
|
3 years ago |
Hui Zhang
|
712de751cb
|
Merge pull request #1036 from zh794390558/nproc
[asr] nproc to ngpu
|
3 years ago |
Hui Zhang
|
fd15d0daf8
|
Merge pull request #1035 from zh794390558/dataset
[asr] dataset to root dir
|
3 years ago |
huangyuxin
|
45ac9e0520
|
delete the unsupport
|
3 years ago |
huangyuxin
|
357a6723e0
|
fix the audio_file location in run.sh
|
3 years ago |
Hui Zhang
|
fe83adfbcb
|
nproc to ngpu
|
3 years ago |
Hui Zhang
|
6151800d04
|
fix dataset dir in data.sh
|
3 years ago |
Hui Zhang
|
cc7096dd27
|
examples/dataset to dataset
|
3 years ago |
Jackwaterveg
|
4d46cc9357
|
Merge pull request #1034 from zh794390558/rsl
[asr] rename to result.md
|
3 years ago |
Hui Zhang
|
733b0ce29a
|
rename to result.md
|
3 years ago |
Hui Zhang
|
789471bfca
|
test wav for u2
|
3 years ago |
huangyuxin
|
50cf88b7f1
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into doc
|
3 years ago |
Jackwaterveg
|
563568a2b8
|
Merge pull request #1031 from yt605155624/fix_docs
[TTS]update ipynb, add eval loss
|
3 years ago |
TianYuan
|
7d3985bff9
|
update table
|
3 years ago |
TianYuan
|
f3fbce005e
|
update ipynb, add eval loss
|
3 years ago |
Hui Zhang
|
042bbe5ed5
|
update ds2 offline result
|
3 years ago |
TianYuan
|
bc0dd51149
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
|
3 years ago |
Hui Zhang
|
b119cfe06d
|
fix preprocess of libri asr2
|
3 years ago |
huangyuxin
|
649fcc4c16
|
revise some programming mistakes
|
3 years ago |
huangyuxin
|
2274a07235
|
Merge branch 'develop' into doc
|
3 years ago |
Jackwaterveg
|
04cfcd96ca
|
Merge pull request #1023 from zh794390558/dict
[asr] put vocab into data/lang_char
|
3 years ago |
Jackwaterveg
|
88d4208430
|
Merge pull request #1022 from yt605155624/fix_tts_doc
[TTS]fix readme
|
3 years ago |
TianYuan
|
f5a3b21f45
|
fix readme
|
3 years ago |
Hui Zhang
|
cdeb5cf6b6
|
update librispeech transformer result
|
3 years ago |
Jackwaterveg
|
09931d2ccc
|
Merge pull request #1019 from zh794390558/feat
[bugfix] Kaldi Feature using dither in train
|
3 years ago |
huangyuxin
|
f765171111
|
add the readme for the run.sh in aishsll asr1
|
3 years ago |
Hui Zhang
|
4f54e36294
|
vocab into data/lang_char
|
3 years ago |
gongel
|
3a31547516
|
refactor: rename t1 to st1
|
3 years ago |
gongel
|
d4ee5916b1
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into ted_en_zh_t0
|
3 years ago |
gongel
|
7cef93a6f4
|
refactor: update
|
3 years ago |
huangyuxin
|
8aebfeac81
|
fix the prc-commit
|
3 years ago |
Hui Zhang
|
56480e1033
|
fix format
|
3 years ago |
TianYuan
|
4537e900ef
|
Update README.md
|
3 years ago |
Jackwaterveg
|
524658a04f
|
Merge pull request #1018 from yt605155624/fix_url
[TTS]fix urls
|
3 years ago |
TianYuan
|
2d808a3c64
|
fix urls
|
3 years ago |
Hui Zhang
|
6750770e54
|
Merge pull request #1012 from zh794390558/datapipe
[asr] independent dataloader
|
3 years ago |
gongel
|
5b5c73f9bb
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into ted_en_zh_t0
|
3 years ago |
TianYuan
|
bdd2fb8f93
|
add aishell3/vc1 readme, add csmsc/voc1 readme
|
3 years ago |
Hui Zhang
|
2f4f744071
|
rename asr egs
|
3 years ago |
Hui Zhang
|
2ba3f00bbd
|
Merge branch 'develop' into datapipe
|
3 years ago |
Hui Zhang
|
b57b865989
|
rename egs
|
3 years ago |
Hui Zhang
|
b944418d6f
|
new format data support ds2/st
|
3 years ago |
Hui Zhang
|
02c7ef3198
|
format data support multi output
|
3 years ago |
Hui Zhang
|
e79e00a6b2
|
pack model
|
3 years ago |
Hui Zhang
|
0defc658e1
|
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
|
3 years ago |
TianYuan
|
4370c5cfa6
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
|
3 years ago |
Hui Zhang
|
a7858551b7
|
add utt2spk for all dataset
|
3 years ago |
Hui Zhang
|
638b96bf07
|
check if cmvn_file in config for u2
|
3 years ago |
TianYuan
|
ea81c772ce
|
Merge pull request #1010 from zh794390558/statis
[asr]disable export for u2
|
3 years ago |
Hui Zhang
|
a87ba13d93
|
disable export for u2
|
3 years ago |
Hui Zhang
|
c354e9154b
|
Merge pull request #1003 from yt605155624/fs2_ge2e
[TTS]add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
133ee7db0b
|
rename num_speakers
|
3 years ago |
TianYuan
|
3d5e078c91
|
add conformer
|
3 years ago |
TianYuan
|
a97c7b5206
|
rename spembs
|
3 years ago |
gongel
|
9f42ec4bc2
|
feat: add ted_en_zh t1
|
3 years ago |
Hui Zhang
|
b9790d03f2
|
add wenetspeech egs
|
3 years ago |
Hui Zhang
|
171fa353ee
|
refactor libri s2 conf
|
3 years ago |
Hui Zhang
|
26258949ab
|
Merge pull request #995 from yt605155624/mbmelgan_fine
[TTS]add multi-band melgan finetune scripts
|
3 years ago |
TianYuan
|
8d025451de
|
add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
c5c9f19091
|
rename to gen_gta_mel.py, remove stats compute when gen fintune data
|
3 years ago |
Zeyu Chen
|
4a28751df0
|
Formalize the terms in README
|
3 years ago |
Hui Zhang
|
3046a22719
|
aishell support utt2spk
|
3 years ago |
TianYuan
|
b9dc017011
|
Update synthesize_e2e.sh
|
3 years ago |
TianYuan
|
c4234b3ecd
|
Update synthesize.sh
|
3 years ago |
TianYuan
|
a6ac497f8e
|
add multi-band melgan finetune scripts
|
3 years ago |
TianYuan
|
39400e5ee8
|
Update synthesize.sh
|
3 years ago |
Hui Zhang
|
bc4e2e4ee2
|
Merge pull request #982 from Jackwaterveg/develop
Optimizer the hips while downloading the LM
|
3 years ago |
huangyuxin
|
754c0b560b
|
optimizer the hips of downloading LM
|
3 years ago |
TianYuan
|
30d09b411d
|
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
|
3 years ago |
Mingxue-Xu
|
f26db2e762
|
Update README.md
|
3 years ago |
Mingxue-Xu
|
6641b97d44
|
Update README.md
|
3 years ago |
TianYuan
|
0bc9450c51
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
Hui Zhang
|
81598e6ff0
|
default gpu 0 for scripts
|
3 years ago |
Junkun
|
7c8843448c
|
add word reward into beam search.
|
3 years ago |
Jackwaterveg
|
67551c6557
|
Add notes in example/aishell/s0/run.sh
|
3 years ago |
Hui Zhang
|
9a71c091c5
|
remove debug info and format code
|
3 years ago |
Hui Zhang
|
8b0e344c69
|
fix logfbank using PCM16
|
3 years ago |
Hui Zhang
|
d62092ac28
|
fix specaug param
|
3 years ago |
TianYuan
|
2931903add
|
Rename READEME.md to README.md
|
3 years ago |
huangyuxin
|
61ad2c87a7
|
update the ds2 online conf
|
3 years ago |
Hui Zhang
|
7b3a901b08
|
more conf with preprocess.yaml
|
3 years ago |
Hui Zhang
|
44743622d4
|
filter example; cmvn stride and window int; libri/s1 conf
|
3 years ago |
Hui Zhang
|
56d06f2aaf
|
Merge pull request #968 from yt605155624/merge_paddlespeech
[TTS] change nprocs to ngpu
|
3 years ago |
Hui Zhang
|
6a7e0265cd
|
add josn global cmvn
|
3 years ago |
TianYuan
|
bacdf5756b
|
Merge remote-tracking branch 'origin/develop' into merge_paddlespeech
|
3 years ago |
Hui Zhang
|
69055698a2
|
transformer using batch data loader
|
3 years ago |
TianYuan
|
35c37ace17
|
change nprocs to ngpu, add aishell3/voc1
|
3 years ago |
huangyuxin
|
d647cde870
|
change the lm dataset dir
|
3 years ago |
Hui Zhang
|
3f3442b98a
|
remove useless third lib
|
3 years ago |
Hui Zhang
|
aba37810ff
|
update BZNSYP.rar link
|
3 years ago |
Hui Zhang
|
e2bcaee4f1
|
merge deepspeech, parakeet and text_processing into paddlespeech
|
3 years ago |
Jackwaterveg
|
782b0ddceb
|
Merge pull request #957 from PaddlePaddle/ds2_offline
revert ds2 offline rnn bw_cell to fw_cell which loss can be 5.73
|
3 years ago |
Hui Zhang
|
2fa681237f
|
Merge pull request #955 from Jackwaterveg/fix
fix the run_test in test_export
|
3 years ago |
Hui Zhang
|
4ce4e7926e
|
revert ds2 offline rnn bw_cell to fw_cell which loss can be 5.73, but is not the birnn
|
3 years ago |
huangyuxin
|
b966bb8a31
|
fix the run_test in test_export
|
3 years ago |
Hui Zhang
|
980944dab1
|
Merge pull request #952 from Jackwaterveg/dev_transformerLM
Add the feature: caculating the perplexity of transformerLM
|
3 years ago |
Hui Zhang
|
04d84a87ae
|
Merge pull request #948 from yt605155624/fs2_tostatic
fix fastspeech2 to static
|
3 years ago |
Hui Zhang
|
1372a08813
|
Merge pull request #953 from Jackwaterveg/fix_bug
[Bug fix] fix the bug of 'dev/null' and the test_export
|
3 years ago |
TianYuan
|
b68c9c05c4
|
fix fs2 inference bug
|
3 years ago |
huangyuxin
|
d64f6e9ea5
|
Add the feature: caculating the perplexity of transformerLM
|
3 years ago |
Jackwaterveg
|
8741da5a68
|
Update README.md
|
3 years ago |
huangyuxin
|
542ee3f070
|
add the model description in 1xt2x doc
|
3 years ago |
huangyuxin
|
02083cdbd6
|
fix the bug of 'dev/null' and the test_export
|
3 years ago |
TianYuan
|
fc8a7a152e
|
Merge pull request #951 from yt605155624/add_mbmelgan
[TTS] add global init for multi band melgan
|
3 years ago |
TianYuan
|
d3d9f83594
|
add global init for multi band melgan to avoid large output in the begin
|
3 years ago |
TianYuan
|
79e7a4d44e
|
align ouput of dygraph and static graph
|
3 years ago |
Hui Zhang
|
28519c1f44
|
Merge pull request #949 from Jackwaterveg/develop
fix the bug of chooing dataloader, remove the log of downloads lm, ch…
|
3 years ago |
huangyuxin
|
e66da76db9
|
fix the bug of chooing dataloader, remove the log of downloads lm, change the epoch in tiny
|
3 years ago |
TianYuan
|
9125d71a81
|
fix pwg inference
|
3 years ago |
TianYuan
|
36d60a717e
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into add_mbmelgan
|
3 years ago |
TianYuan
|
88668513b1
|
fix mv writer to visualdl in train
|
3 years ago |
TianYuan
|
670a68ad95
|
fix textfrontend readme, fix imgs link
|
3 years ago |
TianYuan
|
950d17cbcf
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into add_mbmelgan
|
3 years ago |
TianYuan
|
41526ca1b8
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
TianYuan
|
3f9e30c9b3
|
refactor docs
|
3 years ago |
TianYuan
|
304d71747a
|
Merge pull request #939 from Jackwaterveg/doc
fix the doc
|
3 years ago |
huangyuxin
|
cef36521f9
|
fix the doc
|
3 years ago |
Hui Zhang
|
0812a3df20
|
add more join ctc decode conf
|
3 years ago |
Hui Zhang
|
8370604084
|
Merge pull request #936 from PaddlePaddle/fix_lm
[asr] fix egs of transformer lm usage
|
3 years ago |
Hui Zhang
|
e4852e3bf9
|
Merge pull request #934 from yt605155624/fix_readme
[TTS]fix link in readme
|
3 years ago |
Hui Zhang
|
c89820e7b2
|
fix egs of transformer lm usage
|
3 years ago |
TianYuan
|
6dbcd7720d
|
add csmsc mb melgan example
|
3 years ago |
TianYuan
|
02055eb26a
|
fix link in readme
|
3 years ago |
Hui Zhang
|
b878027c9a
|
format code
|
3 years ago |
Hui Zhang
|
8cda812857
|
Merge branch 'develop' into join_ctc
|
3 years ago |
Hui Zhang
|
b7bdaf6f8f
|
add lm conf and load
|
3 years ago |
TianYuan
|
20226b4fdd
|
fix benchmark and chain, add parse_options in run.sh, move tacotron2_ge2e into voice_cloning
|
3 years ago |
Hui Zhang
|
8f869b4c1f
|
update gitignore
|
3 years ago |
Hui Zhang
|
a107b75bac
|
transform; librispeech/s2 data process ok
|
3 years ago |
TianYuan
|
2e9d9dc9a7
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into merge_parakeet
|
3 years ago |
TianYuan
|
3ce5dff460
|
refactor parakeet examples
|
3 years ago |
Hui Zhang
|
614a004c37
|
update librispeech/s2 result
|
3 years ago |
Hui Zhang
|
a37cfbfb96
|
add fbank/pitch conf
|
3 years ago |
Hui Zhang
|
7509dc4056
|
update path and flac
|
3 years ago |
Hui Zhang
|
871fc5b70d
|
more utils to support kaldi/espnet data preocess
|
3 years ago |
Hui Zhang
|
c5f6692191
|
update lirbi s2 result
|
3 years ago |
Hui Zhang
|
12f788dd0e
|
Merge branch 'develop' into join_ctc
|
3 years ago |
Hui Zhang
|
7cfb3334e3
|
Merge pull request #927 from PaddlePaddle/nn_ctc
[asr] not change ctc grad norm manually
|
3 years ago |
Hui Zhang
|
dfd80b3aa2
|
recog into decoders, format code
|
3 years ago |
Hui Zhang
|
a4e27da64b
|
decoder with ctc prefix score
|
3 years ago |
Hui Zhang
|
7d54ee4d1d
|
ctc_grad_norm_type by null
|
3 years ago |
Hui Zhang
|
30499a7654
|
not change ctc grad manual
|
3 years ago |
Hui Zhang
|
190f4cc4bc
|
update u2 result; fix test.sh
|
3 years ago |
huangyuxin
|
b1a90d4d7a
|
add hub for s1 in aishell and librispeech
|
3 years ago |
Hui Zhang
|
8539689b15
|
u2 kaldi wer4p0
|
3 years ago |
Hui Zhang
|
f55267f2b3
|
fix img link; rsl format;
|
3 years ago |
huangyuxin
|
bfda49bf40
|
fix the bug of benchmark after merge the parakeet, add the condition of using kaldi in aishll s1
|
3 years ago |
Hui Zhang
|
fa5531c03e
|
Merge pull request #908 from PaddlePaddle/speech
[TTS] merge parakeet repo into deepspeech
|
3 years ago |
Hui Zhang
|
b079577e08
|
merge parakeet repo into deepspeech
|
3 years ago |