TianYuan
|
e622f42d92
|
add aishell3 voice cloning with ECAPA-TDNN spk encoder
|
2 years ago |
TianYuan
|
1c30cff1bf
|
fix gpus of ernie_sat, test=tts (#2355)
|
2 years ago |
Zhao Yuting
|
59e7444efe
|
solve the bug of training mdtc_bs16_fp32
fix the filepath bug
|
2 years ago |
艾梦
|
ea9ee93739
|
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
* code for training vits voice clone on aishell3.
Co-authored-by: TianYuan <white-sky@qq.com>
|
2 years ago |
TianYuan
|
e8656fdfba
|
update version of paddle2onnx, test=tts (#2347)
|
2 years ago |
TianYuan
|
795eb7bd10
|
format paddlespeech with pre-commit (#2331)
|
2 years ago |
TianYuan
|
5d5888af8e
|
fix tone, update readme (#2335)
|
2 years ago |
Zhao Yuting
|
cb74803957
|
fix the bug "can't start ASR streaming_server" (#2337)
* Update application.yaml
|
2 years ago |
Hui Zhang
|
e0081b7e50
|
[vec][spk] add speechbrain ecapa-tdnn result
|
2 years ago |
TianYuan
|
7b864e8f38
|
clean old ernie sat inference scripts (#2316)
|
2 years ago |
TianYuan
|
d21e03c03e
|
update tts3 readme, test=doc (#2315)
|
2 years ago |
liangym
|
1c2a6b8e30
|
updata readme, test=doc (#2313)
|
2 years ago |
TianYuan
|
7cc1d66863
|
Update README.md
|
2 years ago |
liangym
|
1f100b1573
|
[tts] add tts finetune example (#2297)
* add tts finetune example, test=tts
* fix finetune
Co-authored-by: TianYuan <white-sky@qq.com>
|
2 years ago |
TianYuan
|
c1d4551055
|
add ernie sat synthesize_e2e, test=tts (#2287)
|
2 years ago |
Zhao Yuting
|
9473b8468c
|
Create preprocess.py
If there are no spaces between sentences in your text file, use this file to generate a new file, which adds spaces between each token.
|
2 years ago |
Zhao Yuting
|
d2f7362aa7
|
Delete preprocess.py
|
2 years ago |
Zhao Yuting
|
2aef6958de
|
Create preprocess.py
If there are no spaces between sentences in your text file, use this file to generate a new file, which adds spaces between each token.
|
2 years ago |
李子
|
5a58a27492
|
[TTS]指定G2PW的传入数据类型 , test=tts (#2288)
* fix ONNXRuntimeError Specify data type (int64),test=tts
* Tactron2→Tacotron2 ,test=doc
|
2 years ago |
Hui Zhang
|
99977b2f7e
|
Update README.md
|
2 years ago |
TianYuan
|
f7780658db
|
fix tone sand_hi bugs for Chinese frontend
|
2 years ago |
TianYuan
|
ed18b08d07
|
Update README.md
|
2 years ago |
TianYuan
|
3da9a15f9d
|
Update README.md
|
2 years ago |
TianYuan
|
18b4fb57be
|
update readme
|
2 years ago |
TianYuan
|
8dbefc0165
|
fix preprocess bug, add hifigan_csmsc decoder, update readme
|
2 years ago |
lym0302
|
368e3e1b59
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
|
2 years ago |
lym0302
|
894556f871
|
add zh_en mix example, test=tts
|
2 years ago |
TianYuan
|
2f9bdf2306
|
Merge pull request #2222 from yt605155624/add_onnx_cli
[CLI]add onnxruntime infer for cli
|
2 years ago |
TianYuan
|
b9ade18055
|
add onnxruntime infer for cli
|
2 years ago |
huangyuxin
|
dca51c5131
|
fix wenetspeech conf, test=asr
|
2 years ago |
Hui Zhang
|
812d80ab1c
|
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
|
2 years ago |
Hui Zhang
|
d3572be0bb
|
add ws export.sh
|
2 years ago |
Zhao Yuting
|
535f0d3e33
|
Update default.yaml
|
2 years ago |
Zhao Yuting
|
0308b86054
|
Update default.yaml
|
2 years ago |
Zhao Yuting
|
18b48c256b
|
Update default.yaml
|
2 years ago |
TianYuan
|
1f128a0817
|
Merge pull request #2117 from yt605155624/ernie_sat_trainer
[TTS]add ernie sat trainer
|
2 years ago |
TianYuan
|
9d4161ce5f
|
update config, test=doc
|
2 years ago |
lym0302
|
e1f8695456
|
add mix tts, test=tts
|
2 years ago |
TianYuan
|
97965f4c37
|
fix mlm_prob, test=tts
|
2 years ago |
liangym
|
b2baf1450a
|
Merge pull request #2159 from yt605155624/add_blank
[TTS]update vits ckpt
|
2 years ago |
TianYuan
|
378d25aef8
|
update vits ckpt, test=tts
|
2 years ago |
Jackwaterveg
|
e37bcdd5a8
|
test=doc
|
2 years ago |
Jackwaterveg
|
69f399f8cd
|
test=doc
|
2 years ago |
Jackwaterveg
|
e1f686abd9
|
test=doc
|
2 years ago |
Jackwaterveg
|
c167e128c5
|
fix doc,test=doc
|
2 years ago |
TianYuan
|
c1395e3a05
|
add synthesize for ernie_sat aishell3 and aishell3_vctk, test=tts
|
2 years ago |
Hui Zhang
|
6699d760e4
|
Update README.md
|
2 years ago |
TianYuan
|
5503c8bd6b
|
add ernie_sat synthesize script for metadata.jsonl, test=tts
|
2 years ago |
TianYuan
|
e129bc736b
|
fix am name, test=tts
|
2 years ago |
TianYuan
|
028742b69a
|
update lr scheduler
|
2 years ago |
TianYuan
|
94688264c7
|
add ernie sat model file and config
|
2 years ago |
huangyuxin
|
05d41523ad
|
Merge branch 'develop' into webdataset
|
2 years ago |
huangyuxin
|
92d1d08b9a
|
fix scripts
|
2 years ago |
Jackwaterveg
|
7fc81fe9d9
|
test=doc
|
2 years ago |
Jackwaterveg
|
32e8c6f16c
|
test=doc
|
2 years ago |
Jackwaterveg
|
1b0cda961f
|
test=doc
|
2 years ago |
TianYuan
|
e0a87ea914
|
Merge pull request #2090 from KPatr1ck/sv
[Examples][SV] Fix rir download. test=doc
|
2 years ago |
TianYuan
|
60c1a1e575
|
Merge pull request #2087 from yt605155624/add_blank
[TTS]install CPython version monotonic_align before training
|
2 years ago |
TianYuan
|
b2b05a0bc7
|
add vits ckpt, test=doc
|
2 years ago |
TianYuan
|
e3075e7917
|
install CPython version monotonic_align before train, test=tts
|
2 years ago |
huangyuxin
|
429221dc03
|
adopt multi machine traiing
|
2 years ago |
KP
|
19fd46f57b
|
Fix rir download. test=doc
|
2 years ago |
huangyuxin
|
ac1b301657
|
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
|
2 years ago |
huangyuxin
|
81934d7191
|
fix run.sh
|
2 years ago |
Jackwaterveg
|
6598216b2f
|
Merge branch 'develop' into webdataset
|
2 years ago |
huangyuxin
|
0c7abc1f17
|
add training scripts
|
2 years ago |
TianYuan
|
e233c4e449
|
Merge pull request #2079 from Honei/develop
[vector]add convert.sh
|
2 years ago |
xiongxinlei
|
d15883e3dc
|
add convert.sh
|
2 years ago |
huangyuxin
|
c7a7b113c8
|
support multi-gpu training with webdataset
|
2 years ago |
TianYuan
|
6a45c5c3f5
|
add tts static/onnx models' link in released_model.md, test=doc
|
2 years ago |
TianYuan
|
7743c6a1ff
|
add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
|
2 years ago |
TianYuan
|
d1aa83a239
|
Merge pull request #2052 from yt605155624/ernie_sat
[TTS]add ernie sat inference
|
2 years ago |
TianYuan
|
79658a5f20
|
add ernie sat inference, test=tts
|
2 years ago |
TianYuan
|
02734141ce
|
Merge pull request #2040 from yt605155624/add_blank
[TTS]add blank between characters for vits
|
2 years ago |
TianYuan
|
1731976e4e
|
add blank between characters for vits, test=tts
|
2 years ago |
Jackwaterveg
|
bca014fd92
|
Merge pull request #2032 from PaddlePaddle/audio_refactoring
[audio] Audio refactoring
|
2 years ago |
KP
|
bf056c013d
|
Refactor paddleaudio to paddlespeech.audio
|
2 years ago |
TianYuan
|
a37a5266f5
|
Merge pull request #2031 from Jackwaterveg/develop_fix
[ASR] fix local/test.sh in librispeech asr1
|
2 years ago |
huangyuxin
|
865d075831
|
fix local/test.sh of librispeech asr1
|
2 years ago |
Jackwaterveg
|
b9d35c9b2b
|
Merge pull request #2028 from Jackwaterveg/develop_dev
[ASR] support distrbuted training
|
2 years ago |
huangyuxin
|
9aa868d14d
|
support distrbuted training
|
2 years ago |
Jackwaterveg
|
4432190fa8
|
test=doc
|
2 years ago |
Jackwaterveg
|
6fe4cc1e47
|
test=doc
|
2 years ago |
Jackwaterveg
|
681151a8c8
|
test=doc
|
2 years ago |
Hui Zhang
|
dfdf450b22
|
fix #2013; and format
|
2 years ago |
huangyuxin
|
61e565182a
|
add preprocess.yaml
|
3 years ago |
huangyuxin
|
e48e1d5e81
|
fix tiny and local script, test=asr
|
3 years ago |
huangyuxin
|
47dd61e5b2
|
refactor ds2, cli, server
|
3 years ago |
TianYuan
|
6c7ed42712
|
fix ljspeech readme, test=doc
|
3 years ago |
TianYuan
|
9a253bc091
|
gen lexicon with tone in mfa, test=tts
|
3 years ago |
TianYuan
|
e6e5d86a5a
|
Merge pull request #1984 from Jackwaterveg/develop
[Doc] deprecate the 1.8x model
|
3 years ago |
huangyuxin
|
62c50e0060
|
deprecate the 1.8x model, test=doc
|
3 years ago |
Hui Zhang
|
42fba661c9
|
more detail of copyright
|
3 years ago |
TianYuan
|
7bc54cbbe6
|
Merge pull request #1957 from yt605155624/vits_doc
[doc]add VITS readme, test=tts
|
3 years ago |
TianYuan
|
f9f014d159
|
add VITS readme, test=tts
|
3 years ago |
Hui Zhang
|
8f8239ad3b
|
Merge pull request #1954 from Honei/acs_server
[server][vector]update the vector model
|
3 years ago |
xiongxinlei
|
07c0d7d7cc
|
remove old vector model info, test=doc
|
3 years ago |
xiongxinlei
|
7afbdbefad
|
update the vector model, test=doc
|
3 years ago |
qingen
|
a7037dc029
|
[vec][doc] update der result, test=doc
|
3 years ago |
TianYuan
|
7a88e3f4e4
|
update readme, test=doc
|
3 years ago |
TianYuan
|
df3f975ea5
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vits
|
3 years ago |
TianYuan
|
8db06444c5
|
add vits trainer and synthesize
|
3 years ago |
Jackwaterveg
|
6b6c6cc1eb
|
test=doc
|
3 years ago |
Jackwaterveg
|
689e3bfc60
|
test=doc
|
3 years ago |
Jackwaterveg
|
a55ec2f69f
|
test=doc
|
3 years ago |
Jackwaterveg
|
cab0394954
|
test=doc
|
3 years ago |
Jackwaterveg
|
b5437a293d
|
test=doc
|
3 years ago |
root
|
8a3c88d42e
|
add launch for st, test=asr
|
3 years ago |
root
|
9f389a7a33
|
support cpu, test=asr
|
3 years ago |
root
|
864041085f
|
replace dist.spawn with dist.launch, test=asr
|
3 years ago |
TianYuan
|
4b7786f2ed
|
add vits network scripts, test=tts
|
3 years ago |
root
|
9947380898
|
fix the doc, test=doc
|
3 years ago |
root
|
4d7046d244
|
updata released model info, test=doc
|
3 years ago |
root
|
0309a4d032
|
Add doc for wenetspeech model, test=doc
|
3 years ago |
Hui Zhang
|
624ab2c57a
|
update asr1 config
|
3 years ago |
lizi
|
10b00b4da7
|
fix the reorganize_aishell3 trouble now it can generate lab files of audio files under training classification
|
3 years ago |
Honei
|
ff7dbcc2de
|
Merge branch 'develop' into v0.3
|
3 years ago |
xiongxinlei
|
c5fe181405
|
update the paddlespeech_client asr_online cli, test=doc
|
3 years ago |
Hui Zhang
|
6132457b7b
|
Merge pull request #1808 from Jackwaterveg/develop_dev
[Doc] Renew ds2 online info
|
3 years ago |
Hui Zhang
|
b66838faa9
|
Merge pull request #1811 from Honei/v0.3
[R1.0]update the streaming asr server readme
|
3 years ago |
xiongxinlei
|
4c56e4d42c
|
update the voxceleb readme.md, test=doc
|
3 years ago |
huangyuxin
|
e5fbd8ce75
|
renew ds2 online doc, test=doc
|
3 years ago |
xiongxinlei
|
cb9beabace
|
fix the sv ecapa-tdnn cpu training, test=doc
|
3 years ago |
Hui Zhang
|
5f62c84cb0
|
Merge pull request #1791 from qingen/cluster
[vec] update readme
|
3 years ago |
qingen
|
648cc5823b
|
[vec] update readme, test=doc
|
3 years ago |
Hui Zhang
|
42a81f453b
|
Merge pull request #1781 from PaddlePaddle/Jackwaterveg-patch-2
[Doc] Update ds2online model info
|
3 years ago |
KP
|
abb15ac6e8
|
Update KWS example.
|
3 years ago |
Jackwaterveg
|
5ecdf3d3cd
|
Update RESULTS.md
|
3 years ago |
KP
|
caa8eb4d0d
|
Add KWS example.
|
3 years ago |
KP
|
43659b9882
|
Add KWS example.
|
3 years ago |
KP
|
f9761d532c
|
Add KWS example.
|
3 years ago |
KP
|
b60b1dadde
|
Add KWS example.
|
3 years ago |
TianYuan
|
0b1b573a3f
|
Merge pull request #1767 from Jackwaterveg/cli
[CLI] Add conformer_aishell, conformer_online_aishell
|
3 years ago |
huangyuxin
|
ad4e04fc82
|
add conformer_online_aishell, test=doc
|
3 years ago |
Hui Zhang
|
91e24b0480
|
format code
|
3 years ago |
TianYuan
|
a7402203ec
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into update_paddle2onnx
|
3 years ago |
TianYuan
|
b55865b228
|
update version of paddle2onnx, test=tts
|
3 years ago |
TianYuan
|
9121dfc046
|
Merge pull request #1752 from yt605155624/fix_wavernn
[TTS]fix wavernn white noise bug for paddle develop(2.3)
|
3 years ago |
TianYuan
|
08a4673355
|
fix wavernn bug, test=tts
|
3 years ago |
Jackwaterveg
|
be42b1322a
|
Updata released model info, test=doc
|
3 years ago |
Hui Zhang
|
21b740f3cf
|
Merge pull request #1746 from PaddlePaddle/Jackwaterveg-patch-1
[Doc] Fix release_model info
|
3 years ago |
Hui Zhang
|
2974931196
|
Merge pull request #1747 from Jackwaterveg/debug
[ASR] fix CER tools
|
3 years ago |
huangyuxin
|
9c77d9e880
|
fix, test=doc
|
3 years ago |
Jackwaterveg
|
132c8916c3
|
Update RESULTS.md
|
3 years ago |
xiongxinlei
|
df503e97c1
|
update the voxceleb readme.md, test=doc
|
3 years ago |
Jackwaterveg
|
b7f62ba82f
|
Update RESULTS.md
|
3 years ago |
Jackwaterveg
|
8d1ee8262e
|
Merge branch 'develop' into CER
|
3 years ago |
TianYuan
|
c74fa9ada8
|
restructure syn_utils.py, test=tts
|
3 years ago |
huangyuxin
|
4e431ae269
|
resum librispeech
|
3 years ago |
huangyuxin
|
7c3c1b440a
|
change librispeech
|
3 years ago |
huangyuxin
|
6e80618e3d
|
add ds2
|
3 years ago |
Hui Zhang
|
7220b11b58
|
Merge pull request #1715 from zh794390558/spx_egs
[speechx] refactor egs and more egs for TLG wfst graph build
|
3 years ago |
Hui Zhang
|
b78bc6375b
|
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
[TTS]add fastspeech2 cnndecoder onnx model
|
3 years ago |
Hui Zhang
|
0ede6c2ee7
|
train lm
|
3 years ago |
TianYuan
|
da93f944e6
|
update, test=doc
|
3 years ago |
TianYuan
|
dafe7c3657
|
add fastspeech2 cnndecoder onnx model, test=tts
|
3 years ago |
Hui Zhang
|
a054d1c452
|
text process for lm
|
3 years ago |
huangyuxin
|
4f23caa238
|
fix bug
|
3 years ago |
TianYuan
|
7dcfc4aa95
|
[doc]add pwgan onnx model, test=doc
|
3 years ago |
TianYuan
|
98f67870ea
|
Merge pull request #1693 from yt605155624/fix_ss_NHWC
[TTS]change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
buchongyu
|
48358055d0
|
修改hack 单词拼写错误
|
3 years ago |
buchongyu
|
607a20a54c
|
修复 example 目录中speech单词拼写错误问题
|
3 years ago |
TianYuan
|
8b801ca18b
|
change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
xiongxinlei
|
d1935d8552
|
add vector necessary note, test=doc
|
3 years ago |
Honei
|
48e0177767
|
Merge pull request #1630 from Honei/vox12
[vec]voxceleb convert dataset format to paddlespeech
|
3 years ago |
qingen
|
fc72295334
|
Merge pull request #1651 from ccrrong/ami
[vec] add speaker diarization pipeline
|
3 years ago |
ccrrong
|
995436c6f1
|
delete unused file ami_dataset.py, compute_der.py, test=doc
|
3 years ago |
Hui Zhang
|
44ee5cd805
|
Merge pull request #1677 from PaddlePaddle/Jackwaterveg-patch-1
[Doc] update readem for aishell/asr0
|
3 years ago |
ccrrong
|
bc53f726fe
|
convert dataset format to paddlespeech, test=doc
|
3 years ago |
TianYuan
|
c674e59b91
|
update readme, test=doc
|
3 years ago |
TianYuan
|
0282d45c62
|
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
|
3 years ago |
TianYuan
|
30628f6832
|
update readme, test=doc
|
3 years ago |
TianYuan
|
c765fca6b4
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
|
3 years ago |
TianYuan
|
21c75684ac
|
add paddle2onnx, test=tts
|
3 years ago |
Jackwaterveg
|
75c9dc773b
|
test=doc
|
3 years ago |
Jackwaterveg
|
3c93953550
|
test=doc
|
3 years ago |
Jackwaterveg
|
f71b9b915d
|
test=doc
|
3 years ago |
Jackwaterveg
|
1a67038616
|
test=doc
|
3 years ago |
Jackwaterveg
|
88f5595bd7
|
test=doc
|
3 years ago |
Jackwaterveg
|
ee96fb40f0
|
test=doc
|
3 years ago |
Jackwaterveg
|
a22f29ba10
|
test=doc
|
3 years ago |
Jackwaterveg
|
ae1b22273f
|
[Doc] update readem for aishell/asr0, test=doc
|
3 years ago |
xiongxinlei
|
a8244dc5b0
|
update the note, test=doc
|
3 years ago |
KP
|
80b1fb9839
|
Update RESULTS.md. test=doc
|
3 years ago |
KP
|
34b77a9db1
|
Update RESULTS.md. test=doc
|
3 years ago |
huangyuxin
|
fd7a50d5a0
|
add new cer tools, test=asr
|
3 years ago |
Hui Zhang
|
2b7ca6f261
|
Update RESULTS.md
|
3 years ago |
Hui Zhang
|
7ca40ff008
|
Merge pull request #1668 from PaddlePaddle/Jackwaterveg-patch-1
[ASR] update ds2 online model
|
3 years ago |
Honei
|
89791d7aca
|
Merge pull request #1663 from Honei/model
[vec]update the speaker verification model
|
3 years ago |
Jackwaterveg
|
82cd7015d7
|
test=doc
|
3 years ago |
Jackwaterveg
|
5bb36472e8
|
test=doc
|
3 years ago |
Jackwaterveg
|
eeae00cc04
|
test=doc
|
3 years ago |
TianYuan
|
d592f25279
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
TianYuan
|
7aecb2c4bb
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
xiongxinlei
|
d064c8196e
|
update the speaker verification model, test=doc
|
3 years ago |
KP
|
079ac5caa0
|
Update README.md
|
3 years ago |
xiongxinlei
|
38e4e9c893
|
refactor voxceleb2 data download, test=doc
|
3 years ago |
ccrrong
|
7a03f36548
|
code format, test=doc
|
3 years ago |
ccrrong
|
378fe5909f
|
add ami diarization pipeline, test=doc
|
3 years ago |
xiongxinlei
|
acebfad7b7
|
change the vector csv.spk_id to csv.label, test=doc
|
3 years ago |