lym0302
|
e1f8695456
|
add mix tts, test=tts
|
2 years ago |
TianYuan
|
97965f4c37
|
fix mlm_prob, test=tts
|
2 years ago |
TianYuan
|
72fa8176ca
|
fix for mix_lang
|
2 years ago |
TianYuan
|
5503c8bd6b
|
add ernie_sat synthesize script for metadata.jsonl, test=tts
|
2 years ago |
lym0302
|
207bb5d93b
|
add mix frontend, test=tts
|
2 years ago |
TianYuan
|
028742b69a
|
update lr scheduler
|
2 years ago |
TianYuan
|
94688264c7
|
add ernie sat model file and config
|
2 years ago |
liangym
|
e153495519
|
Merge pull request #2122 from yt605155624/rm_server_log
[TTS]add stream_play_tts.py, test=doc
|
2 years ago |
TianYuan
|
6bbe6de1ec
|
add stream_play_tts.py, test=doc
|
2 years ago |
TianYuan
|
f6d1c545ac
|
fromat doc_string
|
2 years ago |
TianYuan
|
cf846f9ebc
|
rm extra log
|
2 years ago |
TianYuan
|
c0f126ecd9
|
fix int32 warning in tts, test=tts
|
2 years ago |
TianYuan
|
7743c6a1ff
|
add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
|
2 years ago |
TianYuan
|
d1aa83a239
|
Merge pull request #2052 from yt605155624/ernie_sat
[TTS]add ernie sat inference
|
2 years ago |
TianYuan
|
79658a5f20
|
add ernie sat inference, test=tts
|
2 years ago |
TianYuan
|
02734141ce
|
Merge pull request #2040 from yt605155624/add_blank
[TTS]add blank between characters for vits
|
2 years ago |
TianYuan
|
1731976e4e
|
add blank between characters for vits, test=tts
|
2 years ago |
Hui Zhang
|
dfdf450b22
|
fix #2013; and format
|
2 years ago |
TianYuan
|
004ab8d0c0
|
reneame chunk to block in streaming tts, test=tts
|
2 years ago |
TianYuan
|
7bc54cbbe6
|
Merge pull request #1957 from yt605155624/vits_doc
[doc]add VITS readme, test=tts
|
2 years ago |
TianYuan
|
f9f014d159
|
add VITS readme, test=tts
|
2 years ago |
Hui Zhang
|
f07f57a3a8
|
Merge pull request #1945 from PaddlePaddle/asr_line
[server][asr] refactor asr streaming server and remove useless code
|
2 years ago |
TianYuan
|
5ee3cc0c31
|
Merge pull request #1855 from yt605155624/add_vits
[TTS]add vits network scripts, test=tts
|
2 years ago |
Hui Zhang
|
c15278ed80
|
format
|
2 years ago |
TianYuan
|
327509951f
|
rm unused comment, test=tts
|
2 years ago |
TianYuan
|
c1b512c58a
|
rm fluid in tts, test=tts
|
2 years ago |
TianYuan
|
df3f975ea5
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vits
|
2 years ago |
TianYuan
|
58028509c3
|
replace dynamic_import
|
2 years ago |
TianYuan
|
8db06444c5
|
add vits trainer and synthesize
|
3 years ago |
TianYuan
|
e61757dbf7
|
fix yao, test=tts
|
3 years ago |
Jerryuhoo
|
167aaa65b9
|
normalize wav max value to 1 in preprocess, test=tts
|
3 years ago |
pollyyan
|
018dda6ee9
|
Merge pull request #1879 from QingshuChen/develop
speedyspeech support kunlun
|
3 years ago |
QingshuChen
|
e55177c3db
|
speedyspeech support kunlun
|
3 years ago |
root
|
9f389a7a33
|
support cpu, test=asr
|
3 years ago |
TianYuan
|
4b7786f2ed
|
add vits network scripts, test=tts
|
3 years ago |
Jerryuhoo
|
fba0693a20
|
fix random speaker embedding bug, test=tts
|
3 years ago |
lym0302
|
c6e6210964
|
code format, test=tts
|
3 years ago |
lym0302
|
88adcaa6dc
|
fix code, test=doc
|
3 years ago |
TianYuan
|
9121dfc046
|
Merge pull request #1752 from yt605155624/fix_wavernn
[TTS]fix wavernn white noise bug for paddle develop(2.3)
|
3 years ago |
TianYuan
|
08a4673355
|
fix wavernn bug, test=tts
|
3 years ago |
Hui Zhang
|
c7d9b11529
|
format
|
3 years ago |
TianYuan
|
4646f7cc8d
|
add paddle device set for ort and inference, test=doc
|
3 years ago |
TianYuan
|
c74fa9ada8
|
restructure syn_utils.py, test=tts
|
3 years ago |
TianYuan
|
dafe7c3657
|
add fastspeech2 cnndecoder onnx model, test=tts
|
3 years ago |
TianYuan
|
8bebf81199
|
[doc]fix typo, test=doc
|
3 years ago |
TianYuan
|
8b801ca18b
|
change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
TianYuan
|
0282d45c62
|
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
|
3 years ago |
TianYuan
|
c765fca6b4
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
|
3 years ago |
TianYuan
|
124eb6af8f
|
update notes, test=doc
|
3 years ago |
TianYuan
|
e0d222e674
|
update notes, test=doc
|
3 years ago |
TianYuan
|
f264b912fc
|
add warmup for frontend, test=doc
|
3 years ago |
TianYuan
|
7aecb2c4bb
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
TianYuan
|
a8f5990869
|
fix preprocess bug, test=tts
|
3 years ago |
TianYuan
|
828ee14404
|
add license and reference for some models, test=doc
|
3 years ago |
TianYuan
|
3aec266ca5
|
add chunk size and pad size in args, test=doc
|
3 years ago |
TianYuan
|
4d7cd0e063
|
add streaming synthesize, test=tts
|
3 years ago |
TianYuan
|
005aa4066c
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
|
3 years ago |
TianYuan
|
0fc79f474d
|
add CNNDecoder, test=tts
|
3 years ago |
TianYuan
|
bc5ae43d3a
|
restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
TianYuan
|
5347dbad3f
|
fix dtype of window of stft, test=tts
|
3 years ago |
Hui Zhang
|
4051e7b762
|
fix compliance test bug, and format
|
3 years ago |
TianYuan
|
26ef47810d
|
Merge pull request #1593 from windstamp/npu_dev_20220322
[NPU] Add NPU support for TransformerTTS
|
3 years ago |
zhangkeliang
|
59b3de6a6d
|
[NPU] test TransformerTTS with NPU
|
3 years ago |
TianYuan
|
6469568d2a
|
update readme for vctk hifigan, test=tts
|
3 years ago |
TianYuan
|
5ab2601759
|
update readme for aishell3 hifigan, test=tts
|
3 years ago |
Hui Zhang
|
40ab05a462
|
Merge pull request #1552 from yt605155624/format_syn
[TTS]format synthesize
|
3 years ago |
TianYuan
|
544c372b50
|
fix cr, test=tts
|
3 years ago |
TianYuan
|
fe8bf2a38c
|
format synthesize, test=tts
|
3 years ago |
TianYuan
|
81d964f0a0
|
add vctk hifigan, test=tts
|
3 years ago |
TianYuan
|
175c39b4a4
|
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
|
3 years ago |
Jerryuhoo
|
c116a3a926
|
fix Speedyspeech multi-speaker inference, test=tts
|
3 years ago |
TianYuan
|
cb07bd2a94
|
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
|
3 years ago |
Hui Zhang
|
e8f2d8f11b
|
Merge pull request #1507 from zh794390558/cli
[cli] add cli batch/pipe example to readme
|
3 years ago |
Hui Zhang
|
75098698d8
|
format,test=doc
|
3 years ago |
TianYuan
|
66a8beb27f
|
update text frontend, test=tts
|
3 years ago |
Hui Zhang
|
6b1fe70100
|
format code,test=doc
|
3 years ago |
TianYuan
|
67ec6242c3
|
fix ci for waveflow, test=tts
|
3 years ago |
TianYuan
|
f51097618b
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
|
3 years ago |
TianYuan
|
fc8c0e3ea2
|
fix ci for waveflow, test=tts
|
3 years ago |
Hui Zhang
|
718c849f68
|
Merge pull request #1445 from yt605155624/update_train
[TTS]init for all works in train.py when ngpu>1
|
3 years ago |
TianYuan
|
4ac7db185e
|
init for all works in train.py when ngpu>1, test=tts
|
3 years ago |
TianYuan
|
9699c00769
|
change the docstring style from numpydoc to google, test=tts
|
3 years ago |
TianYuan
|
683679bec7
|
merge data and datasets, test=tts
|
3 years ago |
TianYuan
|
7dc1f2daa3
|
fix some librosa bugs, test=tts
|
3 years ago |
TianYuan
|
25347bb6a3
|
rename tacotron2, test=tts
|
3 years ago |
Hui Zhang
|
dcfc32f1ec
|
Merge pull request #1379 from yt605155624/new_wavernn
[TTS] add wavernn
|
3 years ago |
TianYuan
|
0747600c95
|
[TTS]add ljspeech new tacotron2 (#1416)
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
TianYuan
|
348a1a33bf
|
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419)
|
3 years ago |
TianYuan
|
1b0c034134
|
update wavernn, test=tts
|
3 years ago |
TianYuan
|
89e69ee10e
|
[TTS]fix tacotron2 dygraph to static (#1414)
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
|
3 years ago |
huangyuxin
|
2a42421a63
|
cli add ds2-librispeech offline, fix versionm, test=asr
|
3 years ago |
TianYuan
|
001afee644
|
fix wavernn dygraph to static , test=tts
|
3 years ago |
TianYuan
|
2071774d81
|
add wavernn in synthesize_e2e, test=tts
|
3 years ago |
TianYuan
|
1cc7905d51
|
rm csmsc.py, test=tts
|
3 years ago |
TianYuan
|
4c3e57a23c
|
align preprocess of wavernn, test=tts
|
3 years ago |
TianYuan
|
fb0acd40a2
|
add wavernn, test=tts
|
3 years ago |
Jerryuhoo
|
f515416c4a
|
fix missing model choice, test=doc
|
3 years ago |
Jerryuhoo
|
a22080130b
|
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
|
3 years ago |
TianYuan
|
96323816e9
|
fix yamls, change labels to stop_labels, test=tts
|
3 years ago |
TianYuan
|
1bf1a876ae
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
|
3 years ago |
TianYuan
|
3fd7a7790b
|
add typehit for updater and evaluator, test=tts
|
3 years ago |
TianYuan
|
41d24337cb
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
TianYuan
|
1a9e59612a
|
fix fastspeech2 multi speaker to static, test=tts
|
3 years ago |
TianYuan
|
d368d57d67
|
fix low ips bug of speedyspeech and fastspeech2, test=tts (#1349)
|
3 years ago |
TianYuan
|
9c7f0762b0
|
update racotron2 and transformer tts, test=tts
|
3 years ago |
TianYuan
|
a1867c20c3
|
fix slice bug of speedyspeech expand, test=tts (#1337)
|
3 years ago |
TianYuan
|
7ae4f7221e
|
Update length_regulator.py
|
3 years ago |
TianYuan
|
acfe2b9084
|
Update duration_predictor.py
|
3 years ago |
TianYuan
|
caa391f461
|
fix speedyspeech inference, test=tts (#1322)
|
3 years ago |
TianYuan
|
8f507ba4ba
|
Merge pull request #1302 from jerryuhoo/develop
[TTS] Add support for finetuning speedyspeech
|
3 years ago |
Jerryuhoo
|
111a452378
|
Fix the code format, test=tts
|
3 years ago |
TianYuan
|
89e988a69e
|
add csmsc tacotron2, test=tts
|
3 years ago |
TianYuan
|
c088b9a304
|
add csmsc tacotron2
|
3 years ago |
TianYuan
|
27bb76bdb9
|
fix tone_sandhi of yi, test=tts
|
3 years ago |
Jerryuhoo
|
be99807d61
|
Add durations to gen_gta_mel.py inference
|
3 years ago |
Jerryuhoo
|
fcc34e3e95
|
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
|
3 years ago |
TianYuan
|
73dc0e2535
|
fix_ning
|
3 years ago |
Jerryuhoo
|
d6e9b76e76
|
change link_wav.py path, test=tts
|
3 years ago |
Jerryuhoo
|
c94f346207
|
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
|
3 years ago |
Jerryuhoo
|
e239ee1cd2
|
add multi-speaker support for finetuning hifigan vocoder
|
3 years ago |
TianYuan
|
680eac02b9
|
[tts]Update mb melgan (#1272)
* update mb melgan
* update mb melgan, test=tts
|
3 years ago |
TianYuan
|
98ce69d0aa
|
Merge pull request #1259 from jerryuhoo/develop
[TTS]Add multi-speaker support for the SpeedySpeech model
|
3 years ago |
JiehangXie
|
bdc48114a4
|
Update text_normlization.py
|
3 years ago |
JiehangXie
|
d88ceef7bc
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
JiehangXie
|
6065b1b607
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
Jerryuhoo
|
2dccd5315d
|
remove useless "other" dataset
|
3 years ago |
Jerryuhoo
|
f191d0b022
|
change speaker embedding position
Change speaker embedding position into the encoder.
|
3 years ago |
Jerryuhoo
|
11991b6d35
|
add multi-speaker support for speedyspeech
|
3 years ago |
TianYuan
|
b9a55262f1
|
Update fastspeech2.py
|
3 years ago |
Hui Zhang
|
c81a3f0f83
|
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
* batchsampler or distributebatchsampler
* format
|
3 years ago |
TianYuan
|
fbe3c05137
|
add style_melgan and hifigan in tts cli, test=tts (#1241)
|
3 years ago |
TianYuan
|
a232cd8b12
|
Update fastspeech2.py
|
3 years ago |
TianYuan
|
42c109216d
|
[tts]add style melgan pretraied model (#1228)
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
Hui Zhang
|
bb2a370b23
|
[asr] remove useless conf of librispeech (#1227)
* remve useless conf
* format code
* update conf
* update conf
* update conf
|
3 years ago |
TianYuan
|
b031ee43c4
|
Merge pull request #1215 from yt605155624/refactor_punc
[text]Refactor punc
|
3 years ago |
TianYuan
|
15b8904fa1
|
refactor punc
|
3 years ago |
JiehangXie
|
927c9bbdb6
|
Fix a bug when sentence inputed contain English words
|
3 years ago |
Jerryuhoo
|
3cbfd7bf35
|
Add speaker embedding and speaker id for style fastspeech2 inference
|
3 years ago |
Hui Zhang
|
d852aee2ff
|
[asr] logfbank with dither (#1179)
* fix logfbank dither
* format
|
3 years ago |
TianYuan
|
19ef7210a0
|
[TTS]Add hifigan (#1097)
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
|
3 years ago |
TianYuan
|
675cff258b
|
[TTS]fix praatio version, test=tts (#1158)
* fix praatio version, test=tts
* fix praatio version, test=tts
|
3 years ago |
TianYuan
|
f9efbf3063
|
Update generate_lexicon.py
|
3 years ago |
TianYuan
|
9db1710ba7
|
add conformer demos (#1108)
|
3 years ago |
TianYuan
|
963e906f56
|
Merge pull request #1068 from yt605155624/add_style_melgan
[TTS]add style_melgan
|
3 years ago |
TianYuan
|
f701882b66
|
update add_style_melgan
|
3 years ago |
TianYuan
|
2189b46004
|
add tts cli
|
3 years ago |
TianYuan
|
7b2ecb6eed
|
add style_melgan, test=tts
|
3 years ago |
TianYuan
|
aead853b1d
|
Update zh_frontend.py
|
3 years ago |
TianYuan
|
dd36eafe34
|
add style_melgan
|
3 years ago |
TianYuan
|
6e3257ab8a
|
Create __init__.py
|
3 years ago |
TianYuan
|
a861e56e91
|
rm space for pure Chinese
|
3 years ago |
TianYuan
|
dad1cbbcd6
|
update text frontend
|
3 years ago |
Hui Zhang
|
2bbfdbae91
|
Merge pull request #1015 from yt605155624/fs2_conformer
[TTS]fastspeech2 conformer
|
3 years ago |
TianYuan
|
b0a1d8ab60
|
fix base
|
3 years ago |
TianYuan
|
469329221b
|
refactor encoder, rm old code
|
3 years ago |
Hui Zhang
|
fe83adfbcb
|
nproc to ngpu
|
3 years ago |
TianYuan
|
bc0dd51149
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
|
3 years ago |
TianYuan
|
4370c5cfa6
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
|
3 years ago |
Hui Zhang
|
c354e9154b
|
Merge pull request #1003 from yt605155624/fs2_ge2e
[TTS]add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
133ee7db0b
|
rename num_speakers
|
3 years ago |
TianYuan
|
3d5e078c91
|
add conformer
|
3 years ago |
TianYuan
|
a97c7b5206
|
rename spembs
|
3 years ago |
TianYuan
|
8d025451de
|
add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
c5c9f19091
|
rename to gen_gta_mel.py, remove stats compute when gen fintune data
|
3 years ago |
TianYuan
|
a6ac497f8e
|
add multi-band melgan finetune scripts
|
3 years ago |
TianYuan
|
30d09b411d
|
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
|
3 years ago |
TianYuan
|
0bc9450c51
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
TianYuan
|
0fcc5005a2
|
add tts tutorial
|
3 years ago |
Hui Zhang
|
7ceef6c3f5
|
format code
|
3 years ago |
TianYuan
|
bacdf5756b
|
Merge remote-tracking branch 'origin/develop' into merge_paddlespeech
|
3 years ago |
TianYuan
|
35c37ace17
|
change nprocs to ngpu, add aishell3/voc1
|
3 years ago |
TianYuan
|
6655728b08
|
add reference
|
3 years ago |
Hui Zhang
|
38cf56295a
|
fix reference format
|
3 years ago |
Hui Zhang
|
c463a00f81
|
add reference code license
|
3 years ago |
Hui Zhang
|
e2bcaee4f1
|
merge deepspeech, parakeet and text_processing into paddlespeech
|
3 years ago |