Honei
305bacdcf2
Merge branch 'develop' into vox12
3 years ago
xiongxinlei
0bb67d8b8e
add vector cli unit test, test=doc
3 years ago
KP
b6e976a860
Merge pull request #1602 from yt605155624/fix_dtype
...
[TTS]fix dtype of window of stft
3 years ago
xiongxinlei
62cbce6915
add vectorwrapper to extract audio embedding
3 years ago
TianYuan
8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
...
[TTS] update readme for ljspeech hifigan
3 years ago
TianYuan
5347dbad3f
fix dtype of window of stft, test=tts
3 years ago
TianYuan
342b487383
update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang
4051e7b762
fix compliance test bug, and format
3 years ago
TianYuan
26ef47810d
Merge pull request #1593 from windstamp/npu_dev_20220322
...
[NPU] Add NPU support for TransformerTTS
3 years ago
zhangkeliang
59b3de6a6d
[NPU] test TransformerTTS with NPU
3 years ago
Jackwaterveg
fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
...
[ASR] change default initializer to kaiming_uniform
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
Hui Zhang
b5315657ff
Merge pull request #1509 from qingen/cluster
...
[vec] add clustering of vectors
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
TianYuan
e5e8b8a129
Merge pull request #1587 from yt605155624/add_vctk_hifigan
...
[TTS]Add vctk hifigan
3 years ago
TianYuan
6469568d2a
update readme for vctk hifigan, test=tts
3 years ago
huangyuxin
a4f5a68074
fix some format, test=asr
3 years ago
xiongxinlei
d85d1deef5
exec pre-commit in paddlespeech vector, test=doc
3 years ago
xiongxinlei
9874fb7d75
add some comments in code
3 years ago
huangyuxin
e991d82ae7
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
huangyuxin
d53e1163a6
update the code, test=asr
3 years ago
xiongxinlei
b9eafddd94
change - to _ to distinguish field
3 years ago
xiongxinlei
9c6735f921
add vector voxceleb12 base mode url, test=doc
3 years ago
xiongxinlei
d28ccfa96b
add vector cli component, test=doc
3 years ago
KP
831cadacc7
Add paddleaudio doc.
3 years ago
TianYuan
5ab2601759
update readme for aishell3 hifigan, test=tts
3 years ago
Hui Zhang
6abc5d9f7e
format
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
qingen
0f7ede11ef
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen
d16ce21d47
[wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei
506d26a957
change the code style to s2t code style, test=doc
3 years ago
xiongxinlei
311fa87a11
add some comments to the code
3 years ago
Hui Zhang
90deeca06f
Merge pull request #1554 from lym0302/develop
...
[server] add server cls
3 years ago
lym0302
89457b273a
modify, test=doc
3 years ago
xiongxinlei
8ed5c287a3
add vox2 data into VoxCeleb class
3 years ago
lym0302
77bad44e8b
modify readme, test=doc
3 years ago
lym0302
8ef92a9495
modify, test=doc
3 years ago
lym0302
89dbda58f6
add cls static model, test=doc
3 years ago
Hui Zhang
40ab05a462
Merge pull request #1552 from yt605155624/format_syn
...
[TTS]format synthesize
3 years ago
lym0302
5187df847f
modify server demo, test=doc
3 years ago
xiongxinlei
584a2c0e39
add ecapa-tdnn config yaml file
3 years ago
lym0302
0a6602c708
modify application.yaml, test=doc
3 years ago
TianYuan
544c372b50
fix cr, test=tts
3 years ago
lym0302
99fa7a8205
add server cls, test=doc
3 years ago
TianYuan
fe8bf2a38c
format synthesize, test=tts
3 years ago
xiongxinlei
993d6783d7
remove unused code, test=doc
3 years ago
xiongxinlei
0e87037f2c
refactor to compilance paddleaudio
3 years ago
xiongxinlei
4473405f82
merge develop to vox12, test=doc
3 years ago
Honei
0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei
60d73bb7bd
add state 0 to prepare the voxcele data and augment data
3 years ago
xiongxinlei
14efbf5b15
check extract embedding result, test=doc
3 years ago
xiongxinlei
386ef3f161
add voxceleb augment unit test, test=doc
3 years ago
Hui Zhang
5147163592
Merge pull request #1544 from yt605155624/add_vctk_hifigan
...
[tts]add vctk hifigan egs
3 years ago
TianYuan
81d964f0a0
add vctk hifigan, test=tts
3 years ago
xiongxinlei
2d89c80e6f
add waveform augment pipeline, test=doc
3 years ago
lym0302
3b304544f6
modify yaml, test=doc
3 years ago
xiongxinlei
ac4967e204
optimize the data prepare process
3 years ago
xiongxinlei
016ed6d69c
repair the code according to the part comment, test=doc
3 years ago
Hui Zhang
2886ab9373
Merge pull request #1530 from lym0302/server_cli
...
[server] add server test
3 years ago
xiongxinlei
1f74af110b
add training log info and comment, test=doc
3 years ago
lym0302
e50c1b3b1d
add server test, test=doc
3 years ago
xiongxinlei
4648059b5f
add training process for sid, test=doc
3 years ago
xiongxinlei
7668f61422
add sid dataloader for training, test=doc
3 years ago
xiongxinlei
6af2bc3d5b
add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei
57c4f4a68c
add sid learning rate and training model
3 years ago
TianYuan
4d2f2191a8
fix gbk encode bug
3 years ago
Honei
1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan
175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
...
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
3 years ago
Hui Zhang
5ba4907c44
Merge pull request #1514 from lym0302/server_cli
...
[server] update server cli
3 years ago
lym0302
85d4a31e04
update application.yaml, test=doc
3 years ago
Jerryuhoo
c116a3a926
fix Speedyspeech multi-speaker inference, test=tts
3 years ago
lym0302
ab04488738
update server cli, test=doc
3 years ago
TianYuan
cb07bd2a94
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
Hui Zhang
26d413ce8f
Merge pull request #1510 from lym0302/paddlespeech_stats
...
[server] add paddlespeech_server stats
3 years ago
lym0302
72c0cda30c
add paddlespeech_server stats, test=doc
3 years ago
Hui Zhang
e8f2d8f11b
Merge pull request #1507 from zh794390558/cli
...
[cli] add cli batch/pipe example to readme
3 years ago
Hui Zhang
2517df92a0
Merge pull request #1508 from lym0302/paddlespeech_stats
...
[CLI] modified text sr to lang
3 years ago
TianYuan
b6d33a7fb4
Merge pull request #1506 from yt605155624/fix_frontend
...
[TTS]update text frontend, test=tts
3 years ago
lym0302
395c923dee
modified text sr to lang, test=doc
3 years ago
Hui Zhang
75098698d8
format,test=doc
3 years ago
TianYuan
66a8beb27f
update text frontend, test=tts
3 years ago
lym0302
96abb33b5b
add __call__, test=doc
3 years ago
lym0302
5f1728f855
rm server related, test=doc
3 years ago
xiongxinlei
70d3b01c0d
remove invalid code
3 years ago
xiongxinlei
d7da629302
add kaldi feats egs dataset
3 years ago
xiongxinlei
6f7e9656fe
add kaldi feats ark dataset
3 years ago
lym0302
35357e775e
update, test=doc
3 years ago
lym0302
e5aa24fa5a
resolve setup.py conflicts, test=doc
3 years ago
lym0302
fe6be4a65e
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
3 years ago
lym0302
f8375764b9
add paddlespeech stats, test=doc
3 years ago
Hui Zhang
8d474c2658
Merge pull request #1482 from lym0302/servercli_update
...
[server] update server cli
3 years ago
lym0302
162361d878
format code, test=doc
3 years ago
lym0302
434708cff4
set device cpu, test=doc
3 years ago
lym0302
920b2c808c
paras required, test=doc
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
lym0302
6b2dd16845
update server cli, test=doc
3 years ago
WilliamZhang06
78c9b7342c
deleted wav file , test=doc
3 years ago
WilliamZhang06
a6ec3a26f1
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
8b4602f738
added isinstance code, test=doc
3 years ago
lym0302
bb60561c66
update util, test=doc
3 years ago
WilliamZhang06
147018a8b4
added cli changed code, test=doc
3 years ago
lym0302
332009142b
add server demo, test=doc
3 years ago
WilliamZhang06
7ebe904e20
fixed overload , test=doc
3 years ago
Hui Zhang
60c0877e7a
Merge pull request #1472 from KPatr1ck/cli_batch
...
[CLI][Logger]Add cli logger control.
3 years ago
WilliamZhang06
b8f16ac9b0
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
Hui Zhang
49f80afe6a
Merge pull request #1381 from PaddlePaddle/server
...
[server] speech server init version
3 years ago
lym0302
b508c4d0cb
add readme, test=doc
3 years ago
KP
d36a4ccfc8
Add cli logger control.
3 years ago
KP
94ed5969fa
Add cli logger control.
3 years ago
lym0302
42cbe313c2
improve cli code, test=doc
3 years ago
lym0302
2bf4b4521f
add cli, test=doc
3 years ago
lym0302
8fd117e4da
add cli, test=doc
3 years ago
lym0302
80b83b7434
add cli, test=doc
3 years ago
KP
7814fba07f
Update batch input.
3 years ago
KP
05288fe1c3
Update batch input and stdin input.
3 years ago
KP
1818b058aa
Support batch input in cls task.
3 years ago
WilliamZhang06
35e3be9ac8
Merge remote-tracking branch 'remote/develop' into server
3 years ago
TianYuan
ae521d3700
Update infer.py
3 years ago
lym0302
07158b2f12
move dir, test=doc
3 years ago
lym0302
76391275fc
move dir, test=doc
3 years ago
TianYuan
67ec6242c3
fix ci for waveflow, test=tts
3 years ago
TianYuan
f51097618b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
3 years ago
TianYuan
fc8c0e3ea2
fix ci for waveflow, test=tts
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
Hui Zhang
718c849f68
Merge pull request #1445 from yt605155624/update_train
...
[TTS]init for all works in train.py when ngpu>1
3 years ago
Hui Zhang
f3ec985aaf
Merge pull request #1439 from Jackwaterveg/tipc
...
[TIPC]Add tipc_benchmark of conformer
3 years ago
TianYuan
4ac7db185e
init for all works in train.py when ngpu>1, test=tts
3 years ago
Jackwaterveg
426bae3de1
Merge pull request #1440 from yt605155624/merge_datasets
...
[TTS]Merge datasets, change style of docstring
3 years ago
TianYuan
2cec8f6c76
update tts cli, test=doc
3 years ago
TianYuan
9699c00769
change the docstring style from numpydoc to google, test=tts
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
683679bec7
merge data and datasets, test=tts
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
TianYuan
30085ac229
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
3 years ago
TianYuan
25347bb6a3
rename tacotron2, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
Hui Zhang
dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
...
[TTS] add wavernn
3 years ago
TianYuan
0747600c95
[TTS]add ljspeech new tacotron2 ( #1416 )
...
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
TianYuan
348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts ( #1419 )
3 years ago
huangyuxin
f428ec4431
change log of cli/asr/infer
3 years ago
TianYuan
1b0c034134
update wavernn, test=tts
3 years ago
TianYuan
89e69ee10e
[TTS]fix tacotron2 dygraph to static ( #1414 )
...
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Hui Zhang
4128f4d61f
fix __version__ error in develop ( #1398 )
3 years ago
TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc ]add tacotron2 readme ( #1385 )
...
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
1cc7905d51
rm csmsc.py, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructer problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Jerryuhoo
f515416c4a
fix missing model choice, test=doc
3 years ago
Jerryuhoo
a22080130b
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang
97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
...
[TTS]Add new tacotron2
3 years ago
huangyuxin
3845804cc9
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
3fd7a7790b
add typehit for updater and evaluator, test=tts
3 years ago
huangyuxin
4e31247633
refacto the code
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin
565a63c5ef
refactor the setup in paddleaudio
3 years ago
huangyuxin
eb91ce84f9
refactor the version
3 years ago
Hui Zhang
4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
...
[CLI] asr, Add Deepspeech2 online and offline model
3 years ago
Hui Zhang
d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
...
[ST] beam search with optimality guarantees
3 years ago
huangyuxin
ab759b16de
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin
38edfd1a89
Add Deepspeech2 online and offline in cli
3 years ago
TianYuan
d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts ( #1349 )
3 years ago
TianYuan
9c7f0762b0
update racotron2 and transformer tts, test=tts
3 years ago
huangyuxin
8028f33b7f
synchronize the version
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
TianYuan
a1867c20c3
fix slice bug of speedyspeech expand, test=tts ( #1337 )
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
TianYuan
7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan
acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan
caa391f461
fix speedyspeech inference, test=tts ( #1322 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
...
[TTS] Add support for finetuning speedyspeech
3 years ago
Jerryuhoo
111a452378
Fix the code format, test=tts
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
huangyuxin
fe1dc9d211
refactor the cli/st, test=st
3 years ago
TianYuan
27bb76bdb9
fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo
be99807d61
Add durations to gen_gta_mel.py inference
3 years ago
KP
52a8b2f320
Add ECAPA_TDNN. ( #1301 )
3 years ago
Jerryuhoo
fcc34e3e95
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_metod and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
KP
c09466ebbe
Add ECAPA_TDNN. ( #1295 )
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
TianYuan
73dc0e2535
fix_ning
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
TianYuan
318cc9e539
Merge branch 'develop' into develop
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
KP
a810cd4e5c
Add cli logging. ( #1274 )
3 years ago