Jackwaterveg
|
c852776bc6
|
test=doc
|
3 years ago |
TianYuan
|
f264b912fc
|
add warmup for frontend, test=doc
|
3 years ago |
Jackwaterveg
|
4922e697e1
|
update cli, test = asr
|
3 years ago |
Jackwaterveg
|
1c05d03806
|
test=asr
|
3 years ago |
xiongxinlei
|
9b5f7f71ac
|
add part ecapa-tdnn note, test=doc
|
3 years ago |
Hui Zhang
|
6eed542c08
|
Merge pull request #1660 from yt605155624/fix_pre
[TTS]fix preprocess bug, test=tts
|
3 years ago |
Honei
|
83310b6379
|
Merge branch 'develop' into develop
|
3 years ago |
huangyuxin
|
faf21f033f
|
add duration limitation for asr
|
3 years ago |
TianYuan
|
7aecb2c4bb
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
xiongxinlei
|
d064c8196e
|
update the speaker verification model, test=doc
|
3 years ago |
xiongxinlei
|
e72912adb9
|
update the speaker verification model, test=doc
|
3 years ago |
TianYuan
|
a8f5990869
|
fix preprocess bug, test=tts
|
3 years ago |
lym0302
|
759a9e61e4
|
update server cli, test=doc
|
3 years ago |
lym0302
|
603e565ab1
|
add stream tts server, test=doc
|
3 years ago |
ccrrong
|
378fe5909f
|
add ami diarization pipeline, test=doc
|
3 years ago |
xiongxinlei
|
48b8cc8937
|
add score method, test=doc
|
3 years ago |
xiongxinlei
|
ebfe3e6b13
|
test.py update the CSVDataset, test=doc
|
3 years ago |
xiongxinlei
|
acebfad7b7
|
change the vector csv.spk_id to csv.label, test=doc
|
3 years ago |
xiongxinlei
|
57c11dcab0
|
add some annotations, test=doc
|
3 years ago |
xiongxinlei
|
30b5b3cb9e
|
add vector csv dataset format, test=doc
|
3 years ago |
TianYuan
|
e366fb6b2f
|
Merge pull request #1643 from Jackwaterveg/check
[Doc] supplement note
|
3 years ago |
huangyuxin
|
ca860e3d2f
|
supplement note
|
3 years ago |
TianYuan
|
828ee14404
|
add license and reference for some models, test=doc
|
3 years ago |
xiongxinlei
|
5b05300e53
|
train process add new voxceleb and rirs dataset, test=doc
|
3 years ago |
xiongxinlei
|
965f486dd5
|
add voxceleb and rirs noise dataset
|
3 years ago |
Hui Zhang
|
36df70cbe6
|
Merge pull request #1638 from zh794390558/spx_refactor
[speechx] refactor audio/data/feature cache
|
3 years ago |
TianYuan
|
5bff096715
|
Merge pull request #1634 from yt605155624/cnn_decoder
[TTS]Cnn decoder
|
3 years ago |
TianYuan
|
3aec266ca5
|
add chunk size and pad size in args, test=doc
|
3 years ago |
Hui Zhang
|
cb39777a60
|
format code
|
3 years ago |
TianYuan
|
4d7cd0e063
|
add streaming synthesize, test=tts
|
3 years ago |
liangym
|
602b0b0da3
|
Merge pull request #1632 from lym0302/develop
[server] fix output bug
|
3 years ago |
Hui Zhang
|
61941d14b0
|
Merge pull request #1627 from WilliamZhang06/ws-develop
[websocket] added online asr engine
|
3 years ago |
WilliamZhang06
|
2ec8d608bf
|
fixed comments, test=doc
|
3 years ago |
liangym
|
21c4132eda
|
Update paddlespeech_client.py
|
3 years ago |
TianYuan
|
005aa4066c
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
|
3 years ago |
TianYuan
|
0fc79f474d
|
add CNNDecoder, test=tts
|
3 years ago |
WilliamZhang06
|
d847fe29cf
|
added online asr engine , test=doc
|
3 years ago |
TianYuan
|
318edec303
|
Merge pull request #1613 from yt605155624/restructure_expand
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
Hui Zhang
|
943d4ac1ee
|
Merge pull request #1612 from Jackwaterveg/update
[ASR] Replace kaldi_fbank with paddleaudio
|
3 years ago |
huangyuxin
|
f47146af49
|
add docstring, test=asr
|
3 years ago |
huangyuxin
|
ed490b66cb
|
update spectrogram, test=asr
|
3 years ago |
Hui Zhang
|
84d712d493
|
format code, test=doc
|
3 years ago |
Honei
|
d60856b1ed
|
Merge pull request #1614 from Honei/vox12
[vec]change the vector output to numpy.array
|
3 years ago |
xiongxinlei
|
ed7113f320
|
change the vector output to numpy.array
|
3 years ago |
TianYuan
|
bc5ae43d3a
|
restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
huangyuxin
|
0ffe1f9114
|
replace kaldi_fbank with paddleaudio
|
3 years ago |
Hui Zhang
|
caee809513
|
Merge pull request #1605 from Honei/vox12
[vec]add speaker verification demo and doc
|
3 years ago |
xiongxinlei
|
5ae57206f3
|
add paddlespeech vector modules __init__.py
|
3 years ago |
xiongxinlei
|
2c9dc0c89b
|
add some vector cli comments, test=doc
|
3 years ago |
xiongxinlei
|
ef1bc5e815
|
vector cli output dim info, test=doc
|
3 years ago |
xiongxinlei
|
d5142e5e15
|
add vector cli annotation, test=doc
|
3 years ago |
xiongxinlei
|
ad2caf2ccb
|
add speaker verification demo and doc, test=doc
|
3 years ago |
TianYuan
|
3cc0ec950e
|
Merge pull request #1604 from lym0302/add_readme
[server] update readme
|
3 years ago |
lym0302
|
829f1e332e
|
update readme, test=doc
|
3 years ago |
xiongxinlei
|
0f78d25f76
|
add vector cli batch and pipeline test demo, test=doc
|
3 years ago |
Honei
|
305bacdcf2
|
Merge branch 'develop' into vox12
|
3 years ago |
xiongxinlei
|
0bb67d8b8e
|
add vector cli unit test, test=doc
|
3 years ago |
KP
|
b6e976a860
|
Merge pull request #1602 from yt605155624/fix_dtype
[TTS]fix dtype of window of stft
|
3 years ago |
xiongxinlei
|
62cbce6915
|
add vectorwrapper to extract audio embedding
|
3 years ago |
TianYuan
|
8938483529
|
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
[TTS] update readme for ljspeech hifigan
|
3 years ago |
TianYuan
|
5347dbad3f
|
fix dtype of window of stft, test=tts
|
3 years ago |
TianYuan
|
342b487383
|
update readme for ljspeech hifigan, test=tts
|
3 years ago |
Hui Zhang
|
4051e7b762
|
fix compliance test bug, and format
|
3 years ago |
TianYuan
|
26ef47810d
|
Merge pull request #1593 from windstamp/npu_dev_20220322
[NPU] Add NPU support for TransformerTTS
|
3 years ago |
zhangkeliang
|
59b3de6a6d
|
[NPU] test TransformerTTS with NPU
|
3 years ago |
Jackwaterveg
|
fcc1762048
|
Merge pull request #1577 from Jackwaterveg/change_init
[ASR] change default initializer to kaiming_uniform
|
3 years ago |
huangyuxin
|
e1b581b622
|
fix some bug, test=asr
|
3 years ago |
Hui Zhang
|
b5315657ff
|
Merge pull request #1509 from qingen/cluster
[vec] add clustering of vectors
|
3 years ago |
huangyuxin
|
6da8465f14
|
add dist_sampler args, test=asr
|
3 years ago |
TianYuan
|
e5e8b8a129
|
Merge pull request #1587 from yt605155624/add_vctk_hifigan
[TTS]Add vctk hifigan
|
3 years ago |
TianYuan
|
6469568d2a
|
update readme for vctk hifigan, test=tts
|
3 years ago |
huangyuxin
|
a4f5a68074
|
fix some format, test=asr
|
3 years ago |
xiongxinlei
|
d85d1deef5
|
exec pre-commit in paddlespeech vector, test=doc
|
3 years ago |
xiongxinlei
|
9874fb7d75
|
add some comments in code
|
3 years ago |
huangyuxin
|
e991d82ae7
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
|
3 years ago |
huangyuxin
|
d53e1163a6
|
update the code, test=asr
|
3 years ago |
xiongxinlei
|
b9eafddd94
|
change - to _ to distinguish field
|
3 years ago |
xiongxinlei
|
9c6735f921
|
add vector voxceleb12 base mode url, test=doc
|
3 years ago |
xiongxinlei
|
d28ccfa96b
|
add vector cli component, test=doc
|
3 years ago |
KP
|
831cadacc7
|
Add paddleaudio doc.
|
3 years ago |
TianYuan
|
5ab2601759
|
update readme for aishell3 hifigan, test=tts
|
3 years ago |
Hui Zhang
|
6abc5d9f7e
|
format
|
3 years ago |
huangyuxin
|
ab16d8ce3c
|
change default initializer to kaiming_uniform, test=asr
|
3 years ago |
qingen
|
0f7ede11ef
|
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
|
3 years ago |
qingen
|
d16ce21d47
|
[wip][vec] update cluster of diarization, test=doc #1304
|
3 years ago |
xiongxinlei
|
506d26a957
|
change the code style to s2t code style, test=doc
|
3 years ago |
xiongxinlei
|
311fa87a11
|
add some comments to the code
|
3 years ago |
Hui Zhang
|
90deeca06f
|
Merge pull request #1554 from lym0302/develop
[server] add server cls
|
3 years ago |
lym0302
|
89457b273a
|
modify, test=doc
|
3 years ago |
xiongxinlei
|
8ed5c287a3
|
add vox2 data into VoxCeleb class
|
3 years ago |
lym0302
|
77bad44e8b
|
modify readme, test=doc
|
3 years ago |
lym0302
|
8ef92a9495
|
modify, test=doc
|
3 years ago |
lym0302
|
89dbda58f6
|
add cls static model, test=doc
|
3 years ago |
Hui Zhang
|
40ab05a462
|
Merge pull request #1552 from yt605155624/format_syn
[TTS]format synthesize
|
3 years ago |
lym0302
|
5187df847f
|
modify server demo, test=doc
|
3 years ago |
xiongxinlei
|
584a2c0e39
|
add ecapa-tdnn config yaml file
|
3 years ago |
lym0302
|
0a6602c708
|
modify application.yaml, test=doc
|
3 years ago |
TianYuan
|
544c372b50
|
fix cr, test=tts
|
3 years ago |
lym0302
|
99fa7a8205
|
add server cls, test=doc
|
3 years ago |
TianYuan
|
fe8bf2a38c
|
format synthesize, test=tts
|
3 years ago |
xiongxinlei
|
993d6783d7
|
remove unused code, test=doc
|
3 years ago |
xiongxinlei
|
0e87037f2c
|
refactor to comply with paddleaudio
|
3 years ago |
xiongxinlei
|
4473405f82
|
merge develop to vox12, test=doc
|
3 years ago |
Honei
|
0dee8f40e9
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
xiongxinlei
|
60d73bb7bd
|
add state 0 to prepare the voxceleb data and augment data
|
3 years ago |
xiongxinlei
|
14efbf5b15
|
check extract embedding result, test=doc
|
3 years ago |
xiongxinlei
|
386ef3f161
|
add voxceleb augment unit test, test=doc
|
3 years ago |
Hui Zhang
|
5147163592
|
Merge pull request #1544 from yt605155624/add_vctk_hifigan
[tts]add vctk hifigan egs
|
3 years ago |
TianYuan
|
81d964f0a0
|
add vctk hifigan, test=tts
|
3 years ago |
xiongxinlei
|
2d89c80e6f
|
add waveform augment pipeline, test=doc
|
3 years ago |
lym0302
|
3b304544f6
|
modify yaml, test=doc
|
3 years ago |
xiongxinlei
|
ac4967e204
|
optimize the data prepare process
|
3 years ago |
xiongxinlei
|
016ed6d69c
|
repair the code according to the part comment, test=doc
|
3 years ago |
Hui Zhang
|
2886ab9373
|
Merge pull request #1530 from lym0302/server_cli
[server] add server test
|
3 years ago |
xiongxinlei
|
1f74af110b
|
add training log info and comment, test=doc
|
3 years ago |
lym0302
|
e50c1b3b1d
|
add server test, test=doc
|
3 years ago |
xiongxinlei
|
4648059b5f
|
add training process for sid, test=doc
|
3 years ago |
xiongxinlei
|
7668f61422
|
add sid dataloader for training, test=doc
|
3 years ago |
xiongxinlei
|
6af2bc3d5b
|
add sid loss wrapper for voxceleb, test=doc
|
3 years ago |
xiongxinlei
|
57c4f4a68c
|
add sid learning rate and training model
|
3 years ago |
TianYuan
|
4d2f2191a8
|
fix gbk encode bug
|
3 years ago |
Honei
|
1395b5f5fa
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
TianYuan
|
175c39b4a4
|
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
|
3 years ago |
Hui Zhang
|
5ba4907c44
|
Merge pull request #1514 from lym0302/server_cli
[server] update server cli
|
3 years ago |
lym0302
|
85d4a31e04
|
update application.yaml, test=doc
|
3 years ago |
Jerryuhoo
|
c116a3a926
|
fix Speedyspeech multi-speaker inference, test=tts
|
3 years ago |
lym0302
|
ab04488738
|
update server cli, test=doc
|
3 years ago |
TianYuan
|
cb07bd2a94
|
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
|
3 years ago |
Hui Zhang
|
26d413ce8f
|
Merge pull request #1510 from lym0302/paddlespeech_stats
[server] add paddlespeech_server stats
|
3 years ago |
lym0302
|
72c0cda30c
|
add paddlespeech_server stats, test=doc
|
3 years ago |
Hui Zhang
|
e8f2d8f11b
|
Merge pull request #1507 from zh794390558/cli
[cli] add cli batch/pipe example to readme
|
3 years ago |
Hui Zhang
|
2517df92a0
|
Merge pull request #1508 from lym0302/paddlespeech_stats
[CLI] modified text sr to lang
|
3 years ago |
TianYuan
|
b6d33a7fb4
|
Merge pull request #1506 from yt605155624/fix_frontend
[TTS]update text frontend, test=tts
|
3 years ago |
lym0302
|
395c923dee
|
modified text sr to lang, test=doc
|
3 years ago |
Hui Zhang
|
75098698d8
|
format,test=doc
|
3 years ago |
TianYuan
|
66a8beb27f
|
update text frontend, test=tts
|
3 years ago |
lym0302
|
96abb33b5b
|
add __call__, test=doc
|
3 years ago |
lym0302
|
5f1728f855
|
rm server related, test=doc
|
3 years ago |
xiongxinlei
|
70d3b01c0d
|
remove invalid code
|
3 years ago |
xiongxinlei
|
d7da629302
|
add kaldi feats egs dataset
|
3 years ago |
xiongxinlei
|
6f7e9656fe
|
add kaldi feats ark dataset
|
3 years ago |
lym0302
|
35357e775e
|
update, test=doc
|
3 years ago |
lym0302
|
e5aa24fa5a
|
resolve setup.py conflicts, test=doc
|
3 years ago |
lym0302
|
fe6be4a65e
|
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
|
3 years ago |
lym0302
|
f8375764b9
|
add paddlespeech stats, test=doc
|
3 years ago |
Hui Zhang
|
8d474c2658
|
Merge pull request #1482 from lym0302/servercli_update
[server] update server cli
|
3 years ago |
lym0302
|
162361d878
|
format code, test=doc
|
3 years ago |
lym0302
|
434708cff4
|
set device cpu, test=doc
|
3 years ago |
lym0302
|
920b2c808c
|
paras required, test=doc
|
3 years ago |
Hui Zhang
|
6b1fe70100
|
format code,test=doc
|
3 years ago |
lym0302
|
6b2dd16845
|
update server cli, test=doc
|
3 years ago |
WilliamZhang06
|
78c9b7342c
|
deleted wav file , test=doc
|
3 years ago |
WilliamZhang06
|
a6ec3a26f1
|
Merge branch 'develop' into server_asr
|
3 years ago |
WilliamZhang06
|
8b4602f738
|
added isinstance code, test=doc
|
3 years ago |
lym0302
|
bb60561c66
|
update util, test=doc
|
3 years ago |
WilliamZhang06
|
147018a8b4
|
added cli changed code, test=doc
|
3 years ago |
lym0302
|
332009142b
|
add server demo, test=doc
|
3 years ago |
WilliamZhang06
|
7ebe904e20
|
fixed overload , test=doc
|
3 years ago |
Hui Zhang
|
60c0877e7a
|
Merge pull request #1472 from KPatr1ck/cli_batch
[CLI][Logger]Add cli logger control.
|
3 years ago |
WilliamZhang06
|
b8f16ac9b0
|
Merge branch 'develop' into server_asr
|
3 years ago |
WilliamZhang06
|
da3ea7bb40
|
added engine type and asr inference , test=doc
|
3 years ago |
Hui Zhang
|
49f80afe6a
|
Merge pull request #1381 from PaddlePaddle/server
[server] speech server init version
|
3 years ago |
lym0302
|
b508c4d0cb
|
add readme, test=doc
|
3 years ago |
KP
|
d36a4ccfc8
|
Add cli logger control.
|
3 years ago |
KP
|
94ed5969fa
|
Add cli logger control.
|
3 years ago |
lym0302
|
42cbe313c2
|
improve cli code, test=doc
|
3 years ago |
lym0302
|
2bf4b4521f
|
add cli, test=doc
|
3 years ago |
lym0302
|
8fd117e4da
|
add cli, test=doc
|
3 years ago |
lym0302
|
80b83b7434
|
add cli, test=doc
|
3 years ago |
KP
|
7814fba07f
|
Update batch input.
|
3 years ago |
KP
|
05288fe1c3
|
Update batch input and stdin input.
|
3 years ago |
KP
|
1818b058aa
|
Support batch input in cls task.
|
3 years ago |
WilliamZhang06
|
35e3be9ac8
|
Merge remote-tracking branch 'remote/develop' into server
|
3 years ago |
TianYuan
|
ae521d3700
|
Update infer.py
|
3 years ago |
lym0302
|
07158b2f12
|
move dir, test=doc
|
3 years ago |
lym0302
|
76391275fc
|
move dir, test=doc
|
3 years ago |
TianYuan
|
67ec6242c3
|
fix ci for waveflow, test=tts
|
3 years ago |
TianYuan
|
f51097618b
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
|
3 years ago |
TianYuan
|
fc8c0e3ea2
|
fix ci for waveflow, test=tts
|
3 years ago |
huangyuxin
|
95d5274aef
|
fix sortagrad, test=asr
|
3 years ago |
Hui Zhang
|
718c849f68
|
Merge pull request #1445 from yt605155624/update_train
[TTS]init for all works in train.py when ngpu>1
|
3 years ago |
Hui Zhang
|
f3ec985aaf
|
Merge pull request #1439 from Jackwaterveg/tipc
[TIPC]Add tipc_benchmark of conformer
|
3 years ago |
TianYuan
|
4ac7db185e
|
init for all works in train.py when ngpu>1, test=tts
|
3 years ago |
Jackwaterveg
|
426bae3de1
|
Merge pull request #1440 from yt605155624/merge_datasets
[TTS]Merge datasets, change style of docstring
|
3 years ago |
TianYuan
|
2cec8f6c76
|
update tts cli, test=doc
|
3 years ago |
TianYuan
|
9699c00769
|
change the docstring style from numpydoc to google, test=tts
|
3 years ago |
huangyuxin
|
aefe9e93a7
|
add tipc benchmark of conformer
|
3 years ago |
TianYuan
|
683679bec7
|
merge data and datasets, test=tts
|
3 years ago |
TianYuan
|
7dc1f2daa3
|
fix some librosa bugs, test=tts
|
3 years ago |
TianYuan
|
30085ac229
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
|
3 years ago |
TianYuan
|
25347bb6a3
|
rename tacotron2, test=tts
|
3 years ago |
huangyuxin
|
9a55783aa0
|
fix resample
|
3 years ago |
Hui Zhang
|
dcfc32f1ec
|
Merge pull request #1379 from yt605155624/new_wavernn
[TTS] add wavernn
|
3 years ago |
TianYuan
|
0747600c95
|
[TTS]add ljspeech new tacotron2 (#1416)
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
TianYuan
|
348a1a33bf
|
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419)
|
3 years ago |
huangyuxin
|
f428ec4431
|
change log of cli/asr/infer
|
3 years ago |
TianYuan
|
1b0c034134
|
update wavernn, test=tts
|
3 years ago |
TianYuan
|
89e69ee10e
|
[TTS]fix tacotron2 dygraph to static (#1414)
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
|
3 years ago |
huangyuxin
|
2a42421a63
|
cli add ds2-librispeech offline, fix version, test=asr
|
3 years ago |
Hui Zhang
|
4128f4d61f
|
fix __version__ error in develop (#1398)
|
3 years ago |