qingen | 6446f72cab | [vec][score] add plda model, test=doc fix #1667 | 3 years ago
qingen | 84576d6956 | [vec][score] add plda model, test=doc fix #1667 | 3 years ago
lym0302 | 1a3c811f04 | code format, test=doc | 3 years ago
TianYuan | 0d6f5868ea | Merge pull request #1665 from yt605155624/add_onnx: [TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan | 3 years ago
Honei | f500fa8bde | Merge pull request #1646 from Honei/develop: [vec]add speaker verification score method | 3 years ago
TianYuan | 0282d45c62 | remove fill_constant_batch_size_like in static model of speedyspeech, test=tts | 3 years ago
TianYuan | c765fca6b4 | Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx | 3 years ago
TianYuan | 124eb6af8f | update notes, test=doc | 3 years ago
TianYuan | e0d222e674 | update notes, test=doc | 3 years ago
Hui Zhang | 1843bed458 | Merge pull request #1666 from Jackwaterveg/cli: [CLI] ASR: Add duration limitation for asr | 3 years ago
xiongxinlei | a8244dc5b0 | update the note, test=doc | 3 years ago
Jackwaterveg | c852776bc6 | test=doc | 3 years ago
TianYuan | f264b912fc | add warmup for frontend, test=doc | 3 years ago
Jackwaterveg | 4922e697e1 | update cli, test = asr | 3 years ago
Jackwaterveg | 1c05d03806 | test=asr | 3 years ago
xiongxinlei | 9b5f7f71ac | add part ecapa-tdnn note, test=doc | 3 years ago
Hui Zhang | 6eed542c08 | Merge pull request #1660 from yt605155624/fix_pre: [TTS]fix preprocess bug, test=tts | 3 years ago
Honei | 83310b6379 | Merge branch 'develop' into develop | 3 years ago
huangyuxin | faf21f033f | add duration limitation for asr | 3 years ago
TianYuan | 7aecb2c4bb | add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts | 3 years ago
xiongxinlei | d064c8196e | update the speaker verification model, test=doc | 3 years ago
xiongxinlei | e72912adb9 | update the speaker verification model, test=doc | 3 years ago
TianYuan | a8f5990869 | fix preprocess bug, test=tts | 3 years ago
lym0302 | 759a9e61e4 | update server cli, test=doc | 3 years ago
lym0302 | 603e565ab1 | add stream tts server, test=doc | 3 years ago
ccrrong | 378fe5909f | add ami diarization pipeline, test=doc | 3 years ago
xiongxinlei | 48b8cc8937 | add score method, test=doc | 3 years ago
xiongxinlei | ebfe3e6b13 | test.py update the CSVDataset, test=doc | 3 years ago
xiongxinlei | acebfad7b7 | change the vector csv.spk_id to csv.label, test=doc | 3 years ago
xiongxinlei | 57c11dcab0 | add some annotations, test=doc | 3 years ago
xiongxinlei | 30b5b3cb9e | add vector csv dataset format, test=doc | 3 years ago
TianYuan | e366fb6b2f | Merge pull request #1643 from Jackwaterveg/check: [Doc] supplement note | 3 years ago
huangyuxin | ca860e3d2f | supplement note | 3 years ago
TianYuan | 828ee14404 | add license and reference for some models, test=doc | 3 years ago
xiongxinlei | 5b05300e53 | train process add new voxceleb and rirs dataset, test=doc | 3 years ago
xiongxinlei | 965f486dd5 | add voxceleb and rirs noise dataset | 3 years ago
Hui Zhang | 36df70cbe6 | Merge pull request #1638 from zh794390558/spx_refactor: [speechx] refactor audio/data/feature cache | 3 years ago
TianYuan | 5bff096715 | Merge pull request #1634 from yt605155624/cnn_decoder: [TTS]Cnn decoder | 3 years ago
TianYuan | 3aec266ca5 | add chunk size and pad size in args, test=doc | 3 years ago
Hui Zhang | cb39777a60 | format code | 3 years ago
TianYuan | 4d7cd0e063 | add streaming synthesize, test=tts | 3 years ago
liangym | 602b0b0da3 | Merge pull request #1632 from lym0302/develop: [server] fix output bug | 3 years ago
Hui Zhang | 61941d14b0 | Merge pull request #1627 from WilliamZhang06/ws-develop: [websocket] added online asr engine | 3 years ago
WilliamZhang06 | 2ec8d608bf | fixed comments, test=doc | 3 years ago
liangym | 21c4132eda | Update paddlespeech_client.py | 3 years ago
TianYuan | 005aa4066c | Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder | 3 years ago
TianYuan | 0fc79f474d | add CNNDecoder, test=tts | 3 years ago
WilliamZhang06 | d847fe29cf | added online asr engine , test=doc | 3 years ago
TianYuan | 318edec303 | Merge pull request #1613 from yt605155624/restructure_expand: [tts]restructure expand in length_regulator.py for paddle2onnx, test=tts | 3 years ago
Hui Zhang | 943d4ac1ee | Merge pull request #1612 from Jackwaterveg/update: [ASR] Replace kaidi_fbank with paddleaudio | 3 years ago
huangyuxin | f47146af49 | add docstring, test=asr | 3 years ago
huangyuxin | ed490b66cb | update spectrogram, test=asr | 3 years ago
Hui Zhang | 84d712d493 | format code, test=doc | 3 years ago
Honei | d60856b1ed | Merge pull request #1614 from Honei/vox12: [vec]change the vector output to numpy.array | 3 years ago
xiongxinlei | ed7113f320 | change the vector output to numpy.array | 3 years ago
TianYuan | bc5ae43d3a | restructure expand in length_regulator.py for paddle2onnx, test=tts | 3 years ago
huangyuxin | 0ffe1f9114 | replace kaidi_fbank with paddleaudio | 3 years ago
Hui Zhang | caee809513 | Merge pull request #1605 from Honei/vox12: [vec]add speaker verification demo and doc | 3 years ago
xiongxinlei | 5ae57206f3 | add paddlespeech vector modules __init__.py | 3 years ago
xiongxinlei | 2c9dc0c89b | add some vector cli comments, test=doc | 3 years ago
xiongxinlei | ef1bc5e815 | vector cli output dim info, test=doc | 3 years ago
xiongxinlei | d5142e5e15 | add vector cli annotation, test=doc | 3 years ago
xiongxinlei | ad2caf2ccb | add speaker verification demo and doc, test=doc | 3 years ago
TianYuan | 3cc0ec950e | Merge pull request #1604 from lym0302/add_readme: [server] update readme | 3 years ago
lym0302 | 829f1e332e | update readme, test=doc | 3 years ago
xiongxinlei | 0f78d25f76 | add vector cli batch and pipeline test demo, test=doc | 3 years ago
Honei | 305bacdcf2 | Merge branch 'develop' into vox12 | 3 years ago
xiongxinlei | 0bb67d8b8e | add vector cli unit test, test=doc | 3 years ago
KP | b6e976a860 | Merge pull request #1602 from yt605155624/fix_dtype: [TTS]fix dtype of window of stft | 3 years ago
xiongxinlei | 62cbce6915 | add vectorwrapper to extract audio embedding | 3 years ago
TianYuan | 8938483529 | Merge pull request #1601 from yt605155624/add_ljspeech_hifigan: [TTS] update readme for ljspeech hifigan | 3 years ago
TianYuan | 5347dbad3f | fix dtype of window of stft, test=tts | 3 years ago
TianYuan | 342b487383 | update readme for ljspeech hifigan, test=tts | 3 years ago
Hui Zhang | 4051e7b762 | fix compliance test bug, and format | 3 years ago
TianYuan | 26ef47810d | Merge pull request #1593 from windstamp/npu_dev_20220322: [NPU] Add NPU support for TransformerTTS | 3 years ago
zhangkeliang | 59b3de6a6d | [NPU] test TransformerTTS with NPU | 3 years ago
Jackwaterveg | fcc1762048 | Merge pull request #1577 from Jackwaterveg/change_init: [ASR] change default initializer to kaiming_uniform | 3 years ago
huangyuxin | e1b581b622 | fix some bug, test=asr | 3 years ago
Hui Zhang | b5315657ff | Merge pull request #1509 from qingen/cluster: [vec] add clustering of vectors | 3 years ago
huangyuxin | 6da8465f14 | add dist_sampler args, test=asr | 3 years ago
TianYuan | e5e8b8a129 | Merge pull request #1587 from yt605155624/add_vctk_hifigan: [TTS]Add vctk hifigan | 3 years ago
TianYuan | 6469568d2a | update readme for vctk hifigan, test=tts | 3 years ago
huangyuxin | a4f5a68074 | fix some format, test=asr | 3 years ago
xiongxinlei | d85d1deef5 | exec pre-commit in paddlespeech vector, test=doc | 3 years ago
xiongxinlei | 9874fb7d75 | add some comments in code | 3 years ago
huangyuxin | e991d82ae7 | Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init | 3 years ago
huangyuxin | d53e1163a6 | update the code, test=asr | 3 years ago
xiongxinlei | b9eafddd94 | change - to _ to distinguish field | 3 years ago
xiongxinlei | 9c6735f921 | add vector voxceleb12 base mode url, test=doc | 3 years ago
xiongxinlei | d28ccfa96b | add vector cli component, test=doc | 3 years ago
KP | 831cadacc7 | Add paddleaudio doc. | 3 years ago
TianYuan | 5ab2601759 | update readme for aishell3 hifigan, test=tts | 3 years ago
Hui Zhang | 6abc5d9f7e | format | 3 years ago
huangyuxin | ab16d8ce3c | change default initializer to kaiming_uniform, test=asr | 3 years ago
qingen | 0f7ede11ef | Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster | 3 years ago
qingen | d16ce21d47 | [wip][vec] update cluster of diarization, test=doc #1304 | 3 years ago
xiongxinlei | 506d26a957 | change the code style to s2t code style, test=doc | 3 years ago
xiongxinlei | 311fa87a11 | add some comments to the code | 3 years ago
Hui Zhang | 90deeca06f | Merge pull request #1554 from lym0302/develop: [server] add server cls | 3 years ago
lym0302 | 89457b273a | modify, test=doc | 3 years ago
xiongxinlei | 8ed5c287a3 | add vox2 data into VoxCeleb class | 3 years ago
lym0302 | 77bad44e8b | modify readme, test=doc | 3 years ago
lym0302 | 8ef92a9495 | modify, test=doc | 3 years ago
lym0302 | 89dbda58f6 | add cls static model, test=doc | 3 years ago
Hui Zhang | 40ab05a462 | Merge pull request #1552 from yt605155624/format_syn: [TTS]format synthesize | 3 years ago
lym0302 | 5187df847f | modify server demo, test=doc | 3 years ago
xiongxinlei | 584a2c0e39 | add ecapa-tdnn config yaml file | 3 years ago
lym0302 | 0a6602c708 | modify application.yaml, test=doc | 3 years ago
TianYuan | 544c372b50 | fix cr, test=tts | 3 years ago
lym0302 | 99fa7a8205 | add server cls, test=doc | 3 years ago
TianYuan | fe8bf2a38c | format synthesize, test=tts | 3 years ago
xiongxinlei | 993d6783d7 | remove unused code, test=doc | 3 years ago
xiongxinlei | 0e87037f2c | refactor to compilance paddleaudio | 3 years ago
xiongxinlei | 4473405f82 | merge develop to vox12, test=doc | 3 years ago
Honei | 0dee8f40e9 | Merge branch 'PaddlePaddle:develop' into develop | 3 years ago
xiongxinlei | 60d73bb7bd | add state 0 to prepare the voxcele data and augment data | 3 years ago
xiongxinlei | 14efbf5b15 | check extract embedding result, test=doc | 3 years ago
xiongxinlei | 386ef3f161 | add voxceleb augment unit test, test=doc | 3 years ago
Hui Zhang | 5147163592 | Merge pull request #1544 from yt605155624/add_vctk_hifigan: [tts]add vctk hifigan egs | 3 years ago
TianYuan | 81d964f0a0 | add vctk hifigan, test=tts | 3 years ago
xiongxinlei | 2d89c80e6f | add waveform augment pipeline, test=doc | 3 years ago
lym0302 | 3b304544f6 | modify yaml, test=doc | 3 years ago
xiongxinlei | ac4967e204 | optimize the data prepare process | 3 years ago
xiongxinlei | 016ed6d69c | repair the code according to the part comment, test=doc | 3 years ago
Hui Zhang | 2886ab9373 | Merge pull request #1530 from lym0302/server_cli: [server] add server test | 3 years ago
xiongxinlei | 1f74af110b | add training log info and comment, test=doc | 3 years ago
lym0302 | e50c1b3b1d | add server test, test=doc | 3 years ago
xiongxinlei | 4648059b5f | add training process for sid, test=doc | 3 years ago
xiongxinlei | 7668f61422 | add sid dataloader for training, test=doc | 3 years ago
xiongxinlei | 6af2bc3d5b | add sid loss wraper for voxceleb, test=doc | 3 years ago
xiongxinlei | 57c4f4a68c | add sid learning rate and training model | 3 years ago
TianYuan | 4d2f2191a8 | fix gbk encode bug | 3 years ago
Honei | 1395b5f5fa | Merge branch 'PaddlePaddle:develop' into develop | 3 years ago
TianYuan | 175c39b4a4 | Merge pull request #1511 from yt605155624/pre_fix_for_streaming: [TTS]add rtf for synthesize, add more vocoder for synthesize.sh | 3 years ago
Hui Zhang | 5ba4907c44 | Merge pull request #1514 from lym0302/server_cli: [server] update server cli | 3 years ago
lym0302 | 85d4a31e04 | update application.yaml, test=doc | 3 years ago
Jerryuhoo | c116a3a926 | fix Speedyspeech multi-speaker inference, test=tts | 3 years ago
lym0302 | ab04488738 | update server cli, test=doc | 3 years ago
TianYuan | cb07bd2a94 | add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts | 3 years ago
Hui Zhang | 26d413ce8f | Merge pull request #1510 from lym0302/paddlespeech_stats: [server] add paddlespeech_server stats | 3 years ago
lym0302 | 72c0cda30c | add paddlespeech_server stats, test=doc | 3 years ago
Hui Zhang | e8f2d8f11b | Merge pull request #1507 from zh794390558/cli: [cli] add cli batch/pipe example to readme | 3 years ago
Hui Zhang | 2517df92a0 | Merge pull request #1508 from lym0302/paddlespeech_stats: [CLI] modified text sr to lang | 3 years ago
TianYuan | b6d33a7fb4 | Merge pull request #1506 from yt605155624/fix_frontend: [TTS]update text frontend, test=tts | 3 years ago
lym0302 | 395c923dee | modified text sr to lang, test=doc | 3 years ago
Hui Zhang | 75098698d8 | format,test=doc | 3 years ago
TianYuan | 66a8beb27f | update text frontend, test=tts | 3 years ago
lym0302 | 96abb33b5b | add __call__, test=doc | 3 years ago
lym0302 | 5f1728f855 | rm server related, test=doc | 3 years ago
xiongxinlei | 70d3b01c0d | remove invalid code | 3 years ago