xiongxinlei
|
ac9fcf7f4a
|
fix the asr infernece model, paddle.no_grad, test=doc
|
3 years ago |
xiongxinlei
|
ff4ddd229e
|
fix the unuseful code, test=doc
|
3 years ago |
xiongxinlei
|
9c03280ca6
|
remove debug info, test=doc
|
3 years ago |
xiongxinlei
|
48fa84bee9
|
fix the asr online client bug, return None, test=doc
|
3 years ago |
qingen
|
00febff734
|
[vec][loss] update docstring, test=doc fix #1717
|
3 years ago |
xiongxinlei
|
babac27a79
|
fix ds2 online edge bug, test=doc
|
3 years ago |
liangym
|
ab656aab57
|
Merge pull request #1713 from lym0302/tts_stream
[server] add streaming am infer
|
3 years ago |
xiongxinlei
|
dcab04a799
|
merge develop to server
|
3 years ago |
xiongxinlei
|
f56dba0ca7
|
fix the code format, test=doc
|
3 years ago |
Honei
|
55122cfc86
|
Merge branch 'develop' into server
|
3 years ago |
TianYuan
|
7c0ec3c249
|
Merge pull request #1716 from yt605155624/update_cli
[CLI]update cli, test=doc
|
3 years ago |
xiongxinlei
|
380afbbc5d
|
add ds2 model multi session, test=doc
|
3 years ago |
qingen
|
166757703f
|
[vec][loss] add NCE Loss from RNNLM, test=doc fix #1717
|
3 years ago |
lym0302
|
9e41ac8550
|
code format, test=doc
|
3 years ago |
qingen
|
880829fe89
|
Merge pull request #1681 from qingen/cluster
[vec][score] add plda model
|
3 years ago |
TianYuan
|
a44f5c099e
|
update cli, test=doc
|
3 years ago |
lym0302
|
40dde22fc4
|
code format, test=doc
|
3 years ago |
huangyuxin
|
6e80618e3d
|
add ds2
|
3 years ago |
xiongxinlei
|
5acb0b5252
|
fix the websocket chunk edge bug, test=doc
|
3 years ago |
Hui Zhang
|
b78bc6375b
|
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
[TTS]add fastspeech2 cnndecoder onnx model
|
3 years ago |
xiongxinlei
|
05a8a4b5fc
|
add connection stability, test=doc
|
3 years ago |
lym0302
|
00a6236fe2
|
remove test code, test=doc
|
3 years ago |
lym0302
|
9c0ceaacb6
|
add streaming am infer, test=doc
|
3 years ago |
xiongxinlei
|
68731c61f4
|
add multi session result, test=doc
|
3 years ago |
xiongxinlei
|
10e825d9b2
|
check chunk window process, test=doc
|
3 years ago |
qingen
|
159d8fd628
|
Merge branch 'develop' into cluster
|
3 years ago |
xiongxinlei
|
d2640c1406
|
add mult sesssion process, test=doc
|
3 years ago |
TianYuan
|
dafe7c3657
|
add fastspeech2 cnndecoder onnx model, test=tts
|
3 years ago |
qingen
|
deb3ba070b
|
[vec] update mata info, test=doc
|
3 years ago |
xiongxinlei
|
97d31f9aac
|
update the attention_rescoring method, test=doc
|
3 years ago |
xiongxinlei
|
0c5dbbee5b
|
add conformer ctc prefix beam search decoding method, test=doc
|
3 years ago |
Honei
|
9d20a10b5a
|
Merge branch 'develop' into server
|
3 years ago |
Hui Zhang
|
0cde9f87ab
|
Merge pull request #1710 from Honei/deepspeech_server
[asr][websocket]fix the ws send bug, cache buffer, text=doc
|
3 years ago |
xiongxinlei
|
3ce4301665
|
add asr websocket server note, test=doc
|
3 years ago |
xiongxinlei
|
efc269b75f
|
remove unuseful code, test=doc
|
3 years ago |
xiongxinlei
|
89b102a7dd
|
fix the ws send bug, cache buffer, text=doc
|
3 years ago |
xiongxinlei
|
d21ccd0287
|
add conformer online server, test=doc
|
3 years ago |
Hui Zhang
|
c7b987c55d
|
format
|
3 years ago |
Hui Zhang
|
ec469179bf
|
Merge pull request #1696 from qingen/database-search
[vec][server] vpr demo support
|
3 years ago |
Hui Zhang
|
72933abc70
|
Merge pull request #1701 from WilliamZhang06/web
[WebSocket] added online asr doc and online asr command line, test=doc
|
3 years ago |
xiongxinlei
|
af484fc980
|
convert websockert results to str from bytest, test=doc
|
3 years ago |
WilliamZhang06
|
39895f6a25
|
added online asr doc and online asr command line, test=doc
|
3 years ago |
qingen
|
240520c0ca
|
Merge branch 'PaddlePaddle:develop' into cluster
|
3 years ago |
TianYuan
|
8bebf81199
|
[doc]fix typo, test=doc
|
3 years ago |
TianYuan
|
98f67870ea
|
Merge pull request #1693 from yt605155624/fix_ss_NHWC
[TTS]change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
buchongyu
|
48358055d0
|
修改hack 单词拼写错误
|
3 years ago |
qingen
|
d3f8715b0a
|
Merge branch 'database-search' of github.com:qingen/PaddleSpeech into database-search
|
3 years ago |
qingen
|
89a0ec9018
|
[vec][server] vpr demo support, test=doc fix #1695
|
3 years ago |
TianYuan
|
8b801ca18b
|
change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
WilliamZhang06
|
1dc02c7295
|
added online web client, test=doc
|
3 years ago |
Hui Zhang
|
1759116bd7
|
Revert "[WebSocket] fixed online model md5 error , test=doc"
|
3 years ago |
xiongxinlei
|
d1935d8552
|
add vector necessary note, test=doc
|
3 years ago |
lym0302
|
9d0224460b
|
code format, test=doc
|
3 years ago |
lym0302
|
4b111146dc
|
code format, test=doc
|
3 years ago |
qingen
|
0d8e2deb61
|
Merge branch 'PaddlePaddle:develop' into cluster
|
3 years ago |
Honei
|
48e0177767
|
Merge pull request #1630 from Honei/vox12
[vec]voxceleb convert dataset format to paddlespeech
|
3 years ago |
qingen
|
fc72295334
|
Merge pull request #1651 from ccrrong/ami
[vec] add speaker diarization pipeline
|
3 years ago |
xiongxinlei
|
4af007c3fc
|
fix vector ips log bug, test=doc
|
3 years ago |
lym0302
|
82992b3ed6
|
add test code, test=doc
|
3 years ago |
qingen
|
8d9bd9a93a
|
[vec][score] update Copyright, test=doc fix #1667
|
3 years ago |
xiongxinlei
|
567286add3
|
wrap the embedding mean and std norm, test=doc
|
3 years ago |
Hui Zhang
|
d65b63b28d
|
Merge pull request #1652 from lym0302/tts_stream
[server] add stream tts server
|
3 years ago |
qingen
|
44c6623448
|
[vec][score] update plda model, test=doc fix #1667
|
3 years ago |
ccrrong
|
bc53f726fe
|
convert dataset format to paddlespeech, test=doc
|
3 years ago |
Hui Zhang
|
2f97b81346
|
Merge pull request #1682 from WilliamZhang06/ws-develop
[WebSocket] fixed online model md5 error , test=doc
|
3 years ago |
root
|
9dacfb405f
|
fixed online model md5 error , test=doc
|
3 years ago |
qingen
|
6446f72cab
|
[vec][score] add plda model, test=doc fix #1667
|
3 years ago |
qingen
|
84576d6956
|
[vec][score] add plda model, test=doc fix #1667
|
3 years ago |
lym0302
|
1a3c811f04
|
code format, test=doc
|
3 years ago |
TianYuan
|
0d6f5868ea
|
Merge pull request #1665 from yt605155624/add_onnx
[TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan
|
3 years ago |
Honei
|
f500fa8bde
|
Merge pull request #1646 from Honei/develop
[vec]add speaker verification score method
|
3 years ago |
TianYuan
|
0282d45c62
|
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
|
3 years ago |
TianYuan
|
c765fca6b4
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
|
3 years ago |
TianYuan
|
124eb6af8f
|
update notes, test=doc
|
3 years ago |
TianYuan
|
e0d222e674
|
update notes, test=doc
|
3 years ago |
Hui Zhang
|
1843bed458
|
Merge pull request #1666 from Jackwaterveg/cli
[CLI] ASR: Add duration limitation for asr
|
3 years ago |
xiongxinlei
|
a8244dc5b0
|
update the note, test=doc
|
3 years ago |
Jackwaterveg
|
c852776bc6
|
test=doc
|
3 years ago |
TianYuan
|
f264b912fc
|
add warmup for frontend, test=doc
|
3 years ago |
Jackwaterveg
|
4922e697e1
|
update cli, test = asr
|
3 years ago |
Jackwaterveg
|
1c05d03806
|
test=asr
|
3 years ago |
xiongxinlei
|
9b5f7f71ac
|
add part ecapa-tdnn note, test=doc
|
3 years ago |
Hui Zhang
|
6eed542c08
|
Merge pull request #1660 from yt605155624/fix_pre
[TTS]fix preprocess bug, test=tts
|
3 years ago |
Honei
|
83310b6379
|
Merge branch 'develop' into develop
|
3 years ago |
huangyuxin
|
faf21f033f
|
add duration limitation for asr
|
3 years ago |
TianYuan
|
7aecb2c4bb
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
xiongxinlei
|
d064c8196e
|
update the speaker verification model, test=doc
|
3 years ago |
xiongxinlei
|
e72912adb9
|
update the speaker verification model, test=doc
|
3 years ago |
TianYuan
|
a8f5990869
|
fix preprocess bug, test=tts
|
3 years ago |
lym0302
|
759a9e61e4
|
update server cli, test=doc
|
3 years ago |
lym0302
|
603e565ab1
|
add stream tts server, test=doc
|
3 years ago |
ccrrong
|
378fe5909f
|
add ami diarization pipeline, test=doc
|
3 years ago |
xiongxinlei
|
48b8cc8937
|
add score method, test=doc
|
3 years ago |
xiongxinlei
|
ebfe3e6b13
|
test.py update the CSVDataset, test=doc
|
3 years ago |
xiongxinlei
|
acebfad7b7
|
change the vector csv.spk_id to csv.label, test=doc
|
3 years ago |
xiongxinlei
|
57c11dcab0
|
add some annotations, test=doc
|
3 years ago |
xiongxinlei
|
30b5b3cb9e
|
add vector csv dataset format, test=doc
|
3 years ago |
TianYuan
|
e366fb6b2f
|
Merge pull request #1643 from Jackwaterveg/check
[Doc] supplement note
|
3 years ago |
huangyuxin
|
ca860e3d2f
|
supplement note
|
3 years ago |
TianYuan
|
828ee14404
|
add license and reference for some models, test=doc
|
3 years ago |
xiongxinlei
|
5b05300e53
|
train process add new voxceleb and rirs dataset, test=doc
|
3 years ago |
xiongxinlei
|
965f486dd5
|
add voxceleb and rirs noise dataset
|
3 years ago |
Hui Zhang
|
36df70cbe6
|
Merge pull request #1638 from zh794390558/spx_refactor
[speechx] refactor audio/data/feature cache
|
3 years ago |
TianYuan
|
5bff096715
|
Merge pull request #1634 from yt605155624/cnn_decoder
[TTS]Cnn decoder
|
3 years ago |
TianYuan
|
3aec266ca5
|
add chunk size and pad size in args, test=doc
|
3 years ago |
Hui Zhang
|
cb39777a60
|
format code
|
3 years ago |
TianYuan
|
4d7cd0e063
|
add streaming synthesize, test=tts
|
3 years ago |
liangym
|
602b0b0da3
|
Merge pull request #1632 from lym0302/develop
[server] fix output bug
|
3 years ago |
Hui Zhang
|
61941d14b0
|
Merge pull request #1627 from WilliamZhang06/ws-develop
[websocket] added online asr engine
|
3 years ago |
WilliamZhang06
|
2ec8d608bf
|
fixed comments, test=doc
|
3 years ago |
liangym
|
21c4132eda
|
Update paddlespeech_client.py
|
3 years ago |
TianYuan
|
005aa4066c
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
|
3 years ago |
TianYuan
|
0fc79f474d
|
add CNNDecoder, test=tts
|
3 years ago |
WilliamZhang06
|
d847fe29cf
|
added online asr engine , test=doc
|
3 years ago |
TianYuan
|
318edec303
|
Merge pull request #1613 from yt605155624/restructure_expand
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
Hui Zhang
|
943d4ac1ee
|
Merge pull request #1612 from Jackwaterveg/update
[ASR] Replace kaidi_fbank with paddleaudio
|
3 years ago |
huangyuxin
|
f47146af49
|
add docstring, test=asr
|
3 years ago |
huangyuxin
|
ed490b66cb
|
update spectrogram, test=asr
|
3 years ago |
Hui Zhang
|
84d712d493
|
format code, test=doc
|
3 years ago |
Honei
|
d60856b1ed
|
Merge pull request #1614 from Honei/vox12
[vec]change the vector output to numpy.array
|
3 years ago |
xiongxinlei
|
ed7113f320
|
change the vector output to numpy.array
|
3 years ago |
TianYuan
|
bc5ae43d3a
|
restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
huangyuxin
|
0ffe1f9114
|
replace kaidi_fbank with paddleaudio
|
3 years ago |
Hui Zhang
|
caee809513
|
Merge pull request #1605 from Honei/vox12
[vec]add speaker verification demo and doc
|
3 years ago |
xiongxinlei
|
5ae57206f3
|
add paddlespeech vector modules __init__.py
|
3 years ago |
xiongxinlei
|
2c9dc0c89b
|
add some vector cli comments, test=doc
|
3 years ago |
xiongxinlei
|
ef1bc5e815
|
vector cli output dim info, test=doc
|
3 years ago |
xiongxinlei
|
d5142e5e15
|
add vector cli annotation, test=doc
|
3 years ago |
xiongxinlei
|
ad2caf2ccb
|
add speaker verification demo and doc, test=doc
|
3 years ago |
TianYuan
|
3cc0ec950e
|
Merge pull request #1604 from lym0302/add_readme
[server] update readme
|
3 years ago |
lym0302
|
829f1e332e
|
update readme, test=doc
|
3 years ago |
xiongxinlei
|
0f78d25f76
|
add vector cli batch and pipeline test demo, test=doc
|
3 years ago |
Honei
|
305bacdcf2
|
Merge branch 'develop' into vox12
|
3 years ago |
xiongxinlei
|
0bb67d8b8e
|
add vector cli unit test, test=doc
|
3 years ago |
KP
|
b6e976a860
|
Merge pull request #1602 from yt605155624/fix_dtype
[TTS]fix dtype of window of stft
|
3 years ago |
xiongxinlei
|
62cbce6915
|
add vectorwrapper to extract audio embedding
|
3 years ago |
TianYuan
|
8938483529
|
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
[TTS] update readme for ljspeech hifigan
|
3 years ago |
TianYuan
|
5347dbad3f
|
fix dtype of window of stft, test=tts
|
3 years ago |
TianYuan
|
342b487383
|
update readme for ljspeech hifigan, test=tts
|
3 years ago |
Hui Zhang
|
4051e7b762
|
fix compliance test bug, and format
|
3 years ago |
TianYuan
|
26ef47810d
|
Merge pull request #1593 from windstamp/npu_dev_20220322
[NPU] Add NPU support for TransformerTTS
|
3 years ago |
zhangkeliang
|
59b3de6a6d
|
[NPU] test TransformerTTS with NPU
|
3 years ago |
Jackwaterveg
|
fcc1762048
|
Merge pull request #1577 from Jackwaterveg/change_init
[ASR] change default initializer to kaiming_uniform
|
3 years ago |
huangyuxin
|
e1b581b622
|
fix some bug, test=asr
|
3 years ago |
Hui Zhang
|
b5315657ff
|
Merge pull request #1509 from qingen/cluster
[vec] add clustering of vectors
|
3 years ago |
huangyuxin
|
6da8465f14
|
add dist_sampler args, test=asr
|
3 years ago |
TianYuan
|
e5e8b8a129
|
Merge pull request #1587 from yt605155624/add_vctk_hifigan
[TTS]Add vctk hifigan
|
3 years ago |
TianYuan
|
6469568d2a
|
update readme for vctk hifigan, test=tts
|
3 years ago |
huangyuxin
|
a4f5a68074
|
fix some format, test=asr
|
3 years ago |
xiongxinlei
|
d85d1deef5
|
exec pre-commit in paddlespeech vector, test=doc
|
3 years ago |
xiongxinlei
|
9874fb7d75
|
add some comments in code
|
3 years ago |
huangyuxin
|
e991d82ae7
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
|
3 years ago |
huangyuxin
|
d53e1163a6
|
update the code, test=asr
|
3 years ago |
xiongxinlei
|
b9eafddd94
|
change - to _ to distinguish field
|
3 years ago |
xiongxinlei
|
9c6735f921
|
add vector voxceleb12 base mode url, test=doc
|
3 years ago |
xiongxinlei
|
d28ccfa96b
|
add vector cli component, test=doc
|
3 years ago |
KP
|
831cadacc7
|
Add paddleaudio doc.
|
3 years ago |
TianYuan
|
5ab2601759
|
update readme for aishell3 hifigan, test=tts
|
3 years ago |
Hui Zhang
|
6abc5d9f7e
|
format
|
3 years ago |
huangyuxin
|
ab16d8ce3c
|
change default initializer to kaiming_uniform, test=asr
|
3 years ago |
qingen
|
0f7ede11ef
|
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
|
3 years ago |
qingen
|
d16ce21d47
|
[wip][vec] update cluster of diarization, test=doc #1304
|
3 years ago |
xiongxinlei
|
506d26a957
|
change the code style to s2t code style, test=doc
|
3 years ago |
xiongxinlei
|
311fa87a11
|
add some comments to the code
|
3 years ago |
Hui Zhang
|
90deeca06f
|
Merge pull request #1554 from lym0302/develop
[server] add server cls
|
3 years ago |
lym0302
|
89457b273a
|
modify, test=doc
|
3 years ago |
xiongxinlei
|
8ed5c287a3
|
add vox2 data into VoxCeleb class
|
3 years ago |
lym0302
|
77bad44e8b
|
modify readme, test=doc
|
3 years ago |
lym0302
|
8ef92a9495
|
modify, test=doc
|
3 years ago |
lym0302
|
89dbda58f6
|
add cls static model, test=doc
|
3 years ago |
Hui Zhang
|
40ab05a462
|
Merge pull request #1552 from yt605155624/format_syn
[TTS]format synthesize
|
3 years ago |
lym0302
|
5187df847f
|
modify server demo, test=doc
|
3 years ago |
xiongxinlei
|
584a2c0e39
|
add ecapa-tdnn config yaml file
|
3 years ago |
lym0302
|
0a6602c708
|
modify application.yaml, test=doc
|
3 years ago |
TianYuan
|
544c372b50
|
fix cr, test=tts
|
3 years ago |
lym0302
|
99fa7a8205
|
add server cls, test=doc
|
3 years ago |
TianYuan
|
fe8bf2a38c
|
format synthesize, test=tts
|
3 years ago |
xiongxinlei
|
993d6783d7
|
remove unused code, test=doc
|
3 years ago |
xiongxinlei
|
0e87037f2c
|
refactor to compilance paddleaudio
|
3 years ago |
xiongxinlei
|
4473405f82
|
merge develop to vox12, test=doc
|
3 years ago |
Honei
|
0dee8f40e9
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
xiongxinlei
|
60d73bb7bd
|
add state 0 to prepare the voxcele data and augment data
|
3 years ago |
xiongxinlei
|
14efbf5b15
|
check extract embedding result, test=doc
|
3 years ago |
xiongxinlei
|
386ef3f161
|
add voxceleb augment unit test, test=doc
|
3 years ago |
Hui Zhang
|
5147163592
|
Merge pull request #1544 from yt605155624/add_vctk_hifigan
[tts]add vctk hifigan egs
|
3 years ago |
TianYuan
|
81d964f0a0
|
add vctk hifigan, test=tts
|
3 years ago |
xiongxinlei
|
2d89c80e6f
|
add waveform augment pipeline, test=doc
|
3 years ago |
lym0302
|
3b304544f6
|
modify yaml, test=doc
|
3 years ago |
xiongxinlei
|
ac4967e204
|
optimize the data prepare process
|
3 years ago |
xiongxinlei
|
016ed6d69c
|
repair the code according to the part comment, test=doc
|
3 years ago |
Hui Zhang
|
2886ab9373
|
Merge pull request #1530 from lym0302/server_cli
[server] add server test
|
3 years ago |
xiongxinlei
|
1f74af110b
|
add training log info and comment, test=doc
|
3 years ago |
lym0302
|
e50c1b3b1d
|
add server test, test=doc
|
3 years ago |
xiongxinlei
|
4648059b5f
|
add training process for sid, test=doc
|
3 years ago |
xiongxinlei
|
7668f61422
|
add sid dataloader for training, test=doc
|
3 years ago |
xiongxinlei
|
6af2bc3d5b
|
add sid loss wraper for voxceleb, test=doc
|
3 years ago |
xiongxinlei
|
57c4f4a68c
|
add sid learning rate and training model
|
3 years ago |
TianYuan
|
4d2f2191a8
|
fix gbk encode bug
|
3 years ago |
Honei
|
1395b5f5fa
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
TianYuan
|
175c39b4a4
|
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
|
3 years ago |