Jerryuhoo
|
fba0693a20
|
fix random speaker embedding bug, test=tts
|
3 years ago |
Hui Zhang
|
cdb9a1b20b
|
Merge pull request #1813 from Honei/v0.3
[R1.0]update the paddlespeech_client asr_online cli
|
3 years ago |
Honei
|
ff7dbcc2de
|
Merge branch 'develop' into v0.3
|
3 years ago |
xiongxinlei
|
f7af037cb1
|
add the note for offline asr, test=doc
|
3 years ago |
xiongxinlei
|
3f80464926
|
update the streaming asr readme, test=doc
|
3 years ago |
Hui Zhang
|
fc96130fdc
|
fix speechx core dump when stop immediately after start
|
3 years ago |
xiongxinlei
|
c5fe181405
|
update the paddlespeech_client asr_online cli, test=doc
|
3 years ago |
huangyuxin
|
4494f5a1fc
|
add cli models, test=doc
|
3 years ago |
Hui Zhang
|
903cc67a4d
|
Merge pull request #1801 from Honei/v0.3
[R1.0]update the streaming output and punc default ip, port
|
3 years ago |
xiongxinlei
|
e844e0e0bb
|
update the streaming output and punc default ip, port, test=doc
|
3 years ago |
huangyuxin
|
18197cd3a5
|
renew ds2 model, test=doc
|
3 years ago |
Hui Zhang
|
ebde26030b
|
patch func to var
|
3 years ago |
Honei
|
f72cbc9b6d
|
Merge branch 'develop' into v0.3
|
3 years ago |
xiongxinlei
|
9125cb076d
|
update the ws asr response, final_result to result, test=doc
|
3 years ago |
xiongxinlei
|
7007b0ecac
|
update the asr server api, test=doc
|
3 years ago |
Hui Zhang
|
5e23025c31
|
fix speechx ws server to return dummpy partial result, fix hang for ws client
|
3 years ago |
Hui Zhang
|
d7c8c1779f
|
Merge pull request #1786 from Jackwaterveg/debug
[CLI] renew ds2 online model
|
3 years ago |
Hui Zhang
|
9cc7662512
|
Merge pull request #1782 from lym0302/add_streaming_cli
[server] update readme
|
3 years ago |
huangyuxin
|
e145b26355
|
fix
|
3 years ago |
huangyuxin
|
4f9e8bfa90
|
renew ds2 online, test=doc
|
3 years ago |
xiongxinlei
|
833900a8b4
|
asr client add punctuatjion server, test=doc
|
3 years ago |
KP
|
abb15ac6e8
|
Update KWS example.
|
3 years ago |
lym0302
|
651012616a
|
add info, test=doc
|
3 years ago |
Hui Zhang
|
33ca17359f
|
Merge pull request #1776 from Jackwaterveg/ds2
[ASR] Replace fbank
|
3 years ago |
huangyuxin
|
0df8d80833
|
remove logfbank from python_speech_features, test=asr
|
3 years ago |
Honei
|
119143d0f1
|
Merge pull request #1731 from qingen/cluster
[vec] add GE2E to support unlabeled data training
|
3 years ago |
huangyuxin
|
fcdaef6cb4
|
replace fbank, test=asr
|
3 years ago |
Hui Zhang
|
f11855415c
|
Merge pull request #1770 from Jackwaterveg/cli
[ASR] fix asr cli infer
|
3 years ago |
Hui Zhang
|
3b0004345c
|
Merge pull request #1772 from Honei/v0.3
[text][server]add text punc server
|
3 years ago |
Hui Zhang
|
962a278996
|
Merge pull request #1558 from KPatr1ck/kws
[KWS]Add kws example on HeySnips dataset.
|
3 years ago |
liangym
|
0de4d25ab8
|
Merge pull request #1774 from lym0302/add_streaming_cli
[server] fix tts streaming server
|
3 years ago |
huangyuxin
|
1e999c27e9
|
fix exit, test=doc
|
3 years ago |
lym0302
|
dc52c313fa
|
fix code, test=doc
|
3 years ago |
lym0302
|
c6e6210964
|
code format, test=tts
|
3 years ago |
xiongxinlei
|
9e50448039
|
update the punc text model, text=doc
|
3 years ago |
lym0302
|
88adcaa6dc
|
fix code, test=doc
|
3 years ago |
TianYuan
|
f256bb9c0e
|
Merge pull request #1771 from lym0302/add_streaming_cli
[server] add streaming tts demos
|
3 years ago |
KP
|
caa8eb4d0d
|
Add KWS example.
|
3 years ago |
KP
|
f9761d532c
|
Add KWS example.
|
3 years ago |
KP
|
b60b1dadde
|
Add KWS example.
|
3 years ago |
KP
|
e01abc5099
|
Add KWS example.
|
3 years ago |
KP
|
521e222db8
|
Add mdtc model.
|
3 years ago |
xiongxinlei
|
2fa1522bdd
|
update the punc yaml to application.yaml, test=doc
|
3 years ago |
xiongxinlei
|
ba62b85e9b
|
add text punc server, test=doc
|
3 years ago |
lym0302
|
c00c31594c
|
updata readme, test=doc
|
3 years ago |
lym0302
|
70424e1ef9
|
add streaming tts demos, test=doc
|
3 years ago |
huangyuxin
|
c21c3d220d
|
fix infer, test=doc
|
3 years ago |
Hui Zhang
|
312fc4e11e
|
Merge pull request #1766 from Jackwaterveg/fix
[ASR] ds2: add log_interval and fix lr problem when resume training
|
3 years ago |
TianYuan
|
0b1b573a3f
|
Merge pull request #1767 from Jackwaterveg/cli
[CLI] Add conformer_aishell, conformer_online_aishell
|
3 years ago |
huangyuxin
|
ad4e04fc82
|
add conformer_online_aishell, test=doc
|
3 years ago |
huangyuxin
|
12d2f6ea95
|
fix conformer_aishell of cli, test=doc
|
3 years ago |
huangyuxin
|
5912ba53e4
|
fix log_interval and lr when resume training, test=asr
|
3 years ago |
Hui Zhang
|
91e24b0480
|
format code
|
3 years ago |
xiongxinlei
|
13a37b4892
|
update the online protocal note, test=doc
|
3 years ago |
xiongxinlei
|
2f2cb7eaaf
|
update the audio handler note, test=doc
|
3 years ago |
xiongxinlei
|
7aa5a3df2b
|
fix the streaming asr server bug, server client, test=doc
|
3 years ago |
huangyuxin
|
19998ea29b
|
add aishell conformer, test=doc
|
3 years ago |
TianYuan
|
24f0a7d44b
|
Merge pull request #1733 from lym0302/tts_stream
[server] add onnx tts engine
|
3 years ago |
TianYuan
|
9121dfc046
|
Merge pull request #1752 from yt605155624/fix_wavernn
[TTS]fix wavernn white noise bug for paddle develop(2.3)
|
3 years ago |
TianYuan
|
08a4673355
|
fix wavernn bug, test=tts
|
3 years ago |
Jackwaterveg
|
85b50c4700
|
Merge pull request #1741 from Jackwaterveg/debug
[ASR] remove redundant log
|
3 years ago |
lym0302
|
104c7ff27d
|
code format, test=doc
|
3 years ago |
lym0302
|
e398fe9c74
|
remove code, test=doc
|
3 years ago |
Hui Zhang
|
3561875dd0
|
Merge branch 'develop' into fix
|
3 years ago |
Hui Zhang
|
c7d9b11529
|
format
|
3 years ago |
huangyuxin
|
8e37a7c7f0
|
remove redundant log, test=doc
|
3 years ago |
xiongxinlei
|
56751a1ed5
|
update the server device to paddle.device, test=doc
|
3 years ago |
xiongxinlei
|
4b76a01c85
|
update en readme.md, test=doc
|
3 years ago |
xiongxinlei
|
1a0c2bea5d
|
add streaming asr demo, test=doc
|
3 years ago |
lym0302
|
4e9db4ff71
|
add onnx tts engine, test=doc
|
3 years ago |
Jackwaterveg
|
8d1ee8262e
|
Merge branch 'develop' into CER
|
3 years ago |
qingen
|
e98845d778
|
[vec][loss] add GE2E to support unlabeled data training, test=doc fix #1730
|
3 years ago |
qingen
|
0186f522af
|
Merge pull request #1725 from qingen/database-search
[vec] add GRL to domain adaptation
|
3 years ago |
TianYuan
|
e089268642
|
Merge pull request #1727 from yt605155624/refactor_syn_util
[TTS]add paddle device set for ort and inference
|
3 years ago |
TianYuan
|
4646f7cc8d
|
add paddle device set for ort and inference, test=doc
|
3 years ago |
Hui Zhang
|
523d5bd6d4
|
Merge pull request #1723 from yt605155624/refactor_syn_util
[TTS]restructure syn_utils.py, test=tts
|
3 years ago |
qingen
|
7e8f9f5336
|
[vec][layer] add GRL to domain adaptation, test=doc fix #1724
|
3 years ago |
TianYuan
|
c74fa9ada8
|
restructure syn_utils.py, test=tts
|
3 years ago |
qingen
|
26d5dded7c
|
Merge branch 'PaddlePaddle:develop' into cluster
|
3 years ago |
qingen
|
6a7245657f
|
[vec][loss] add FocalLoss to deal with class imbalances, test=doc fix #1721
|
3 years ago |
qingen
|
9382ad8a16
|
Merge pull request #1719 from qingen/cluster
[vec][loss] add NCE Loss from RNNLM
|
3 years ago |
Hui Zhang
|
cf9a590fa5
|
Merge pull request #1704 from Honei/server
[asr][websocket] add asr conformer websocket server
|
3 years ago |
xiongxinlei
|
ac9fcf7f4a
|
fix the asr infernece model, paddle.no_grad, test=doc
|
3 years ago |
xiongxinlei
|
ff4ddd229e
|
fix the unuseful code, test=doc
|
3 years ago |
xiongxinlei
|
9c03280ca6
|
remove debug info, test=doc
|
3 years ago |
xiongxinlei
|
48fa84bee9
|
fix the asr online client bug, return None, test=doc
|
3 years ago |
qingen
|
00febff734
|
[vec][loss] update docstring, test=doc fix #1717
|
3 years ago |
xiongxinlei
|
babac27a79
|
fix ds2 online edge bug, test=doc
|
3 years ago |
liangym
|
ab656aab57
|
Merge pull request #1713 from lym0302/tts_stream
[server] add streaming am infer
|
3 years ago |
xiongxinlei
|
dcab04a799
|
merge develop to server
|
3 years ago |
xiongxinlei
|
f56dba0ca7
|
fix the code format, test=doc
|
3 years ago |
Honei
|
55122cfc86
|
Merge branch 'develop' into server
|
3 years ago |
TianYuan
|
7c0ec3c249
|
Merge pull request #1716 from yt605155624/update_cli
[CLI]update cli, test=doc
|
3 years ago |
xiongxinlei
|
380afbbc5d
|
add ds2 model multi session, test=doc
|
3 years ago |
qingen
|
166757703f
|
[vec][loss] add NCE Loss from RNNLM, test=doc fix #1717
|
3 years ago |
lym0302
|
9e41ac8550
|
code format, test=doc
|
3 years ago |
qingen
|
880829fe89
|
Merge pull request #1681 from qingen/cluster
[vec][score] add plda model
|
3 years ago |
TianYuan
|
a44f5c099e
|
update cli, test=doc
|
3 years ago |
lym0302
|
40dde22fc4
|
code format, test=doc
|
3 years ago |
huangyuxin
|
6e80618e3d
|
add ds2
|
3 years ago |
xiongxinlei
|
5acb0b5252
|
fix the websocket chunk edge bug, test=doc
|
3 years ago |
Hui Zhang
|
b78bc6375b
|
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
[TTS]add fastspeech2 cnndecoder onnx model
|
3 years ago |
xiongxinlei
|
05a8a4b5fc
|
add connection stability, test=doc
|
3 years ago |
lym0302
|
00a6236fe2
|
remove test code, test=doc
|
3 years ago |
lym0302
|
9c0ceaacb6
|
add streaming am infer, test=doc
|
3 years ago |
xiongxinlei
|
68731c61f4
|
add multi session result, test=doc
|
3 years ago |
xiongxinlei
|
10e825d9b2
|
check chunk window process, test=doc
|
3 years ago |
qingen
|
159d8fd628
|
Merge branch 'develop' into cluster
|
3 years ago |
xiongxinlei
|
d2640c1406
|
add mult sesssion process, test=doc
|
3 years ago |
TianYuan
|
dafe7c3657
|
add fastspeech2 cnndecoder onnx model, test=tts
|
3 years ago |
qingen
|
deb3ba070b
|
[vec] update mata info, test=doc
|
3 years ago |
xiongxinlei
|
97d31f9aac
|
update the attention_rescoring method, test=doc
|
3 years ago |
xiongxinlei
|
0c5dbbee5b
|
add conformer ctc prefix beam search decoding method, test=doc
|
3 years ago |
Honei
|
9d20a10b5a
|
Merge branch 'develop' into server
|
3 years ago |
Hui Zhang
|
0cde9f87ab
|
Merge pull request #1710 from Honei/deepspeech_server
[asr][websocket]fix the ws send bug, cache buffer, text=doc
|
3 years ago |
xiongxinlei
|
3ce4301665
|
add asr websocket server note, test=doc
|
3 years ago |
xiongxinlei
|
efc269b75f
|
remove unuseful code, test=doc
|
3 years ago |
xiongxinlei
|
89b102a7dd
|
fix the ws send bug, cache buffer, text=doc
|
3 years ago |
xiongxinlei
|
d21ccd0287
|
add conformer online server, test=doc
|
3 years ago |
Hui Zhang
|
c7b987c55d
|
format
|
3 years ago |
Hui Zhang
|
ec469179bf
|
Merge pull request #1696 from qingen/database-search
[vec][server] vpr demo support
|
3 years ago |
Hui Zhang
|
72933abc70
|
Merge pull request #1701 from WilliamZhang06/web
[WebSocket] added online asr doc and online asr command line, test=doc
|
3 years ago |
xiongxinlei
|
af484fc980
|
convert websockert results to str from bytest, test=doc
|
3 years ago |
WilliamZhang06
|
39895f6a25
|
added online asr doc and online asr command line, test=doc
|
3 years ago |
qingen
|
240520c0ca
|
Merge branch 'PaddlePaddle:develop' into cluster
|
3 years ago |
TianYuan
|
8bebf81199
|
[doc]fix typo, test=doc
|
3 years ago |
TianYuan
|
98f67870ea
|
Merge pull request #1693 from yt605155624/fix_ss_NHWC
[TTS]change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
buchongyu
|
48358055d0
|
修改hack 单词拼写错误
|
3 years ago |
qingen
|
d3f8715b0a
|
Merge branch 'database-search' of github.com:qingen/PaddleSpeech into database-search
|
3 years ago |
qingen
|
89a0ec9018
|
[vec][server] vpr demo support, test=doc fix #1695
|
3 years ago |
TianYuan
|
8b801ca18b
|
change NLC to NCL in speedyspeech, test=tts
|
3 years ago |
WilliamZhang06
|
1dc02c7295
|
added online web client, test=doc
|
3 years ago |
Hui Zhang
|
1759116bd7
|
Revert "[WebSocket] fixed online model md5 error , test=doc"
|
3 years ago |
xiongxinlei
|
d1935d8552
|
add vector necessary note, test=doc
|
3 years ago |
lym0302
|
9d0224460b
|
code format, test=doc
|
3 years ago |
lym0302
|
4b111146dc
|
code format, test=doc
|
3 years ago |
qingen
|
0d8e2deb61
|
Merge branch 'PaddlePaddle:develop' into cluster
|
3 years ago |
Honei
|
48e0177767
|
Merge pull request #1630 from Honei/vox12
[vec]voxceleb convert dataset format to paddlespeech
|
3 years ago |
qingen
|
fc72295334
|
Merge pull request #1651 from ccrrong/ami
[vec] add speaker diarization pipeline
|
3 years ago |
xiongxinlei
|
4af007c3fc
|
fix vector ips log bug, test=doc
|
3 years ago |
lym0302
|
82992b3ed6
|
add test code, test=doc
|
3 years ago |
qingen
|
8d9bd9a93a
|
[vec][score] update Copyright, test=doc fix #1667
|
3 years ago |
xiongxinlei
|
567286add3
|
wrap the embedding mean and std norm, test=doc
|
3 years ago |
Hui Zhang
|
d65b63b28d
|
Merge pull request #1652 from lym0302/tts_stream
[server] add stream tts server
|
3 years ago |
qingen
|
44c6623448
|
[vec][score] update plda model, test=doc fix #1667
|
3 years ago |
ccrrong
|
bc53f726fe
|
convert dataset format to paddlespeech, test=doc
|
3 years ago |
Hui Zhang
|
2f97b81346
|
Merge pull request #1682 from WilliamZhang06/ws-develop
[WebSocket] fixed online model md5 error , test=doc
|
3 years ago |
root
|
9dacfb405f
|
fixed online model md5 error , test=doc
|
3 years ago |
qingen
|
6446f72cab
|
[vec][score] add plda model, test=doc fix #1667
|
3 years ago |
qingen
|
84576d6956
|
[vec][score] add plda model, test=doc fix #1667
|
3 years ago |
lym0302
|
1a3c811f04
|
code format, test=doc
|
3 years ago |
TianYuan
|
0d6f5868ea
|
Merge pull request #1665 from yt605155624/add_onnx
[TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan
|
3 years ago |
Honei
|
f500fa8bde
|
Merge pull request #1646 from Honei/develop
[vec]add speaker verification score method
|
3 years ago |
TianYuan
|
0282d45c62
|
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
|
3 years ago |
TianYuan
|
c765fca6b4
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
|
3 years ago |
TianYuan
|
124eb6af8f
|
update notes, test=doc
|
3 years ago |
TianYuan
|
e0d222e674
|
update notes, test=doc
|
3 years ago |
Hui Zhang
|
1843bed458
|
Merge pull request #1666 from Jackwaterveg/cli
[CLI] ASR: Add duration limitation for asr
|
3 years ago |
xiongxinlei
|
a8244dc5b0
|
update the note, test=doc
|
3 years ago |
Jackwaterveg
|
c852776bc6
|
test=doc
|
3 years ago |
TianYuan
|
f264b912fc
|
add warmup for frontend, test=doc
|
3 years ago |
Jackwaterveg
|
4922e697e1
|
update cli, test = asr
|
3 years ago |
Jackwaterveg
|
1c05d03806
|
test=asr
|
3 years ago |
xiongxinlei
|
9b5f7f71ac
|
add part ecapa-tdnn note, test=doc
|
3 years ago |
Hui Zhang
|
6eed542c08
|
Merge pull request #1660 from yt605155624/fix_pre
[TTS]fix preprocess bug, test=tts
|
3 years ago |
Honei
|
83310b6379
|
Merge branch 'develop' into develop
|
3 years ago |
huangyuxin
|
faf21f033f
|
add duration limitation for asr
|
3 years ago |
TianYuan
|
7aecb2c4bb
|
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
|
3 years ago |
xiongxinlei
|
d064c8196e
|
update the speaker verification model, test=doc
|
3 years ago |
xiongxinlei
|
e72912adb9
|
update the speaker verification model, test=doc
|
3 years ago |
TianYuan
|
a8f5990869
|
fix preprocess bug, test=tts
|
3 years ago |
lym0302
|
759a9e61e4
|
update server cli, test=doc
|
3 years ago |
lym0302
|
603e565ab1
|
add stream tts server, test=doc
|
3 years ago |
ccrrong
|
378fe5909f
|
add ami diarization pipeline, test=doc
|
3 years ago |
xiongxinlei
|
48b8cc8937
|
add score method, test=doc
|
3 years ago |
xiongxinlei
|
ebfe3e6b13
|
test.py update the CSVDataset, test=doc
|
3 years ago |
xiongxinlei
|
acebfad7b7
|
change the vector csv.spk_id to csv.label, test=doc
|
3 years ago |
xiongxinlei
|
57c11dcab0
|
add some annotations, test=doc
|
3 years ago |
xiongxinlei
|
30b5b3cb9e
|
add vector csv dataset format, test=doc
|
3 years ago |
TianYuan
|
e366fb6b2f
|
Merge pull request #1643 from Jackwaterveg/check
[Doc] supplement note
|
3 years ago |
huangyuxin
|
ca860e3d2f
|
supplement note
|
3 years ago |
TianYuan
|
828ee14404
|
add license and reference for some models, test=doc
|
3 years ago |
xiongxinlei
|
5b05300e53
|
train process add new voxceleb and rirs dataset, test=doc
|
3 years ago |
xiongxinlei
|
965f486dd5
|
add voxceleb and rirs noise dataset
|
3 years ago |
Hui Zhang
|
36df70cbe6
|
Merge pull request #1638 from zh794390558/spx_refactor
[speechx] refactor audio/data/feature cache
|
3 years ago |
TianYuan
|
5bff096715
|
Merge pull request #1634 from yt605155624/cnn_decoder
[TTS]Cnn decoder
|
3 years ago |
TianYuan
|
3aec266ca5
|
add chunk size and pad size in args, test=doc
|
3 years ago |
Hui Zhang
|
cb39777a60
|
format code
|
3 years ago |
TianYuan
|
4d7cd0e063
|
add streaming synthesize, test=tts
|
3 years ago |
liangym
|
602b0b0da3
|
Merge pull request #1632 from lym0302/develop
[server] fix output bug
|
3 years ago |
Hui Zhang
|
61941d14b0
|
Merge pull request #1627 from WilliamZhang06/ws-develop
[websocket] added online asr engine
|
3 years ago |
WilliamZhang06
|
2ec8d608bf
|
fixed comments, test=doc
|
3 years ago |
liangym
|
21c4132eda
|
Update paddlespeech_client.py
|
3 years ago |
TianYuan
|
005aa4066c
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
|
3 years ago |
TianYuan
|
0fc79f474d
|
add CNNDecoder, test=tts
|
3 years ago |
WilliamZhang06
|
d847fe29cf
|
added online asr engine , test=doc
|
3 years ago |
TianYuan
|
318edec303
|
Merge pull request #1613 from yt605155624/restructure_expand
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
Hui Zhang
|
943d4ac1ee
|
Merge pull request #1612 from Jackwaterveg/update
[ASR] Replace kaidi_fbank with paddleaudio
|
3 years ago |
huangyuxin
|
f47146af49
|
add docstring, test=asr
|
3 years ago |
huangyuxin
|
ed490b66cb
|
update spectrogram, test=asr
|
3 years ago |
Hui Zhang
|
84d712d493
|
format code, test=doc
|
3 years ago |
Honei
|
d60856b1ed
|
Merge pull request #1614 from Honei/vox12
[vec]change the vector output to numpy.array
|
3 years ago |
xiongxinlei
|
ed7113f320
|
change the vector output to numpy.array
|
3 years ago |
TianYuan
|
bc5ae43d3a
|
restructure expand in length_regulator.py for paddle2onnx, test=tts
|
3 years ago |
huangyuxin
|
0ffe1f9114
|
replace kaidi_fbank with paddleaudio
|
3 years ago |
Hui Zhang
|
caee809513
|
Merge pull request #1605 from Honei/vox12
[vec]add speaker verification demo and doc
|
3 years ago |
xiongxinlei
|
5ae57206f3
|
add paddlespeech vector modules __init__.py
|
3 years ago |
xiongxinlei
|
2c9dc0c89b
|
add some vector cli comments, test=doc
|
3 years ago |
xiongxinlei
|
ef1bc5e815
|
vector cli output dim info, test=doc
|
3 years ago |
xiongxinlei
|
d5142e5e15
|
add vector cli annotation, test=doc
|
3 years ago |
xiongxinlei
|
ad2caf2ccb
|
add speaker verification demo and doc, test=doc
|
3 years ago |
TianYuan
|
3cc0ec950e
|
Merge pull request #1604 from lym0302/add_readme
[server] update readme
|
3 years ago |
lym0302
|
829f1e332e
|
update readme, test=doc
|
3 years ago |
xiongxinlei
|
0f78d25f76
|
add vector cli batch and pipeline test demo, test=doc
|
3 years ago |
Honei
|
305bacdcf2
|
Merge branch 'develop' into vox12
|
3 years ago |
xiongxinlei
|
0bb67d8b8e
|
add vector cli unit test, test=doc
|
3 years ago |
KP
|
b6e976a860
|
Merge pull request #1602 from yt605155624/fix_dtype
[TTS]fix dtype of window of stft
|
3 years ago |
xiongxinlei
|
62cbce6915
|
add vectorwrapper to extract audio embedding
|
3 years ago |
TianYuan
|
8938483529
|
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
[TTS] update readme for ljspeech hifigan
|
3 years ago |
TianYuan
|
5347dbad3f
|
fix dtype of window of stft, test=tts
|
3 years ago |
TianYuan
|
342b487383
|
update readme for ljspeech hifigan, test=tts
|
3 years ago |
Hui Zhang
|
4051e7b762
|
fix compliance test bug, and format
|
3 years ago |
TianYuan
|
26ef47810d
|
Merge pull request #1593 from windstamp/npu_dev_20220322
[NPU] Add NPU support for TransformerTTS
|
3 years ago |
zhangkeliang
|
59b3de6a6d
|
[NPU] test TransformerTTS with NPU
|
3 years ago |
Jackwaterveg
|
fcc1762048
|
Merge pull request #1577 from Jackwaterveg/change_init
[ASR] change default initializer to kaiming_uniform
|
3 years ago |
huangyuxin
|
e1b581b622
|
fix some bug, test=asr
|
3 years ago |
Hui Zhang
|
b5315657ff
|
Merge pull request #1509 from qingen/cluster
[vec] add clustering of vectors
|
3 years ago |
huangyuxin
|
6da8465f14
|
add dist_sampler args, test=asr
|
3 years ago |
TianYuan
|
e5e8b8a129
|
Merge pull request #1587 from yt605155624/add_vctk_hifigan
[TTS]Add vctk hifigan
|
3 years ago |
TianYuan
|
6469568d2a
|
update readme for vctk hifigan, test=tts
|
3 years ago |
huangyuxin
|
a4f5a68074
|
fix some format, test=asr
|
3 years ago |
xiongxinlei
|
d85d1deef5
|
exec pre-commit in paddlespeech vector, test=doc
|
3 years ago |
xiongxinlei
|
9874fb7d75
|
add some comments in code
|
3 years ago |
huangyuxin
|
e991d82ae7
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
|
3 years ago |
huangyuxin
|
d53e1163a6
|
update the code, test=asr
|
3 years ago |
xiongxinlei
|
b9eafddd94
|
change - to _ to distinguish field
|
3 years ago |
xiongxinlei
|
9c6735f921
|
add vector voxceleb12 base mode url, test=doc
|
3 years ago |
xiongxinlei
|
d28ccfa96b
|
add vector cli component, test=doc
|
3 years ago |
KP
|
831cadacc7
|
Add paddleaudio doc.
|
3 years ago |
TianYuan
|
5ab2601759
|
update readme for aishell3 hifigan, test=tts
|
3 years ago |
Hui Zhang
|
6abc5d9f7e
|
format
|
3 years ago |
huangyuxin
|
ab16d8ce3c
|
change default initializer to kaiming_uniform, test=asr
|
3 years ago |
qingen
|
0f7ede11ef
|
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
|
3 years ago |
qingen
|
d16ce21d47
|
[wip][vec] update cluster of diarization, test=doc #1304
|
3 years ago |
xiongxinlei
|
506d26a957
|
change the code style to s2t code style, test=doc
|
3 years ago |
xiongxinlei
|
311fa87a11
|
add some comments to the code
|
3 years ago |
Hui Zhang
|
90deeca06f
|
Merge pull request #1554 from lym0302/develop
[server] add server cls
|
3 years ago |
lym0302
|
89457b273a
|
modify, test=doc
|
3 years ago |
xiongxinlei
|
8ed5c287a3
|
add vox2 data into VoxCeleb class
|
3 years ago |
lym0302
|
77bad44e8b
|
modify readme, test=doc
|
3 years ago |
lym0302
|
8ef92a9495
|
modify, test=doc
|
3 years ago |
lym0302
|
89dbda58f6
|
add cls static model, test=doc
|
3 years ago |
Hui Zhang
|
40ab05a462
|
Merge pull request #1552 from yt605155624/format_syn
[TTS]format synthesize
|
3 years ago |
lym0302
|
5187df847f
|
modify server demo, test=doc
|
3 years ago |
xiongxinlei
|
584a2c0e39
|
add ecapa-tdnn config yaml file
|
3 years ago |
lym0302
|
0a6602c708
|
modify application.yaml, test=doc
|
3 years ago |
TianYuan
|
544c372b50
|
fix cr, test=tts
|
3 years ago |
lym0302
|
99fa7a8205
|
add server cls, test=doc
|
3 years ago |
TianYuan
|
fe8bf2a38c
|
format synthesize, test=tts
|
3 years ago |
xiongxinlei
|
993d6783d7
|
remove unused code, test=doc
|
3 years ago |
xiongxinlei
|
0e87037f2c
|
refactor to compilance paddleaudio
|
3 years ago |
xiongxinlei
|
4473405f82
|
merge develop to vox12, test=doc
|
3 years ago |
Honei
|
0dee8f40e9
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
xiongxinlei
|
60d73bb7bd
|
add state 0 to prepare the voxcele data and augment data
|
3 years ago |
xiongxinlei
|
14efbf5b15
|
check extract embedding result, test=doc
|
3 years ago |
xiongxinlei
|
386ef3f161
|
add voxceleb augment unit test, test=doc
|
3 years ago |
Hui Zhang
|
5147163592
|
Merge pull request #1544 from yt605155624/add_vctk_hifigan
[tts]add vctk hifigan egs
|
3 years ago |
TianYuan
|
81d964f0a0
|
add vctk hifigan, test=tts
|
3 years ago |
xiongxinlei
|
2d89c80e6f
|
add waveform augment pipeline, test=doc
|
3 years ago |
lym0302
|
3b304544f6
|
modify yaml, test=doc
|
3 years ago |
xiongxinlei
|
ac4967e204
|
optimize the data prepare process
|
3 years ago |
xiongxinlei
|
016ed6d69c
|
repair the code according to the part comment, test=doc
|
3 years ago |
Hui Zhang
|
2886ab9373
|
Merge pull request #1530 from lym0302/server_cli
[server] add server test
|
3 years ago |
xiongxinlei
|
1f74af110b
|
add training log info and comment, test=doc
|
3 years ago |
lym0302
|
e50c1b3b1d
|
add server test, test=doc
|
3 years ago |
xiongxinlei
|
4648059b5f
|
add training process for sid, test=doc
|
3 years ago |
xiongxinlei
|
7668f61422
|
add sid dataloader for training, test=doc
|
3 years ago |
xiongxinlei
|
6af2bc3d5b
|
add sid loss wraper for voxceleb, test=doc
|
3 years ago |
xiongxinlei
|
57c4f4a68c
|
add sid learning rate and training model
|
3 years ago |
TianYuan
|
4d2f2191a8
|
fix gbk encode bug
|
3 years ago |
Honei
|
1395b5f5fa
|
Merge branch 'PaddlePaddle:develop' into develop
|
3 years ago |
TianYuan
|
175c39b4a4
|
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
|
3 years ago |
Hui Zhang
|
5ba4907c44
|
Merge pull request #1514 from lym0302/server_cli
[server] update server cli
|
3 years ago |
lym0302
|
85d4a31e04
|
update application.yaml, test=doc
|
3 years ago |
Jerryuhoo
|
c116a3a926
|
fix Speedyspeech multi-speaker inference, test=tts
|
3 years ago |
lym0302
|
ab04488738
|
update server cli, test=doc
|
3 years ago |
TianYuan
|
cb07bd2a94
|
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
|
3 years ago |
Hui Zhang
|
26d413ce8f
|
Merge pull request #1510 from lym0302/paddlespeech_stats
[server] add paddlespeech_server stats
|
3 years ago |
lym0302
|
72c0cda30c
|
add paddlespeech_server stats, test=doc
|
3 years ago |
Hui Zhang
|
e8f2d8f11b
|
Merge pull request #1507 from zh794390558/cli
[cli] add cli batch/pipe example to readme
|
3 years ago |
Hui Zhang
|
2517df92a0
|
Merge pull request #1508 from lym0302/paddlespeech_stats
[CLI] modified text sr to lang
|
3 years ago |
TianYuan
|
b6d33a7fb4
|
Merge pull request #1506 from yt605155624/fix_frontend
[TTS]update text frontend, test=tts
|
3 years ago |
lym0302
|
395c923dee
|
modified text sr to lang, test=doc
|
3 years ago |
Hui Zhang
|
75098698d8
|
format,test=doc
|
3 years ago |
TianYuan
|
66a8beb27f
|
update text frontend, test=tts
|
3 years ago |
lym0302
|
96abb33b5b
|
add __call__, test=doc
|
3 years ago |
lym0302
|
5f1728f855
|
rm server related, test=doc
|
3 years ago |
xiongxinlei
|
70d3b01c0d
|
remove invalid code
|
3 years ago |
xiongxinlei
|
d7da629302
|
add kaldi feats egs dataset
|
3 years ago |
xiongxinlei
|
6f7e9656fe
|
add kaldi feats ark dataset
|
3 years ago |
lym0302
|
35357e775e
|
update, test=doc
|
3 years ago |
lym0302
|
e5aa24fa5a
|
resolve setup.py conflicts, test=doc
|
3 years ago |
lym0302
|
fe6be4a65e
|
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
|
3 years ago |
lym0302
|
f8375764b9
|
add paddlespeech stats, test=doc
|
3 years ago |
Hui Zhang
|
8d474c2658
|
Merge pull request #1482 from lym0302/servercli_update
[server] update server cli
|
3 years ago |
lym0302
|
162361d878
|
format code, test=doc
|
3 years ago |
lym0302
|
434708cff4
|
set device cpu, test=doc
|
3 years ago |
lym0302
|
920b2c808c
|
paras required, test=doc
|
3 years ago |
Hui Zhang
|
6b1fe70100
|
format code,test=doc
|
3 years ago |
lym0302
|
6b2dd16845
|
update server cli, test=doc
|
3 years ago |
WilliamZhang06
|
78c9b7342c
|
deleted wav file , test=doc
|
3 years ago |
WilliamZhang06
|
a6ec3a26f1
|
Merge branch 'develop' into server_asr
|
3 years ago |
WilliamZhang06
|
8b4602f738
|
added isinstance code, test=doc
|
3 years ago |
lym0302
|
bb60561c66
|
update util, test=doc
|
3 years ago |
WilliamZhang06
|
147018a8b4
|
added cli changed code, test=doc
|
3 years ago |
lym0302
|
332009142b
|
add server demo, test=doc
|
3 years ago |
WilliamZhang06
|
7ebe904e20
|
fixed overload , test=doc
|
3 years ago |
Hui Zhang
|
60c0877e7a
|
Merge pull request #1472 from KPatr1ck/cli_batch
[CLI][Logger]Add cli logger control.
|
3 years ago |
WilliamZhang06
|
b8f16ac9b0
|
Merge branch 'develop' into server_asr
|
3 years ago |
WilliamZhang06
|
da3ea7bb40
|
added engine type and asr inference , test=doc
|
3 years ago |
Hui Zhang
|
49f80afe6a
|
Merge pull request #1381 from PaddlePaddle/server
[server] speech server init version
|
3 years ago |
lym0302
|
b508c4d0cb
|
add readme, test=doc
|
3 years ago |
KP
|
d36a4ccfc8
|
Add cli logger control.
|
3 years ago |
KP
|
94ed5969fa
|
Add cli logger control.
|
3 years ago |
lym0302
|
42cbe313c2
|
improve cli code, test=doc
|
3 years ago |
lym0302
|
2bf4b4521f
|
add cli, test=doc
|
3 years ago |
lym0302
|
8fd117e4da
|
add cli, test=doc
|
3 years ago |
lym0302
|
80b83b7434
|
add cli, test=doc
|
3 years ago |
KP
|
7814fba07f
|
Update batch input.
|
3 years ago |
KP
|
05288fe1c3
|
Update batch input and stdin input.
|
3 years ago |
KP
|
1818b058aa
|
Support batch input in cls task.
|
3 years ago |
WilliamZhang06
|
35e3be9ac8
|
Merge remote-tracking branch 'remote/develop' into server
|
3 years ago |
TianYuan
|
ae521d3700
|
Update infer.py
|
3 years ago |
lym0302
|
07158b2f12
|
move dir, test=doc
|
3 years ago |
lym0302
|
76391275fc
|
move dir, test=doc
|
3 years ago |
TianYuan
|
67ec6242c3
|
fix ci for waveflow, test=tts
|
3 years ago |
TianYuan
|
f51097618b
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
|
3 years ago |
TianYuan
|
fc8c0e3ea2
|
fix ci for waveflow, test=tts
|
3 years ago |
huangyuxin
|
95d5274aef
|
fix sortagrad, test=asr
|
3 years ago |
Hui Zhang
|
718c849f68
|
Merge pull request #1445 from yt605155624/update_train
[TTS]init for all works in train.py when ngpu>1
|
3 years ago |
Hui Zhang
|
f3ec985aaf
|
Merge pull request #1439 from Jackwaterveg/tipc
[TIPC]Add tipc_benchmark of conformer
|
3 years ago |
TianYuan
|
4ac7db185e
|
init for all works in train.py when ngpu>1, test=tts
|
3 years ago |
Jackwaterveg
|
426bae3de1
|
Merge pull request #1440 from yt605155624/merge_datasets
[TTS]Merge datasets, change style of docstring
|
3 years ago |
TianYuan
|
2cec8f6c76
|
update tts cli, test=doc
|
3 years ago |
TianYuan
|
9699c00769
|
change the docstring style from numpydoc to google, test=tts
|
3 years ago |
huangyuxin
|
aefe9e93a7
|
add tipc benchmark of conformer
|
3 years ago |
TianYuan
|
683679bec7
|
merge data and datasets, test=tts
|
3 years ago |
TianYuan
|
7dc1f2daa3
|
fix some librosa bugs, test=tts
|
3 years ago |
TianYuan
|
30085ac229
|
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
|
3 years ago |
TianYuan
|
25347bb6a3
|
rename tacotron2, test=tts
|
3 years ago |