lym0302
be21aed09b
trans remove file way, test=doc
3 years ago
lym0302
b1f9b8016d
add start and end request on ws tts, test=doc
3 years ago
xiongxinlei
347af638e2
changet vector train.py local_rank to rank, test=doc
3 years ago
lym0302
d4f863dc97
improve, test=doc
3 years ago
pollyyan
018dda6ee9
Merge pull request #1879 from QingshuChen/develop
...
speedyspeech support kunlun
3 years ago
Hui Zhang
c23a97e242
Merge pull request #1877 from Jackwaterveg/develop
...
[ASR] Replace dist.spawn with dist.launch
3 years ago
Hui Zhang
5b053cde6a
Merge pull request #1878 from Honei/develop
...
[server]update the vector and text readme
3 years ago
xiongxinlei
06bea5f03d
update the vector and text readme, test=doc
3 years ago
QingshuChen
e55177c3db
speedyspeech support kunlun
3 years ago
root
9f389a7a33
support cpu, test=asr
3 years ago
root
864041085f
replace dist.spawn with dist.launch, test=asr
3 years ago
TianYuan
4b7786f2ed
add vits network scripts, test=tts
3 years ago
KP
19d015b60a
Add RFT for asr task.
3 years ago
KP
da08f1c1af
Add RFT for asr task.
3 years ago
Hui Zhang
12ae137c83
update tts_api for ws
3 years ago
Hui Zhang
175c67b75e
asr socket to asr api
3 years ago
Hui Zhang
7be6b0e8cf
unify name style & frame with abs timestamp
3 years ago
Hui Zhang
15b25199c2
Merge pull request #1864 from zh794390558/doc
...
[doc] update readme with new feature
3 years ago
xiongxinlei
bb0db29d7e
update the streaming asr readme, test=doc
3 years ago
root
4d7046d244
updata released model info, test=doc
3 years ago
liangym
e7a35485e4
Merge pull request #1859 from lym0302/update_readme
...
[server] improve server code
3 years ago
Hui Zhang
02e7586394
update readme
3 years ago
lym0302
b361a73888
improve server code, test=doc
3 years ago
Hui Zhang
94aaa61726
Merge pull request #1858 from KPatr1ck/cli_version
...
[CLI]Add version command in cli.
3 years ago
KP
677898ab96
Add version command in cli.
3 years ago
Hui Zhang
13503613b4
Merge pull request #1853 from Jackwaterveg/develop
...
[CLI] update cli
3 years ago
root
3a7896fc96
update cli, test=asr
3 years ago
liangym
e87495f045
[server] update readme ( #1851 )
...
* update readme, test=doc
* update readme, test=doc
* update readme, test=doc
3 years ago
Hui Zhang
37c6106ee0
Merge pull request #1848 from zh794390558/spx
...
[speechx] rm to_float32 flags, default feature fbank
3 years ago
Hui Zhang
8522b82999
format
3 years ago
xiongxinlei
b7a77eebca
update the time stamp type, test=doc
3 years ago
Honei
43582f5091
Merge branch 'develop' into asr_time
3 years ago
Hui Zhang
d99e99ce2c
Merge pull request #1836 from Honei/punc
...
[asr][server]join streaming asr and punc server
3 years ago
Hui Zhang
435e86b335
Merge pull request #1835 from Honei/vec_server
...
[vec][server]add vector server
3 years ago
xiongxinlei
10da21a77b
update the vector cli for server, test=doc
3 years ago
xiongxinlei
2ab96187aa
streaming asr server add time stamp, test=doc
3 years ago
xiongxinlei
c78653850b
join streaming asr and punc server, test=doc
3 years ago
xiongxinlei
3950557e04
update the vector server note, test=doc
3 years ago
xiongxinlei
b1dddddbe0
add vector server, test=doc
3 years ago
Jerryuhoo
fba0693a20
fix random speaker embedding bug, test=tts
3 years ago
Hui Zhang
cdb9a1b20b
Merge pull request #1813 from Honei/v0.3
...
[R1.0]update the paddlespeech_client asr_online cli
3 years ago
Honei
ff7dbcc2de
Merge branch 'develop' into v0.3
3 years ago
xiongxinlei
f7af037cb1
add the note for offline asr, test=doc
3 years ago
xiongxinlei
3f80464926
update the streaming asr readme, test=doc
3 years ago
Hui Zhang
fc96130fdc
fix speechx core dump when stop immediately after start
3 years ago
xiongxinlei
c5fe181405
update the paddlespeech_client asr_online cli, test=doc
3 years ago
huangyuxin
4494f5a1fc
add cli models, test=doc
3 years ago
Hui Zhang
903cc67a4d
Merge pull request #1801 from Honei/v0.3
...
[R1.0]update the streaming output and punc default ip, port
3 years ago
xiongxinlei
e844e0e0bb
update the streaming output and punc default ip, port, test=doc
3 years ago
huangyuxin
18197cd3a5
renew ds2 model, test=doc
3 years ago
Hui Zhang
ebde26030b
patch func to var
3 years ago
Honei
f72cbc9b6d
Merge branch 'develop' into v0.3
3 years ago
xiongxinlei
9125cb076d
update the ws asr response, final_result to result, test=doc
3 years ago
xiongxinlei
7007b0ecac
update the asr server api, test=doc
3 years ago
Hui Zhang
5e23025c31
fix speechx ws server to return dummpy partial result, fix hang for ws client
3 years ago
Hui Zhang
d7c8c1779f
Merge pull request #1786 from Jackwaterveg/debug
...
[CLI] renew ds2 online model
3 years ago
Hui Zhang
9cc7662512
Merge pull request #1782 from lym0302/add_streaming_cli
...
[server] update readme
3 years ago
huangyuxin
e145b26355
fix
3 years ago
huangyuxin
4f9e8bfa90
renew ds2 online, test=doc
3 years ago
xiongxinlei
833900a8b4
asr client add punctuatjion server, test=doc
3 years ago
KP
abb15ac6e8
Update KWS example.
3 years ago
lym0302
651012616a
add info, test=doc
3 years ago
Hui Zhang
33ca17359f
Merge pull request #1776 from Jackwaterveg/ds2
...
[ASR] Replace fbank
3 years ago
huangyuxin
0df8d80833
remove logfbank from python_speech_features, test=asr
3 years ago
Honei
119143d0f1
Merge pull request #1731 from qingen/cluster
...
[vec] add GE2E to support unlabeled data training
3 years ago
huangyuxin
fcdaef6cb4
replace fbank, test=asr
3 years ago
Hui Zhang
f11855415c
Merge pull request #1770 from Jackwaterveg/cli
...
[ASR] fix asr cli infer
3 years ago
Hui Zhang
3b0004345c
Merge pull request #1772 from Honei/v0.3
...
[text][server]add text punc server
3 years ago
Hui Zhang
962a278996
Merge pull request #1558 from KPatr1ck/kws
...
[KWS]Add kws example on HeySnips dataset.
3 years ago
liangym
0de4d25ab8
Merge pull request #1774 from lym0302/add_streaming_cli
...
[server] fix tts streaming server
3 years ago
huangyuxin
1e999c27e9
fix exit, test=doc
3 years ago
lym0302
dc52c313fa
fix code, test=doc
3 years ago
lym0302
c6e6210964
code format, test=tts
3 years ago
xiongxinlei
9e50448039
update the punc text model, text=doc
3 years ago
lym0302
88adcaa6dc
fix code, test=doc
3 years ago
TianYuan
f256bb9c0e
Merge pull request #1771 from lym0302/add_streaming_cli
...
[server] add streaming tts demos
3 years ago
KP
caa8eb4d0d
Add KWS example.
3 years ago
KP
f9761d532c
Add KWS example.
3 years ago
KP
b60b1dadde
Add KWS example.
3 years ago
KP
e01abc5099
Add KWS example.
3 years ago
KP
521e222db8
Add mdtc model.
3 years ago
xiongxinlei
2fa1522bdd
update the punc yaml to application.yaml, test=doc
3 years ago
xiongxinlei
ba62b85e9b
add text punc server, test=doc
3 years ago
lym0302
c00c31594c
updata readme, test=doc
3 years ago
lym0302
70424e1ef9
add streaming tts demos, test=doc
3 years ago
huangyuxin
c21c3d220d
fix infer, test=doc
3 years ago
Hui Zhang
312fc4e11e
Merge pull request #1766 from Jackwaterveg/fix
...
[ASR] ds2: add log_interval and fix lr problem when resume training
3 years ago
TianYuan
0b1b573a3f
Merge pull request #1767 from Jackwaterveg/cli
...
[CLI] Add conformer_aishell, conformer_online_aishell
3 years ago
huangyuxin
ad4e04fc82
add conformer_online_aishell, test=doc
3 years ago
huangyuxin
12d2f6ea95
fix conformer_aishell of cli, test=doc
3 years ago
huangyuxin
5912ba53e4
fix log_interval and lr when resume training, test=asr
3 years ago
Hui Zhang
91e24b0480
format code
3 years ago
xiongxinlei
13a37b4892
update the online protocal note, test=doc
3 years ago
xiongxinlei
2f2cb7eaaf
update the audio handler note, test=doc
3 years ago
xiongxinlei
7aa5a3df2b
fix the streaming asr server bug, server client, test=doc
3 years ago
huangyuxin
19998ea29b
add aishell conformer, test=doc
3 years ago
TianYuan
24f0a7d44b
Merge pull request #1733 from lym0302/tts_stream
...
[server] add onnx tts engine
3 years ago
TianYuan
9121dfc046
Merge pull request #1752 from yt605155624/fix_wavernn
...
[TTS]fix wavernn white noise bug for paddle develop(2.3)
3 years ago
TianYuan
08a4673355
fix wavernn bug, test=tts
3 years ago
Jackwaterveg
85b50c4700
Merge pull request #1741 from Jackwaterveg/debug
...
[ASR] remove redundant log
3 years ago
lym0302
104c7ff27d
code format, test=doc
3 years ago
lym0302
e398fe9c74
remove code, test=doc
3 years ago
Hui Zhang
3561875dd0
Merge branch 'develop' into fix
3 years ago
Hui Zhang
c7d9b11529
format
3 years ago
huangyuxin
8e37a7c7f0
remove redundant log, test=doc
3 years ago
xiongxinlei
56751a1ed5
update the server device to paddle.device, test=doc
3 years ago
xiongxinlei
4b76a01c85
update en readme.md, test=doc
3 years ago
xiongxinlei
1a0c2bea5d
add streaming asr demo, test=doc
3 years ago
lym0302
4e9db4ff71
add onnx tts engine, test=doc
3 years ago
Jackwaterveg
8d1ee8262e
Merge branch 'develop' into CER
3 years ago
qingen
e98845d778
[vec][loss] add GE2E to support unlabeled data training, test=doc fix #1730
3 years ago
qingen
0186f522af
Merge pull request #1725 from qingen/database-search
...
[vec] add GRL to domain adaptation
3 years ago
TianYuan
e089268642
Merge pull request #1727 from yt605155624/refactor_syn_util
...
[TTS]add paddle device set for ort and inference
3 years ago
TianYuan
4646f7cc8d
add paddle device set for ort and inference, test=doc
3 years ago
Hui Zhang
523d5bd6d4
Merge pull request #1723 from yt605155624/refactor_syn_util
...
[TTS]restructure syn_utils.py, test=tts
3 years ago
qingen
7e8f9f5336
[vec][layer] add GRL to domain adaptation, test=doc fix #1724
3 years ago
TianYuan
c74fa9ada8
restructure syn_utils.py, test=tts
3 years ago
qingen
26d5dded7c
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
qingen
6a7245657f
[vec][loss] add FocalLoss to deal with class imbalances, test=doc fix #1721
3 years ago
qingen
9382ad8a16
Merge pull request #1719 from qingen/cluster
...
[vec][loss] add NCE Loss from RNNLM
3 years ago
Hui Zhang
cf9a590fa5
Merge pull request #1704 from Honei/server
...
[asr][websocket] add asr conformer websocket server
3 years ago
xiongxinlei
ac9fcf7f4a
fix the asr infernece model, paddle.no_grad, test=doc
3 years ago
xiongxinlei
ff4ddd229e
fix the unuseful code, test=doc
3 years ago
xiongxinlei
9c03280ca6
remove debug info, test=doc
3 years ago
xiongxinlei
48fa84bee9
fix the asr online client bug, return None, test=doc
3 years ago
qingen
00febff734
[vec][loss] update docstring, test=doc fix #1717
3 years ago
xiongxinlei
babac27a79
fix ds2 online edge bug, test=doc
3 years ago
liangym
ab656aab57
Merge pull request #1713 from lym0302/tts_stream
...
[server] add streaming am infer
3 years ago
xiongxinlei
dcab04a799
merge develop to server
3 years ago
xiongxinlei
f56dba0ca7
fix the code format, test=doc
3 years ago
Honei
55122cfc86
Merge branch 'develop' into server
3 years ago
TianYuan
7c0ec3c249
Merge pull request #1716 from yt605155624/update_cli
...
[CLI]update cli, test=doc
3 years ago
xiongxinlei
380afbbc5d
add ds2 model multi session, test=doc
3 years ago
qingen
166757703f
[vec][loss] add NCE Loss from RNNLM, test=doc fix #1717
3 years ago
lym0302
9e41ac8550
code format, test=doc
3 years ago
qingen
880829fe89
Merge pull request #1681 from qingen/cluster
...
[vec][score] add plda model
3 years ago
TianYuan
a44f5c099e
update cli, test=doc
3 years ago
lym0302
40dde22fc4
code format, test=doc
3 years ago
huangyuxin
6e80618e3d
add ds2
3 years ago
xiongxinlei
5acb0b5252
fix the websocket chunk edge bug, test=doc
3 years ago
Hui Zhang
b78bc6375b
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
...
[TTS]add fastspeech2 cnndecoder onnx model
3 years ago
xiongxinlei
05a8a4b5fc
add connection stability, test=doc
3 years ago
lym0302
00a6236fe2
remove test code, test=doc
3 years ago
lym0302
9c0ceaacb6
add streaming am infer, test=doc
3 years ago
xiongxinlei
68731c61f4
add multi session result, test=doc
3 years ago
xiongxinlei
10e825d9b2
check chunk window process, test=doc
3 years ago
qingen
159d8fd628
Merge branch 'develop' into cluster
3 years ago
xiongxinlei
d2640c1406
add mult sesssion process, test=doc
3 years ago
TianYuan
dafe7c3657
add fastspeech2 cnndecoder onnx model, test=tts
3 years ago
qingen
deb3ba070b
[vec] update mata info, test=doc
3 years ago
xiongxinlei
97d31f9aac
update the attention_rescoring method, test=doc
3 years ago
xiongxinlei
0c5dbbee5b
add conformer ctc prefix beam search decoding method, test=doc
3 years ago
Honei
9d20a10b5a
Merge branch 'develop' into server
3 years ago
Hui Zhang
0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
...
[asr][websocket]fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei
3ce4301665
add asr websocket server note, test=doc
3 years ago
xiongxinlei
efc269b75f
remove unuseful code, test=doc
3 years ago
xiongxinlei
89b102a7dd
fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei
d21ccd0287
add conformer online server, test=doc
3 years ago
Hui Zhang
c7b987c55d
format
3 years ago
Hui Zhang
ec469179bf
Merge pull request #1696 from qingen/database-search
...
[vec][server] vpr demo support
3 years ago
Hui Zhang
72933abc70
Merge pull request #1701 from WilliamZhang06/web
...
[WebSocket] added online asr doc and online asr command line, test=doc
3 years ago
xiongxinlei
af484fc980
convert websockert results to str from bytest, test=doc
3 years ago
WilliamZhang06
39895f6a25
added online asr doc and online asr command line, test=doc
3 years ago
qingen
240520c0ca
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
TianYuan
8bebf81199
[doc]fix typo, test=doc
3 years ago
TianYuan
98f67870ea
Merge pull request #1693 from yt605155624/fix_ss_NHWC
...
[TTS]change NLC to NCL in speedyspeech, test=tts
3 years ago
buchongyu
48358055d0
修改hack 单词拼写错误
3 years ago
qingen
d3f8715b0a
Merge branch 'database-search' of github.com:qingen/PaddleSpeech into database-search
3 years ago
qingen
89a0ec9018
[vec][server] vpr demo support, test=doc fix #1695
3 years ago
TianYuan
8b801ca18b
change NLC to NCL in speedyspeech, test=tts
3 years ago
WilliamZhang06
1dc02c7295
added online web client, test=doc
3 years ago
Hui Zhang
1759116bd7
Revert "[WebSocket] fixed online model md5 error , test=doc"
3 years ago
xiongxinlei
d1935d8552
add vector necessary note, test=doc
3 years ago
lym0302
9d0224460b
code format, test=doc
3 years ago
lym0302
4b111146dc
code format, test=doc
3 years ago
qingen
0d8e2deb61
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
Honei
48e0177767
Merge pull request #1630 from Honei/vox12
...
[vec]voxceleb convert dataset format to paddlespeech
3 years ago
qingen
fc72295334
Merge pull request #1651 from ccrrong/ami
...
[vec] add speaker diarization pipeline
3 years ago
xiongxinlei
4af007c3fc
fix vector ips log bug, test=doc
3 years ago
lym0302
82992b3ed6
add test code, test=doc
3 years ago
qingen
8d9bd9a93a
[vec][score] update Copyright, test=doc fix #1667
3 years ago
xiongxinlei
567286add3
wrap the embedding mean and std norm, test=doc
3 years ago
Hui Zhang
d65b63b28d
Merge pull request #1652 from lym0302/tts_stream
...
[server] add stream tts server
3 years ago
qingen
44c6623448
[vec][score] update plda model, test=doc fix #1667
3 years ago
ccrrong
bc53f726fe
convert dataset format to paddlespeech, test=doc
3 years ago
Hui Zhang
2f97b81346
Merge pull request #1682 from WilliamZhang06/ws-develop
...
[WebSocket] fixed online model md5 error , test=doc
3 years ago
root
9dacfb405f
fixed online model md5 error , test=doc
3 years ago
qingen
6446f72cab
[vec][score] add plda model, test=doc fix #1667
3 years ago
qingen
84576d6956
[vec][score] add plda model, test=doc fix #1667
3 years ago
lym0302
1a3c811f04
code format, test=doc
3 years ago
TianYuan
0d6f5868ea
Merge pull request #1665 from yt605155624/add_onnx
...
[TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan
3 years ago
Honei
f500fa8bde
Merge pull request #1646 from Honei/develop
...
[vec]add speaker verification score method
3 years ago
TianYuan
0282d45c62
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
3 years ago
TianYuan
c765fca6b4
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
3 years ago
TianYuan
124eb6af8f
update notes, test=doc
3 years ago
TianYuan
e0d222e674
update notes, test=doc
3 years ago
Hui Zhang
1843bed458
Merge pull request #1666 from Jackwaterveg/cli
...
[CLI] ASR: Add duration limitation for asr
3 years ago
xiongxinlei
a8244dc5b0
update the note, test=doc
3 years ago
Jackwaterveg
c852776bc6
test=doc
3 years ago
TianYuan
f264b912fc
add warmup for frontend, test=doc
3 years ago
Jackwaterveg
4922e697e1
update cli, test = asr
3 years ago
Jackwaterveg
1c05d03806
test=asr
3 years ago
xiongxinlei
9b5f7f71ac
add part ecapa-tdnn note, test=doc
3 years ago
Hui Zhang
6eed542c08
Merge pull request #1660 from yt605155624/fix_pre
...
[TTS]fix preprocess bug, test=tts
3 years ago
Honei
83310b6379
Merge branch 'develop' into develop
3 years ago
huangyuxin
faf21f033f
add duration limitation for asr
3 years ago
TianYuan
7aecb2c4bb
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei
d064c8196e
update the speaker verification model, test=doc
3 years ago
xiongxinlei
e72912adb9
update the speaker verification model, test=doc
3 years ago
TianYuan
a8f5990869
fix preprocess bug, test=tts
3 years ago
lym0302
759a9e61e4
update server cli, test=doc
3 years ago
lym0302
603e565ab1
add stream tts server, test=doc
3 years ago
ccrrong
378fe5909f
add ami diarization pipeline, test=doc
3 years ago
xiongxinlei
48b8cc8937
add score method, test=doc
3 years ago
xiongxinlei
ebfe3e6b13
test.py update the CSVDataset, test=doc
3 years ago
xiongxinlei
acebfad7b7
change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei
57c11dcab0
add some annotations, test=doc
3 years ago
xiongxinlei
30b5b3cb9e
add vector csv dataset format, test=doc
3 years ago
TianYuan
e366fb6b2f
Merge pull request #1643 from Jackwaterveg/check
...
[Doc] supplement note
3 years ago
huangyuxin
ca860e3d2f
supplement note
3 years ago
TianYuan
828ee14404
add license and reference for some models, test=doc
3 years ago
xiongxinlei
5b05300e53
train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei
965f486dd5
add voxceleb and rirs noise dataset
3 years ago
Hui Zhang
36df70cbe6
Merge pull request #1638 from zh794390558/spx_refactor
...
[speechx] refactor audio/data/feature cache
3 years ago
TianYuan
5bff096715
Merge pull request #1634 from yt605155624/cnn_decoder
...
[TTS]Cnn decoder
3 years ago
TianYuan
3aec266ca5
add chunk size and pad size in args, test=doc
3 years ago
Hui Zhang
cb39777a60
format code
3 years ago
TianYuan
4d7cd0e063
add streaming synthesize, test=tts
3 years ago
liangym
602b0b0da3
Merge pull request #1632 from lym0302/develop
...
[server] fix output bug
3 years ago
Hui Zhang
61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
...
[websocket] added online asr engine
3 years ago
WilliamZhang06
2ec8d608bf
fixed comments, test=doc
3 years ago
liangym
21c4132eda
Update paddlespeech_client.py
3 years ago
TianYuan
005aa4066c
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan
0fc79f474d
add CNNDecoder, test=tts
3 years ago
WilliamZhang06
d847fe29cf
added online asr engine , test=doc
3 years ago
TianYuan
318edec303
Merge pull request #1613 from yt605155624/restructure_expand
...
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
Hui Zhang
943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
...
[ASR] Replace kaidi_fbank with paddleaudio
3 years ago
huangyuxin
f47146af49
add docstring, test=asr
3 years ago
huangyuxin
ed490b66cb
update spectrogram, test=asr
3 years ago
Hui Zhang
84d712d493
format code, test=doc
3 years ago
Honei
d60856b1ed
Merge pull request #1614 from Honei/vox12
...
[vec]change the vector output to numpy.array
3 years ago
xiongxinlei
ed7113f320
change the vector output to numpy.array
3 years ago
TianYuan
bc5ae43d3a
restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin
0ffe1f9114
replace kaidi_fbank with paddleaudio
3 years ago
Hui Zhang
caee809513
Merge pull request #1605 from Honei/vox12
...
[vec]add speaker verification demo and doc
3 years ago
xiongxinlei
5ae57206f3
add paddlespeech vector modules __init__.py
3 years ago
xiongxinlei
2c9dc0c89b
add some vector cli comments, test=doc
3 years ago
xiongxinlei
ef1bc5e815
vector cli output dim info, test=doc
3 years ago
xiongxinlei
d5142e5e15
add vector cli annotation, test=doc
3 years ago
xiongxinlei
ad2caf2ccb
add speaker verification demo and doc, test=doc
3 years ago
TianYuan
3cc0ec950e
Merge pull request #1604 from lym0302/add_readme
...
[server] update readme
3 years ago
lym0302
829f1e332e
update readme, test=doc
3 years ago
xiongxinlei
0f78d25f76
add vector cli batch and pipeline test demo, test=doc
3 years ago
Honei
305bacdcf2
Merge branch 'develop' into vox12
3 years ago
xiongxinlei
0bb67d8b8e
add vector cli unit test, test=doc
3 years ago
KP
b6e976a860
Merge pull request #1602 from yt605155624/fix_dtype
...
[TTS]fix dtype of window of stft
3 years ago
xiongxinlei
62cbce6915
add vectorwrapper to extract audio embedding
3 years ago
TianYuan
8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
...
[TTS] update readme for ljspeech hifigan
3 years ago
TianYuan
5347dbad3f
fix dtype of window of stft, test=tts
3 years ago
TianYuan
342b487383
update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang
4051e7b762
fix compliance test bug, and format
3 years ago
TianYuan
26ef47810d
Merge pull request #1593 from windstamp/npu_dev_20220322
...
[NPU] Add NPU support for TransformerTTS
3 years ago
zhangkeliang
59b3de6a6d
[NPU] test TransformerTTS with NPU
3 years ago
Jackwaterveg
fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
...
[ASR] change default initializer to kaiming_uniform
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
Hui Zhang
b5315657ff
Merge pull request #1509 from qingen/cluster
...
[vec] add clustering of vectors
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
TianYuan
e5e8b8a129
Merge pull request #1587 from yt605155624/add_vctk_hifigan
...
[TTS]Add vctk hifigan
3 years ago
TianYuan
6469568d2a
update readme for vctk hifigan, test=tts
3 years ago
huangyuxin
a4f5a68074
fix some format, test=asr
3 years ago
xiongxinlei
d85d1deef5
exec pre-commit in paddlespeech vector, test=doc
3 years ago
xiongxinlei
9874fb7d75
add some comments in code
3 years ago
huangyuxin
e991d82ae7
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
huangyuxin
d53e1163a6
update the code, test=asr
3 years ago
xiongxinlei
b9eafddd94
change - to _ to distinguish field
3 years ago
xiongxinlei
9c6735f921
add vector voxceleb12 base mode url, test=doc
3 years ago
xiongxinlei
d28ccfa96b
add vector cli component, test=doc
3 years ago
KP
831cadacc7
Add paddleaudio doc.
3 years ago
TianYuan
5ab2601759
update readme for aishell3 hifigan, test=tts
3 years ago
Hui Zhang
6abc5d9f7e
format
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
qingen
0f7ede11ef
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen
d16ce21d47
[wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei
506d26a957
change the code style to s2t code style, test=doc
3 years ago
xiongxinlei
311fa87a11
add some comments to the code
3 years ago
Hui Zhang
90deeca06f
Merge pull request #1554 from lym0302/develop
...
[server] add server cls
3 years ago
lym0302
89457b273a
modify, test=doc
3 years ago
xiongxinlei
8ed5c287a3
add vox2 data into VoxCeleb class
3 years ago
lym0302
77bad44e8b
modify readme, test=doc
3 years ago
lym0302
8ef92a9495
modify, test=doc
3 years ago
lym0302
89dbda58f6
add cls static model, test=doc
3 years ago
Hui Zhang
40ab05a462
Merge pull request #1552 from yt605155624/format_syn
...
[TTS]format synthesize
3 years ago
lym0302
5187df847f
modify server demo, test=doc
3 years ago
xiongxinlei
584a2c0e39
add ecapa-tdnn config yaml file
3 years ago
lym0302
0a6602c708
modify application.yaml, test=doc
3 years ago
TianYuan
544c372b50
fix cr, test=tts
3 years ago
lym0302
99fa7a8205
add server cls, test=doc
3 years ago
TianYuan
fe8bf2a38c
format synthesize, test=tts
3 years ago
xiongxinlei
993d6783d7
remove unused code, test=doc
3 years ago
xiongxinlei
0e87037f2c
refactor to compilance paddleaudio
3 years ago
xiongxinlei
4473405f82
merge develop to vox12, test=doc
3 years ago
Honei
0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei
60d73bb7bd
add state 0 to prepare the voxcele data and augment data
3 years ago
xiongxinlei
14efbf5b15
check extract embedding result, test=doc
3 years ago
xiongxinlei
386ef3f161
add voxceleb augment unit test, test=doc
3 years ago
Hui Zhang
5147163592
Merge pull request #1544 from yt605155624/add_vctk_hifigan
...
[tts]add vctk hifigan egs
3 years ago
TianYuan
81d964f0a0
add vctk hifigan, test=tts
3 years ago
xiongxinlei
2d89c80e6f
add waveform augment pipeline, test=doc
3 years ago
lym0302
3b304544f6
modify yaml, test=doc
3 years ago
xiongxinlei
ac4967e204
optimize the data prepare process
3 years ago
xiongxinlei
016ed6d69c
repair the code according to the part comment, test=doc
3 years ago
Hui Zhang
2886ab9373
Merge pull request #1530 from lym0302/server_cli
...
[server] add server test
3 years ago
xiongxinlei
1f74af110b
add training log info and comment, test=doc
3 years ago
lym0302
e50c1b3b1d
add server test, test=doc
3 years ago
xiongxinlei
4648059b5f
add training process for sid, test=doc
3 years ago
xiongxinlei
7668f61422
add sid dataloader for training, test=doc
3 years ago
xiongxinlei
6af2bc3d5b
add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei
57c4f4a68c
add sid learning rate and training model
3 years ago
TianYuan
4d2f2191a8
fix gbk encode bug
3 years ago
Honei
1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan
175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
...
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
3 years ago
Hui Zhang
5ba4907c44
Merge pull request #1514 from lym0302/server_cli
...
[server] update server cli
3 years ago
lym0302
85d4a31e04
update application.yaml, test=doc
3 years ago
Jerryuhoo
c116a3a926
fix Speedyspeech multi-speaker inference, test=tts
3 years ago
lym0302
ab04488738
update server cli, test=doc
3 years ago
TianYuan
cb07bd2a94
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
Hui Zhang
26d413ce8f
Merge pull request #1510 from lym0302/paddlespeech_stats
...
[server] add paddlespeech_server stats
3 years ago
lym0302
72c0cda30c
add paddlespeech_server stats, test=doc
3 years ago
Hui Zhang
e8f2d8f11b
Merge pull request #1507 from zh794390558/cli
...
[cli] add cli batch/pipe example to readme
3 years ago
Hui Zhang
2517df92a0
Merge pull request #1508 from lym0302/paddlespeech_stats
...
[CLI] modified text sr to lang
3 years ago
TianYuan
b6d33a7fb4
Merge pull request #1506 from yt605155624/fix_frontend
...
[TTS]update text frontend, test=tts
3 years ago
lym0302
395c923dee
modified text sr to lang, test=doc
3 years ago
Hui Zhang
75098698d8
format,test=doc
3 years ago
TianYuan
66a8beb27f
update text frontend, test=tts
3 years ago
lym0302
96abb33b5b
add __call__, test=doc
3 years ago
lym0302
5f1728f855
rm server related, test=doc
3 years ago
xiongxinlei
70d3b01c0d
remove invalid code
3 years ago
xiongxinlei
d7da629302
add kaldi feats egs dataset
3 years ago
xiongxinlei
6f7e9656fe
add kaldi feats ark dataset
3 years ago
lym0302
35357e775e
update, test=doc
3 years ago
lym0302
e5aa24fa5a
resolve setup.py conflicts, test=doc
3 years ago
lym0302
fe6be4a65e
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
3 years ago
lym0302
f8375764b9
add paddlespeech stats, test=doc
3 years ago
Hui Zhang
8d474c2658
Merge pull request #1482 from lym0302/servercli_update
...
[server] update server cli
3 years ago
lym0302
162361d878
format code, test=doc
3 years ago
lym0302
434708cff4
set device cpu, test=doc
3 years ago
lym0302
920b2c808c
paras required, test=doc
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
lym0302
6b2dd16845
update server cli, test=doc
3 years ago
WilliamZhang06
78c9b7342c
deleted wav file , test=doc
3 years ago
WilliamZhang06
a6ec3a26f1
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
8b4602f738
added isinstance code, test=doc
3 years ago
lym0302
bb60561c66
update util, test=doc
3 years ago
WilliamZhang06
147018a8b4
added cli changed code, test=doc
3 years ago
lym0302
332009142b
add server demo, test=doc
3 years ago
WilliamZhang06
7ebe904e20
fixed overload , test=doc
3 years ago
Hui Zhang
60c0877e7a
Merge pull request #1472 from KPatr1ck/cli_batch
...
[CLI][Logger]Add cli logger control.
3 years ago
WilliamZhang06
b8f16ac9b0
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
Hui Zhang
49f80afe6a
Merge pull request #1381 from PaddlePaddle/server
...
[server] speech server init version
3 years ago
lym0302
b508c4d0cb
add readme, test=doc
3 years ago
KP
d36a4ccfc8
Add cli logger control.
3 years ago
KP
94ed5969fa
Add cli logger control.
3 years ago
lym0302
42cbe313c2
improve cli code, test=doc
3 years ago
lym0302
2bf4b4521f
add cli, test=doc
3 years ago
lym0302
8fd117e4da
add cli, test=doc
3 years ago
lym0302
80b83b7434
add cli, test=doc
3 years ago
KP
7814fba07f
Update batch input.
3 years ago
KP
05288fe1c3
Update batch input and stdin input.
3 years ago
KP
1818b058aa
Support batch input in cls task.
3 years ago
WilliamZhang06
35e3be9ac8
Merge remote-tracking branch 'remote/develop' into server
3 years ago
TianYuan
ae521d3700
Update infer.py
3 years ago
lym0302
07158b2f12
move dir, test=doc
3 years ago
lym0302
76391275fc
move dir, test=doc
3 years ago
TianYuan
67ec6242c3
fix ci for waveflow, test=tts
3 years ago
TianYuan
f51097618b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
3 years ago
TianYuan
fc8c0e3ea2
fix ci for waveflow, test=tts
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
Hui Zhang
718c849f68
Merge pull request #1445 from yt605155624/update_train
...
[TTS]init for all works in train.py when ngpu>1
3 years ago
Hui Zhang
f3ec985aaf
Merge pull request #1439 from Jackwaterveg/tipc
...
[TIPC]Add tipc_benchmark of conformer
3 years ago
TianYuan
4ac7db185e
init for all works in train.py when ngpu>1, test=tts
3 years ago
Jackwaterveg
426bae3de1
Merge pull request #1440 from yt605155624/merge_datasets
...
[TTS]Merge datasets, change style of docstring
3 years ago
TianYuan
2cec8f6c76
update tts cli, test=doc
3 years ago
TianYuan
9699c00769
change the docstring style from numpydoc to google, test=tts
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
683679bec7
merge data and datasets, test=tts
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
TianYuan
30085ac229
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
3 years ago
TianYuan
25347bb6a3
rename tacotron2, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
Hui Zhang
dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
...
[TTS] add wavernn
3 years ago
TianYuan
0747600c95
[TTS]add ljspeech new tacotron2 ( #1416 )
...
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
TianYuan
348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts ( #1419 )
3 years ago
huangyuxin
f428ec4431
change log of cli/asr/infer
3 years ago
TianYuan
1b0c034134
update wavernn, test=tts
3 years ago
TianYuan
89e69ee10e
[TTS]fix tacotron2 dygraph to static ( #1414 )
...
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Hui Zhang
4128f4d61f
fix __version__ error in develop ( #1398 )
3 years ago
TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc ]add tacotron2 readme ( #1385 )
...
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
1cc7905d51
rm csmsc.py, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructer problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Jerryuhoo
f515416c4a
fix missing model choice, test=doc
3 years ago
Jerryuhoo
a22080130b
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang
97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
...
[TTS]Add new tacotron2
3 years ago
huangyuxin
3845804cc9
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
3fd7a7790b
add typehit for updater and evaluator, test=tts
3 years ago
huangyuxin
4e31247633
refacto the code
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin
565a63c5ef
refactor the setup in paddleaudio
3 years ago
huangyuxin
eb91ce84f9
refactor the version
3 years ago
Hui Zhang
4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
...
[CLI] asr, Add Deepspeech2 online and offline model
3 years ago
Hui Zhang
d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
...
[ST] beam search with optimality guarantees
3 years ago
huangyuxin
ab759b16de
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin
38edfd1a89
Add Deepspeech2 online and offline in cli
3 years ago
TianYuan
d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts ( #1349 )
3 years ago
TianYuan
9c7f0762b0
update racotron2 and transformer tts, test=tts
3 years ago
huangyuxin
8028f33b7f
synchronize the version
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
TianYuan
a1867c20c3
fix slice bug of speedyspeech expand, test=tts ( #1337 )
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
TianYuan
7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan
acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan
caa391f461
fix speedyspeech inference, test=tts ( #1322 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
...
[TTS] Add support for finetuning speedyspeech
3 years ago
Jerryuhoo
111a452378
Fix the code format, test=tts
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
huangyuxin
fe1dc9d211
refactor the cli/st, test=st
3 years ago
TianYuan
27bb76bdb9
fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo
be99807d61
Add durations to gen_gta_mel.py inference
3 years ago
KP
52a8b2f320
Add ECAPA_TDNN. ( #1301 )
3 years ago
Jerryuhoo
fcc34e3e95
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_metod and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
KP
c09466ebbe
Add ECAPA_TDNN. ( #1295 )
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
TianYuan
73dc0e2535
fix_ning
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago