qingen
84576d6956
[vec][score] add plda model, test=doc fix #1667
3 years ago
lym0302
1a3c811f04
code format, test=doc
3 years ago
TianYuan
0d6f5868ea
Merge pull request #1665 from yt605155624/add_onnx
...
[TTS]add onnx inference for fastspeech2 + hifigan/mb_melgan
3 years ago
Honei
f500fa8bde
Merge pull request #1646 from Honei/develop
...
[vec]add speaker verification score method
3 years ago
TianYuan
0282d45c62
remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
3 years ago
TianYuan
c765fca6b4
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
3 years ago
TianYuan
124eb6af8f
update notes, test=doc
3 years ago
TianYuan
e0d222e674
update notes, test=doc
3 years ago
Hui Zhang
1843bed458
Merge pull request #1666 from Jackwaterveg/cli
...
[CLI] ASR: Add duration limitation for asr
3 years ago
xiongxinlei
a8244dc5b0
update the note, test=doc
3 years ago
Jackwaterveg
c852776bc6
test=doc
3 years ago
TianYuan
f264b912fc
add warmup for frontend, test=doc
3 years ago
Jackwaterveg
4922e697e1
update cli, test = asr
3 years ago
Jackwaterveg
1c05d03806
test=asr
3 years ago
xiongxinlei
9b5f7f71ac
add part ecapa-tdnn note, test=doc
3 years ago
Hui Zhang
6eed542c08
Merge pull request #1660 from yt605155624/fix_pre
...
[TTS]fix preprocess bug, test=tts
3 years ago
Honei
83310b6379
Merge branch 'develop' into develop
3 years ago
huangyuxin
faf21f033f
add duration limitation for asr
3 years ago
TianYuan
7aecb2c4bb
add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei
d064c8196e
update the speaker verification model, test=doc
3 years ago
xiongxinlei
e72912adb9
update the speaker verification model, test=doc
3 years ago
TianYuan
a8f5990869
fix preprocess bug, test=tts
3 years ago
lym0302
759a9e61e4
update server cli, test=doc
3 years ago
lym0302
603e565ab1
add stream tts server, test=doc
3 years ago
ccrrong
378fe5909f
add ami diarization pipeline, test=doc
3 years ago
xiongxinlei
48b8cc8937
add score method, test=doc
3 years ago
xiongxinlei
ebfe3e6b13
test.py update the CSVDataset, test=doc
3 years ago
xiongxinlei
acebfad7b7
change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei
57c11dcab0
add some annotations, test=doc
3 years ago
xiongxinlei
30b5b3cb9e
add vector csv dataset format, test=doc
3 years ago
TianYuan
e366fb6b2f
Merge pull request #1643 from Jackwaterveg/check
...
[Doc] supplement note
3 years ago
huangyuxin
ca860e3d2f
supplement note
3 years ago
TianYuan
828ee14404
add license and reference for some models, test=doc
3 years ago
xiongxinlei
5b05300e53
train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei
965f486dd5
add voxceleb and rirs noise dataset
3 years ago
Hui Zhang
36df70cbe6
Merge pull request #1638 from zh794390558/spx_refactor
...
[speechx] refactor audio/data/feature cache
3 years ago
TianYuan
5bff096715
Merge pull request #1634 from yt605155624/cnn_decoder
...
[TTS]Cnn decoder
3 years ago
TianYuan
3aec266ca5
add chunk size and pad size in args, test=doc
3 years ago
Hui Zhang
cb39777a60
format code
3 years ago
TianYuan
4d7cd0e063
add streaming synthesize, test=tts
3 years ago
liangym
602b0b0da3
Merge pull request #1632 from lym0302/develop
...
[server] fix output bug
3 years ago
Hui Zhang
61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
...
[websocket] added online asr engine
3 years ago
WilliamZhang06
2ec8d608bf
fixed comments, test=doc
3 years ago
liangym
21c4132eda
Update paddlespeech_client.py
3 years ago
TianYuan
005aa4066c
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan
0fc79f474d
add CNNDecoder, test=tts
3 years ago
WilliamZhang06
d847fe29cf
added online asr engine , test=doc
3 years ago
TianYuan
318edec303
Merge pull request #1613 from yt605155624/restructure_expand
...
[tts]restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
Hui Zhang
943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
...
[ASR] Replace kaldi_fbank with paddleaudio
3 years ago
huangyuxin
f47146af49
add docstring, test=asr
3 years ago
huangyuxin
ed490b66cb
update spectrogram, test=asr
3 years ago
Hui Zhang
84d712d493
format code, test=doc
3 years ago
Honei
d60856b1ed
Merge pull request #1614 from Honei/vox12
...
[vec]change the vector output to numpy.array
3 years ago
xiongxinlei
ed7113f320
change the vector output to numpy.array
3 years ago
TianYuan
bc5ae43d3a
restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin
0ffe1f9114
replace kaldi_fbank with paddleaudio
3 years ago
Hui Zhang
caee809513
Merge pull request #1605 from Honei/vox12
...
[vec]add speaker verification demo and doc
3 years ago
xiongxinlei
5ae57206f3
add paddlespeech vector modules __init__.py
3 years ago
xiongxinlei
2c9dc0c89b
add some vector cli comments, test=doc
3 years ago
xiongxinlei
ef1bc5e815
vector cli output dim info, test=doc
3 years ago
xiongxinlei
d5142e5e15
add vector cli annotation, test=doc
3 years ago
xiongxinlei
ad2caf2ccb
add speaker verification demo and doc, test=doc
3 years ago
TianYuan
3cc0ec950e
Merge pull request #1604 from lym0302/add_readme
...
[server] update readme
3 years ago
lym0302
829f1e332e
update readme, test=doc
3 years ago
xiongxinlei
0f78d25f76
add vector cli batch and pipeline test demo, test=doc
3 years ago
Honei
305bacdcf2
Merge branch 'develop' into vox12
3 years ago
xiongxinlei
0bb67d8b8e
add vector cli unit test, test=doc
3 years ago
KP
b6e976a860
Merge pull request #1602 from yt605155624/fix_dtype
...
[TTS]fix dtype of window of stft
3 years ago
xiongxinlei
62cbce6915
add vectorwrapper to extract audio embedding
3 years ago
TianYuan
8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
...
[TTS] update readme for ljspeech hifigan
3 years ago
TianYuan
5347dbad3f
fix dtype of window of stft, test=tts
3 years ago
TianYuan
342b487383
update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang
4051e7b762
fix compliance test bug, and format
3 years ago
TianYuan
26ef47810d
Merge pull request #1593 from windstamp/npu_dev_20220322
...
[NPU] Add NPU support for TransformerTTS
3 years ago
zhangkeliang
59b3de6a6d
[NPU] test TransformerTTS with NPU
3 years ago
Jackwaterveg
fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
...
[ASR] change default initializer to kaiming_uniform
3 years ago
huangyuxin
e1b581b622
fix some bug, test=asr
3 years ago
Hui Zhang
b5315657ff
Merge pull request #1509 from qingen/cluster
...
[vec] add clustering of vectors
3 years ago
huangyuxin
6da8465f14
add dist_sampler args, test=asr
3 years ago
TianYuan
e5e8b8a129
Merge pull request #1587 from yt605155624/add_vctk_hifigan
...
[TTS]Add vctk hifigan
3 years ago
TianYuan
6469568d2a
update readme for vctk hifigan, test=tts
3 years ago
huangyuxin
a4f5a68074
fix some format, test=asr
3 years ago
xiongxinlei
d85d1deef5
exec pre-commit in paddlespeech vector, test=doc
3 years ago
xiongxinlei
9874fb7d75
add some comments in code
3 years ago
huangyuxin
e991d82ae7
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
huangyuxin
d53e1163a6
update the code, test=asr
3 years ago
xiongxinlei
b9eafddd94
change - to _ to distinguish field
3 years ago
xiongxinlei
9c6735f921
add vector voxceleb12 base mode url, test=doc
3 years ago
xiongxinlei
d28ccfa96b
add vector cli component, test=doc
3 years ago
KP
831cadacc7
Add paddleaudio doc.
3 years ago
TianYuan
5ab2601759
update readme for aishell3 hifigan, test=tts
3 years ago
Hui Zhang
6abc5d9f7e
format
3 years ago
huangyuxin
ab16d8ce3c
change default initializer to kaiming_uniform, test=asr
3 years ago
qingen
0f7ede11ef
Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen
d16ce21d47
[wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei
506d26a957
change the code style to s2t code style, test=doc
3 years ago
xiongxinlei
311fa87a11
add some comments to the code
3 years ago
Hui Zhang
90deeca06f
Merge pull request #1554 from lym0302/develop
...
[server] add server cls
3 years ago
lym0302
89457b273a
modify, test=doc
3 years ago
xiongxinlei
8ed5c287a3
add vox2 data into VoxCeleb class
3 years ago
lym0302
77bad44e8b
modify readme, test=doc
3 years ago
lym0302
8ef92a9495
modify, test=doc
3 years ago
lym0302
89dbda58f6
add cls static model, test=doc
3 years ago
Hui Zhang
40ab05a462
Merge pull request #1552 from yt605155624/format_syn
...
[TTS]format synthesize
3 years ago
lym0302
5187df847f
modify server demo, test=doc
3 years ago
xiongxinlei
584a2c0e39
add ecapa-tdnn config yaml file
3 years ago
lym0302
0a6602c708
modify application.yaml, test=doc
3 years ago
TianYuan
544c372b50
fix cr, test=tts
3 years ago
lym0302
99fa7a8205
add server cls, test=doc
3 years ago
TianYuan
fe8bf2a38c
format synthesize, test=tts
3 years ago
xiongxinlei
993d6783d7
remove unused code, test=doc
3 years ago
xiongxinlei
0e87037f2c
refactor to compilance paddleaudio
3 years ago
xiongxinlei
4473405f82
merge develop to vox12, test=doc
3 years ago
Honei
0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei
60d73bb7bd
add state 0 to prepare the voxceleb data and augment data
3 years ago
xiongxinlei
14efbf5b15
check extract embedding result, test=doc
3 years ago
xiongxinlei
386ef3f161
add voxceleb augment unit test, test=doc
3 years ago
Hui Zhang
5147163592
Merge pull request #1544 from yt605155624/add_vctk_hifigan
...
[tts]add vctk hifigan egs
3 years ago
TianYuan
81d964f0a0
add vctk hifigan, test=tts
3 years ago
xiongxinlei
2d89c80e6f
add waveform augment pipeline, test=doc
3 years ago
lym0302
3b304544f6
modify yaml, test=doc
3 years ago
xiongxinlei
ac4967e204
optimize the data prepare process
3 years ago
xiongxinlei
016ed6d69c
repair the code according to the part comment, test=doc
3 years ago
Hui Zhang
2886ab9373
Merge pull request #1530 from lym0302/server_cli
...
[server] add server test
3 years ago
xiongxinlei
1f74af110b
add training log info and comment, test=doc
3 years ago
lym0302
e50c1b3b1d
add server test, test=doc
3 years ago
xiongxinlei
4648059b5f
add training process for sid, test=doc
3 years ago
xiongxinlei
7668f61422
add sid dataloader for training, test=doc
3 years ago
xiongxinlei
6af2bc3d5b
add sid loss wrapper for voxceleb, test=doc
3 years ago
xiongxinlei
57c4f4a68c
add sid learning rate and training model
3 years ago
TianYuan
4d2f2191a8
fix gbk encode bug
3 years ago
Honei
1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan
175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
...
[TTS]add rtf for synthesize, add more vocoder for synthesize.sh
3 years ago
Hui Zhang
5ba4907c44
Merge pull request #1514 from lym0302/server_cli
...
[server] update server cli
3 years ago
lym0302
85d4a31e04
update application.yaml, test=doc
3 years ago
Jerryuhoo
c116a3a926
fix Speedyspeech multi-speaker inference, test=tts
3 years ago
lym0302
ab04488738
update server cli, test=doc
3 years ago
TianYuan
cb07bd2a94
add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
Hui Zhang
26d413ce8f
Merge pull request #1510 from lym0302/paddlespeech_stats
...
[server] add paddlespeech_server stats
3 years ago
lym0302
72c0cda30c
add paddlespeech_server stats, test=doc
3 years ago
Hui Zhang
e8f2d8f11b
Merge pull request #1507 from zh794390558/cli
...
[cli] add cli batch/pipe example to readme
3 years ago
Hui Zhang
2517df92a0
Merge pull request #1508 from lym0302/paddlespeech_stats
...
[CLI] modified text sr to lang
3 years ago
TianYuan
b6d33a7fb4
Merge pull request #1506 from yt605155624/fix_frontend
...
[TTS]update text frontend, test=tts
3 years ago
lym0302
395c923dee
modified text sr to lang, test=doc
3 years ago
Hui Zhang
75098698d8
format,test=doc
3 years ago
TianYuan
66a8beb27f
update text frontend, test=tts
3 years ago
lym0302
96abb33b5b
add __call__, test=doc
3 years ago
lym0302
5f1728f855
rm server related, test=doc
3 years ago
xiongxinlei
70d3b01c0d
remove invalid code
3 years ago
xiongxinlei
d7da629302
add kaldi feats egs dataset
3 years ago
xiongxinlei
6f7e9656fe
add kaldi feats ark dataset
3 years ago
lym0302
35357e775e
update, test=doc
3 years ago
lym0302
e5aa24fa5a
resolve setup.py conflicts, test=doc
3 years ago
lym0302
fe6be4a65e
Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
3 years ago
lym0302
f8375764b9
add paddlespeech stats, test=doc
3 years ago
Hui Zhang
8d474c2658
Merge pull request #1482 from lym0302/servercli_update
...
[server] update server cli
3 years ago
lym0302
162361d878
format code, test=doc
3 years ago
lym0302
434708cff4
set device cpu, test=doc
3 years ago
lym0302
920b2c808c
paras required, test=doc
3 years ago
Hui Zhang
6b1fe70100
format code,test=doc
3 years ago
lym0302
6b2dd16845
update server cli, test=doc
3 years ago
WilliamZhang06
78c9b7342c
deleted wav file , test=doc
3 years ago
WilliamZhang06
a6ec3a26f1
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
8b4602f738
added isinstance code, test=doc
3 years ago
lym0302
bb60561c66
update util, test=doc
3 years ago
WilliamZhang06
147018a8b4
added cli changed code, test=doc
3 years ago
lym0302
332009142b
add server demo, test=doc
3 years ago
WilliamZhang06
7ebe904e20
fixed overload , test=doc
3 years ago
Hui Zhang
60c0877e7a
Merge pull request #1472 from KPatr1ck/cli_batch
...
[CLI][Logger]Add cli logger control.
3 years ago
WilliamZhang06
b8f16ac9b0
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
Hui Zhang
49f80afe6a
Merge pull request #1381 from PaddlePaddle/server
...
[server] speech server init version
3 years ago
lym0302
b508c4d0cb
add readme, test=doc
3 years ago
KP
d36a4ccfc8
Add cli logger control.
3 years ago
KP
94ed5969fa
Add cli logger control.
3 years ago
lym0302
42cbe313c2
improve cli code, test=doc
3 years ago
lym0302
2bf4b4521f
add cli, test=doc
3 years ago
lym0302
8fd117e4da
add cli, test=doc
3 years ago
lym0302
80b83b7434
add cli, test=doc
3 years ago
KP
7814fba07f
Update batch input.
3 years ago
KP
05288fe1c3
Update batch input and stdin input.
3 years ago
KP
1818b058aa
Support batch input in cls task.
3 years ago
WilliamZhang06
35e3be9ac8
Merge remote-tracking branch 'remote/develop' into server
3 years ago
TianYuan
ae521d3700
Update infer.py
3 years ago
lym0302
07158b2f12
move dir, test=doc
3 years ago
lym0302
76391275fc
move dir, test=doc
3 years ago
TianYuan
67ec6242c3
fix ci for waveflow, test=tts
3 years ago
TianYuan
f51097618b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
3 years ago
TianYuan
fc8c0e3ea2
fix ci for waveflow, test=tts
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
Hui Zhang
718c849f68
Merge pull request #1445 from yt605155624/update_train
...
[TTS]init for all works in train.py when ngpu>1
3 years ago
Hui Zhang
f3ec985aaf
Merge pull request #1439 from Jackwaterveg/tipc
...
[TIPC]Add tipc_benchmark of conformer
3 years ago
TianYuan
4ac7db185e
init for all works in train.py when ngpu>1, test=tts
3 years ago
Jackwaterveg
426bae3de1
Merge pull request #1440 from yt605155624/merge_datasets
...
[TTS]Merge datasets, change style of docstring
3 years ago
TianYuan
2cec8f6c76
update tts cli, test=doc
3 years ago
TianYuan
9699c00769
change the docstring style from numpydoc to google, test=tts
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
683679bec7
merge data and datasets, test=tts
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
TianYuan
30085ac229
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
3 years ago
TianYuan
25347bb6a3
rename tacotron2, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
Hui Zhang
dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
...
[TTS] add wavernn
3 years ago
TianYuan
0747600c95
[TTS]add ljspeech new tacotron2 ( #1416 )
...
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
TianYuan
348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts ( #1419 )
3 years ago
huangyuxin
f428ec4431
change log of cli/asr/infer
3 years ago
TianYuan
1b0c034134
update wavernn, test=tts
3 years ago
TianYuan
89e69ee10e
[TTS]fix tacotron2 dygraph to static ( #1414 )
...
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix version, test=asr
3 years ago
Hui Zhang
4128f4d61f
fix __version__ error in develop ( #1398 )
3 years ago
TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc ]add tacotron2 readme ( #1385 )
...
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
1cc7905d51
rm csmsc.py, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructor problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Jerryuhoo
f515416c4a
fix missing model choice, test=doc
3 years ago
Jerryuhoo
a22080130b
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang
97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
...
[TTS]Add new tacotron2
3 years ago
huangyuxin
3845804cc9
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
3fd7a7790b
add type hint for updater and evaluator, test=tts
3 years ago
huangyuxin
4e31247633
refactor the code
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin
565a63c5ef
refactor the setup in paddleaudio
3 years ago
huangyuxin
eb91ce84f9
refactor the version
3 years ago
Hui Zhang
4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
...
[CLI] asr, Add Deepspeech2 online and offline model
3 years ago
Hui Zhang
d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
...
[ST] beam search with optimality guarantees
3 years ago
huangyuxin
ab759b16de
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin
38edfd1a89
Add Deepspeech2 online and offline in cli
3 years ago
TianYuan
d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts ( #1349 )
3 years ago
TianYuan
9c7f0762b0
update tacotron2 and transformer tts, test=tts
3 years ago
huangyuxin
8028f33b7f
synchronize the version
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
TianYuan
a1867c20c3
fix slice bug of speedyspeech expand, test=tts ( #1337 )
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
TianYuan
7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan
acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan
caa391f461
fix speedyspeech inference, test=tts ( #1322 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
...
[TTS] Add support for finetuning speedyspeech
3 years ago
Jerryuhoo
111a452378
Fix the code format, test=tts
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
huangyuxin
fe1dc9d211
refactor the cli/st, test=st
3 years ago
TianYuan
27bb76bdb9
fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo
be99807d61
Add durations to gen_gta_mel.py inference
3 years ago
KP
52a8b2f320
Add ECAPA_TDNN. ( #1301 )
3 years ago
Jerryuhoo
fcc34e3e95
[tts] add gen_gta_mel.py for finetuning speedyspeech, test=tts
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_method and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
KP
c09466ebbe
Add ECAPA_TDNN. ( #1295 )
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
TianYuan
73dc0e2535
fix_ning
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
TianYuan
318cc9e539
Merge branch 'develop' into develop
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
KP
a810cd4e5c
Add cli logging. ( #1274 )
3 years ago
Jerryuhoo
d6e9b76e76
change link_wav.py path, test=tts
3 years ago
Jerryuhoo
c94f346207
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
...
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
3 years ago
Jerryuhoo
e239ee1cd2
add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
Hui Zhang
45832f6770
fix default dist_sampler to False
3 years ago
Hui Zhang
3a2db414e6
format code
3 years ago
Hui Zhang
6f651d762e
fix batch sampler set_epoch when epoch starts
3 years ago
TianYuan
680eac02b9
[tts]Update mb melgan ( #1272 )
...
* update mb melgan
* update mb melgan, test=tts
3 years ago
TianYuan
98ce69d0aa
Merge pull request #1259 from jerryuhoo/develop
...
[TTS]Add multi-speaker support for the SpeedySpeech model
3 years ago
huangyuxin
ffadbe22a7
merge the develop, test=asr
3 years ago
JiehangXie
bdc48114a4
Update text_normlization.py
3 years ago
JiehangXie
d88ceef7bc
Fix punctuation bug
...
Fix pause and sentence-splitting problems with the enumeration comma (、) and the English colon
3 years ago
huangyuxin
8b63485ce3
fix some bug, test=asr
3 years ago
JiehangXie
6065b1b607
Fix punctuation bug
...
Fix pause and sentence-splitting problems with the enumeration comma (、) and the English colon
3 years ago
Jerry
0719698118
Merge branch 'develop' into develop
3 years ago
AdamBear
36c9eaa437
Cache the TextFeaturizer instance for infer speed improvement. ( #1260 )
3 years ago
huangyuxin
3e2cc898cb
remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo
2dccd5315d
remove useless "other" dataset
3 years ago
Jerryuhoo
f191d0b022
change speaker embedding position
...
Change speaker embedding position into the encoder.
3 years ago
Jerryuhoo
11991b6d35
add multi-speaker support for speedyspeech
3 years ago
huangyuxin
a1d8ab0f99
merge the develop
3 years ago
huangyuxin
c907a8deda
change all recipes
3 years ago
TianYuan
b9a55262f1
Update fastspeech2.py
3 years ago
Hui Zhang
c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler ( #1242 )
...
* batchsampler or distributebatchsampler
* format
3 years ago
Junkun Chen
420709e5ce
[st] Distributed sampler and new dataloader with MIMO ( #1239 )
...
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
3 years ago
TianYuan
fbe3c05137
add style_melgan and hifigan in tts cli, test=tts ( #1241 )
3 years ago
TianYuan
a232cd8b12
Update fastspeech2.py
3 years ago
huangyuxin
41eeed0450
add librispeech asr1
3 years ago
huangyuxin
fb6d1e2c11
merge the develop
3 years ago
huangyuxin
2c5902d7c5
rename decoding to decode
3 years ago
TianYuan
42c109216d
[tts]add style melgan pretrained model ( #1228 )
...
* add style melgan pretrained model
* add style melgan pretrained model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
Hui Zhang
bb2a370b23
[asr] remove useless conf of librispeech ( #1227 )
...
* remove useless conf
* format code
* update conf
* update conf
* update conf
3 years ago
huangyuxin
c40b6f4062
refactor the train and test config,test=asr
3 years ago
TianYuan
5692b0ff04
fix log for t2s ( #1219 )
3 years ago
TianYuan
b031ee43c4
Merge pull request #1215 from yt605155624/refactor_punc
...
[text]Refactor punc
3 years ago
TianYuan
e1798e1eeb
update
3 years ago
KP
d362d28d35
Remove logging file in cli api.
3 years ago
TianYuan
15b8904fa1
refactor punc
3 years ago
JiehangXie
927c9bbdb6
Fix a bug when the input sentence contains English words
3 years ago
KP
1632af7706
Update examples/esc50. ( #1203 )
3 years ago
Jerryuhoo
3cbfd7bf35
Add speaker embedding and speaker id for style fastspeech2 inference
3 years ago
Hui Zhang
db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch ( #1191 )
3 years ago
Hui Zhang
d852aee2ff
[asr] logfbank with dither ( #1179 )
...
* fix logfbank dither
* format
3 years ago
KP
9ec2bc8e2e
Update README. test=doc_fix
3 years ago
Jackwaterveg
879857332d
[version]add paddlespeech.__version__ ( #1166 )
...
* add paddlespeech.__version__
* version 0.1.0 is ready
3 years ago
TianYuan
19ef7210a0
[TTS]Add hifigan ( #1097 )
...
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
3 years ago
TianYuan
675cff258b
[TTS]fix praatio version, test=tts ( #1158 )
...
* fix praatio version, test=tts
* fix praatio version, test=tts
3 years ago
Jackwaterveg
e9748faa71
[Cli]optimize the cli, add --yes, and delete transformer_aishell ( #1154 )
...
* optimize the cli/asr,test=asr
* test=doc_fix
3 years ago
Jackwaterveg
2bccde3def
update the version of ctcdecoders and feat,test=doc_fix ( #1155 )
3 years ago
Jackwaterveg
0151f2463f
fix bug of pad_sequence in u2,test=asr ( #1153 )
3 years ago
Jackwaterveg
68164dd39f
[asr]rename test_hub to test_wav ( #1132 )
...
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
3 years ago
KP
16d6ed3842
Add automatic_video_subtitiles demo.
3 years ago
KP
7394a18732
Add default arguments in cls python api.
3 years ago
TianYuan
f9efbf3063
Update generate_lexicon.py
3 years ago
Jackwaterveg
5b446f6321
[Config]clear the u2 decode config for asr ( #1107 )
...
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
3 years ago
KP
074559fe90
[CLI][Demo][Text]Refactor punctuation_restoration. ( #1013 )
...
* Refactor punctuation_restoration.
* Add text cli and punc demo.
3 years ago
Hui Zhang
51d7a07c6d
format and fix pre-commit ( #1120 )
3 years ago
TianYuan
5f0f76f249
add eval() for inference model ( #1114 )
3 years ago
TianYuan
59e4a34071
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli
3 years ago
TianYuan
3de4130dfc
update am name
3 years ago
TianYuan
9db1710ba7
add conformer demos ( #1108 )
3 years ago
TianYuan
3fe75f833d
Merge pull request #1109 from yt605155624/tts_cli
...
[cli]update voc name
3 years ago
TianYuan
ca12a83d5a
update voc name
3 years ago
TianYuan
965a57ef0e
Update README.md
3 years ago
Jackwaterveg
9e31a606d1
set default encoding utf8 for win ( #1101 )
...
Co-authored-by: KP <109694228@qq.com>
3 years ago
Hui Zhang
764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang
b1c80c45e0
remove ctc grad norm type in config
3 years ago
huangyuxin
1d4002409f
separate the sox and soxbindings with the requirements
3 years ago
TianYuan
df5fe035e5
Update README.md
3 years ago
TianYuan
a6e0a69da8
Merge pull request #1095 from KPatr1ck/demo
...
[Demo]Add tts demo.
3 years ago
TianYuan
963e906f56
Merge pull request #1068 from yt605155624/add_style_melgan
...
[TTS]add style_melgan
3 years ago
KP
1909f2f620
Add tts demo.
3 years ago
KP
3701fba0be
Update download logic and fix README typos.
3 years ago
TianYuan
f701882b66
update add_style_melgan
3 years ago
gongel
dc60aeb8c2
format
3 years ago
gongel
31510d088c
refactor: rm kaldi_io
3 years ago
TianYuan
2189b46004
add tts cli
3 years ago
KP
70a8a75476
Add st demo.
3 years ago
Hui Zhang
6dedb63e8b
Merge pull request #1087 from Jackwaterveg/setup
...
[ctcdecoders] Separate the ctcdecoders
3 years ago
huangyuxin
9fe0beee54
fix the bug: miss import after install
3 years ago
huangyuxin
cea5ffe0e4
refactor the code
3 years ago
gongel
20d88ec673
refactor: update params/input/output/namestyle
3 years ago
KP
6c1e6e7876
Update recommended model to cnn14 and argument name in __call__.
3 years ago
huangyuxin
ed12db61a6
Separate the ctcdecoders
3 years ago
KP
0b7e0d1e2e
Update tags of pretrained_models.
3 years ago
KP
d08b824d72
Update README.
3 years ago
KP
61e39daccc
Optimize model init.
3 years ago
KP
528c70e515
Remove TODO.
3 years ago
KP
b072453ca8
Fix decompressing problem.
3 years ago
KP
29da318379
Add audio classification cli.
3 years ago
gongel
f5c61ced28
feat: add st cli
3 years ago
Hui Zhang
0818c1601d
add __init__.py
3 years ago
TianYuan
7b2ecb6eed
add style_melgan, test=tts
3 years ago
Hui Zhang
03678c08c5
Merge branch 'develop' into fix_cli
3 years ago
huangyuxin
1b57d05d1b
rm the os.chdir in cli asr
3 years ago
TianYuan
aead853b1d
Update zh_frontend.py
3 years ago
huangyuxin
021311c76b
add transformer to cli infer
3 years ago
TianYuan
a070524d37
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
3 years ago
TianYuan
dd36eafe34
add style_melgan
3 years ago
KP
54cf048b2a
Merge remote-tracking branch 'update_stream/develop' into cli
3 years ago
huangyuxin
a258a34ec0
revise the convert pcm
3 years ago
Jackwaterveg
8ec576f477
Update infer.py
3 years ago
huangyuxin
b0356ae489
revise
3 years ago
huangyuxin
957f2e3a1c
revise
3 years ago
huangyuxin
aee530af27
revise the sample rate
3 years ago
Junkun
4e31a4445d
eval mode
3 years ago
KP
a19e51d7da
Update python api.
3 years ago
KP
e0642ffc77
Update doc strings.
3 years ago
huangyuxin
90d648a601
support using by __call__
3 years ago
huangyuxin
aecb5f567c
Merge branch 'tmp' into 1048
3 years ago
KP
44e9b032d5
Update inputs and outputs of executor.
3 years ago
huangyuxin
3fadcde5e2
revise the asr infer.py
3 years ago
Hui Zhang
4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
...
[benchmark]fix the benchmark
3 years ago
Junkun
3a14b82844
minor
3 years ago
Junkun
f50a2ab4ca
fix bugs
3 years ago
huangyuxin
cb383a39c3
fix the benchmark
3 years ago
huangyuxin
d0bf506fee
fix the load checkpoint
3 years ago
KP
1707244472
Update device usage.
3 years ago
KP
000294132c
Rename s2t to asr.
3 years ago
huangyuxin
43f4d47bfa
add the call in infer.py
3 years ago
Hui Zhang
39228864bb
format code
3 years ago
Hui Zhang
d395c2b8e3
jsonlines reade manifest file
3 years ago
Hui Zhang
7554b6107a
using visualdl; fix read_manifest
3 years ago
huangyuxin
cdc8520969
add the infer
3 years ago
KP
c94ebdc52c
Add python api for executor.
3 years ago
Junkun
d2fab3238b
fix bugs
3 years ago
Junkun
cdd0845127
add translate function
3 years ago
KP
e9798498d6
Update asr inference in paddlespeech.cli.
3 years ago
huangyuxin
895a086fdd
rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
KP
4d39a7746e
Add paddlespeech.cli.
3 years ago
KP
98f0806353
Add paddlespeech.cli.
3 years ago
TianYuan
6e3257ab8a
Create __init__.py
3 years ago
TianYuan
022f1ce8e9
Merge pull request #1040 from yt605155624/fix_frontend
...
[TTS]update text frontend
3 years ago
TianYuan
a861e56e91
rm space for pure Chinese
3 years ago
TianYuan
dad1cbbcd6
update text frontend
3 years ago
KP
6e1ac1cc15
Add paddlespeech.cls and esc50 example.
3 years ago
KP
33f0e7622c
Add paddlespeech.cls and esc50 example.
3 years ago
KP
2c531d78ac
Add paddlespeech.cls and esc50 example.
3 years ago
KP
bdb3ce23ee
Add paddlespeech.cls and esc50 example.
3 years ago
KP
1189117784
Add paddlespeech.cls and esc50 example.
3 years ago
Hui Zhang
2bbfdbae91
Merge pull request #1015 from yt605155624/fs2_conformer
...
[TTS]fastspeech2 conformer
3 years ago
TianYuan
b0a1d8ab60
fix base
3 years ago
TianYuan
469329221b
refactor encoder, rm old code
3 years ago
Hui Zhang
fe83adfbcb
nproc to ngpu
3 years ago
Hui Zhang
789471bfca
test wav for u2
3 years ago
TianYuan
bc0dd51149
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
3 years ago
Jackwaterveg
09931d2ccc
Merge pull request #1019 from zh794390558/feat
...
[bugfix] Kaldi Feature using dither in train
3 years ago
huangyuxin
8aebfeac81
fix the pre-commit
3 years ago
Hui Zhang
56480e1033
fix format
3 years ago
Hui Zhang
7ec0ed4aaf
kaldi feat dither when train
3 years ago
Hui Zhang
2ba3f00bbd
Merge branch 'develop' into datapipe
3 years ago
Hui Zhang
b944418d6f
new format data support ds2/st
3 years ago
Hui Zhang
0defc658e1
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
3 years ago
Hui Zhang
d2a05df02e
Merge pull request #1014 from Jackwaterveg/auto_log
...
[asr]hidden the auto_log
3 years ago
huangyuxin
fb6974f950
update the auto_log
3 years ago
TianYuan
4370c5cfa6
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
3 years ago
Hui Zhang
638b96bf07
check if cmvn_file in config for u2
3 years ago
Hui Zhang
c354e9154b
Merge pull request #1003 from yt605155624/fs2_ge2e
...
[TTS]add fastspeech2 voice cloning in aishell3
3 years ago
TianYuan
133ee7db0b
rename num_speakers
3 years ago
TianYuan
3d5e078c91
add conformer
3 years ago
TianYuan
a97c7b5206
rename spembs
3 years ago
huangyuxin
f646d4c3a1
renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
huangyuxin
ca06b91fc4
renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
Hui Zhang
3bd87bc379
add wenet license
3 years ago
TianYuan
8d025451de
add fastspeech2 voice cloning in aishell3
3 years ago
TianYuan
c5c9f19091
rename to gen_gta_mel.py, remove stats compute when gen finetune data
3 years ago
TianYuan
a6ac497f8e
add multi-band melgan finetune scripts
3 years ago
Hui Zhang
fe29f74a1c
Merge pull request #992 from yt605155624/fix_docs
...
[TTS] add tts tutorial
3 years ago
TianYuan
30d09b411d
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
3 years ago
TianYuan
0bc9450c51
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
3 years ago
Hui Zhang
f9b66d0d97
Remove useless folders ( #990 )
3 years ago
Hui Zhang
2d76638d62
more speech domain
3 years ago
TianYuan
0fcc5005a2
add tts tutorial
3 years ago
Hui Zhang
1ae1ead80f
more install scripts
3 years ago
Hui Zhang
51a6845564
Merge pull request #985 from Jackwaterveg/benchmark
...
revise the benchmark
3 years ago
huangyuxin
843ea1c12e
revise the benchmark
3 years ago
Hui Zhang
080b0431f4
format code
3 years ago
Junkun
7c8843448c
add word reward into beam search.
3 years ago
Hui Zhang
9a71c091c5
remove debug info and format code
3 years ago
Hui Zhang
8b0e344c69
fix logfbank using PCM16
3 years ago
Hui Zhang
7ceef6c3f5
format code
3 years ago
Hui Zhang
f9221b4b74
fix ctc align
3 years ago
Hui Zhang
fb853167d3
format code
3 years ago
Hui Zhang
18d9abc7a0
add sox speed perturb
3 years ago
Hui Zhang
56d06f2aaf
Merge pull request #968 from yt605155624/merge_paddlespeech
...
[TTS] change nprocs to ngpu
3 years ago