Commit Graph

489 Commits (1133540682fef94e2baa5eef9288bfa10c82f57c)

Author SHA1 Message Date
ccrrong 378fe5909f add ami diarization pipeline, test=doc
2 years ago
xiongxinlei 48b8cc8937 add score method, test=doc
2 years ago
xiongxinlei ebfe3e6b13 test.py update the CSVDataset, test=doc
2 years ago
xiongxinlei acebfad7b7 change the vector csv.spk_id to csv.label, test=doc
2 years ago
xiongxinlei 57c11dcab0 add some annotations, test=doc
2 years ago
xiongxinlei 30b5b3cb9e add vector csv dataset format, test=doc
2 years ago
TianYuan e366fb6b2f
Merge pull request #1643 from Jackwaterveg/check
2 years ago
huangyuxin ca860e3d2f supplement note
2 years ago
TianYuan 828ee14404 add license and reference for some models, test=doc
2 years ago
xiongxinlei 5b05300e53 train process add new voxceleb and rirs dataset, test=doc
2 years ago
xiongxinlei 965f486dd5 add voxceleb and rirs noise dataset
2 years ago
Hui Zhang 36df70cbe6
Merge pull request #1638 from zh794390558/spx_refactor
2 years ago
TianYuan 5bff096715
Merge pull request #1634 from yt605155624/cnn_decoder
2 years ago
TianYuan 3aec266ca5 add chunk size and pad size in args, test=doc
2 years ago
Hui Zhang cb39777a60 format code
2 years ago
TianYuan 4d7cd0e063 add streaming synthesize, test=tts
2 years ago
liangym 602b0b0da3
Merge pull request #1632 from lym0302/develop
2 years ago
Hui Zhang 61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
2 years ago
WilliamZhang06 2ec8d608bf fixed comments, test=doc
2 years ago
liangym 21c4132eda
Update paddlespeech_client.py
2 years ago
TianYuan 005aa4066c Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
2 years ago
TianYuan 0fc79f474d add CNNDecoder, test=tts
2 years ago
WilliamZhang06 d847fe29cf added online asr engine , test=doc
2 years ago
TianYuan 318edec303
Merge pull request #1613 from yt605155624/restructure_expand
2 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
2 years ago
huangyuxin f47146af49 add docstring, test=asr
2 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
2 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
Honei d60856b1ed
Merge pull request #1614 from Honei/vox12
3 years ago
xiongxinlei ed7113f320 change the vector output to numpy.array
3 years ago
TianYuan bc5ae43d3a restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
Hui Zhang caee809513
Merge pull request #1605 from Honei/vox12
3 years ago
xiongxinlei 5ae57206f3 add paddlespeech vector modules __init__.py
3 years ago
xiongxinlei 2c9dc0c89b add some vector cli comments, test=doc
3 years ago
xiongxinlei ef1bc5e815 vector cli output dim info, test=doc
3 years ago
xiongxinlei d5142e5e15 add vector cli annotation, test=doc
3 years ago
xiongxinlei ad2caf2ccb add speaker verification demo and doc, test=doc
3 years ago
TianYuan 3cc0ec950e
Merge pull request #1604 from lym0302/add_readme
3 years ago
lym0302 829f1e332e update readme, test=doc
3 years ago
xiongxinlei 0f78d25f76 add vector cli batch and pipeline test demo, test=doc
3 years ago
Honei 305bacdcf2
Merge branch 'develop' into vox12
3 years ago
xiongxinlei 0bb67d8b8e add vector cli unit test, test=doc
3 years ago
KP b6e976a860
Merge pull request #1602 from yt605155624/fix_dtype
3 years ago
xiongxinlei 62cbce6915 add vectorwrapper to extract audio embedding
3 years ago
TianYuan 8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
3 years ago
TianYuan 5347dbad3f fix dtype of window of stft, test=tts
3 years ago
TianYuan 342b487383 update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang 4051e7b762 fix compliance test bug, and format
3 years ago
TianYuan 26ef47810d
Merge pull request #1593 from windstamp/npu_dev_20220322
3 years ago
zhangkeliang 59b3de6a6d [NPU] test TransformerTTS with NPU
3 years ago
Jackwaterveg fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
Hui Zhang b5315657ff
Merge pull request #1509 from qingen/cluster
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
TianYuan e5e8b8a129
Merge pull request #1587 from yt605155624/add_vctk_hifigan
3 years ago
TianYuan 6469568d2a update readme for vctk hifigan, test=tts
3 years ago
huangyuxin a4f5a68074 fix some format, test=asr
3 years ago
xiongxinlei d85d1deef5 exec pre-commit in paddlespeech vector, test=doc
3 years ago
xiongxinlei 9874fb7d75 add some comments in code
3 years ago
huangyuxin e991d82ae7 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
huangyuxin d53e1163a6 update the code, test=asr
3 years ago
xiongxinlei b9eafddd94 change - to _ to distinguish field
3 years ago
xiongxinlei 9c6735f921 add vector voxceleb12 base mode url, test=doc
3 years ago
xiongxinlei d28ccfa96b add vector cli component, test=doc
3 years ago
KP 831cadacc7 Add paddleaudio doc.
3 years ago
TianYuan 5ab2601759 update readme for aishell3 hifigan, test=tts
3 years ago
Hui Zhang 6abc5d9f7e format
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
qingen 0f7ede11ef Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen d16ce21d47 [wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei 506d26a957 change the code style to s2t code style, test=doc
3 years ago
xiongxinlei 311fa87a11 add some comments to the code
3 years ago
Hui Zhang 90deeca06f
Merge pull request #1554 from lym0302/develop
3 years ago
lym0302 89457b273a modify, test=doc
3 years ago
xiongxinlei 8ed5c287a3 add vox2 data into VoxCeleb class
3 years ago
lym0302 77bad44e8b modify readme, test=doc
3 years ago
lym0302 8ef92a9495 modify, test=doc
3 years ago
lym0302 89dbda58f6 add cls static model, test=doc
3 years ago
Hui Zhang 40ab05a462
Merge pull request #1552 from yt605155624/format_syn
3 years ago
lym0302 5187df847f modify server demo, test=doc
3 years ago
xiongxinlei 584a2c0e39 add ecapa-tdnn config yaml file
3 years ago
lym0302 0a6602c708 modify application.yaml, test=doc
3 years ago
TianYuan 544c372b50 fix cr, test=tts
3 years ago
lym0302 99fa7a8205 add server cls, test=doc
3 years ago
TianYuan fe8bf2a38c format synthesize, test=tts
3 years ago
xiongxinlei 993d6783d7 remove unused code, test=doc
3 years ago
xiongxinlei 0e87037f2c refactor to compilance paddleaudio
3 years ago
xiongxinlei 4473405f82 merge develop to vox12, test=doc
3 years ago
Honei 0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei 60d73bb7bd add state 0 to prepare the voxcele data and augment data
3 years ago
xiongxinlei 14efbf5b15 check extract embedding result, test=doc
3 years ago
xiongxinlei 386ef3f161 add voxceleb augment unit test, test=doc
3 years ago
Hui Zhang 5147163592
Merge pull request #1544 from yt605155624/add_vctk_hifigan
3 years ago
TianYuan 81d964f0a0 add vctk hifigan, test=tts
3 years ago
xiongxinlei 2d89c80e6f add waveform augment pipeline, test=doc
3 years ago
lym0302 3b304544f6 modify yaml, test=doc
3 years ago
xiongxinlei ac4967e204 optimize the data prepare process
3 years ago
xiongxinlei 016ed6d69c repair the code according to the part comment, test=doc
3 years ago
Hui Zhang 2886ab9373
Merge pull request #1530 from lym0302/server_cli
3 years ago