Commit Graph

1051 Commits (e18170228cceb275c39accdc3da448ea74022849)

Author SHA1 Message Date
lym0302 be21aed09b trans remove file way, test=doc
3 years ago
lym0302 b1f9b8016d add start and end request on ws tts, test=doc
3 years ago
xiongxinlei 347af638e2 changet vector train.py local_rank to rank, test=doc
3 years ago
lym0302 d4f863dc97 improve, test=doc
3 years ago
pollyyan 018dda6ee9
Merge pull request #1879 from QingshuChen/develop
3 years ago
Hui Zhang c23a97e242
Merge pull request #1877 from Jackwaterveg/develop
3 years ago
Hui Zhang 5b053cde6a
Merge pull request #1878 from Honei/develop
3 years ago
xiongxinlei 06bea5f03d update the vector and text readme, test=doc
3 years ago
QingshuChen e55177c3db speedyspeech support kunlun
3 years ago
root 9f389a7a33 support cpu, test=asr
3 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
3 years ago
TianYuan 4b7786f2ed add vits network scripts, test=tts
3 years ago
KP 19d015b60a Add RFT for asr task.
3 years ago
KP da08f1c1af Add RFT for asr task.
3 years ago
Hui Zhang 12ae137c83 update tts_api for ws
3 years ago
Hui Zhang 175c67b75e asr socket to asr api
3 years ago
Hui Zhang 7be6b0e8cf unify name style & frame with abs timestamp
3 years ago
Hui Zhang 15b25199c2
Merge pull request #1864 from zh794390558/doc
3 years ago
xiongxinlei bb0db29d7e update the streaming asr readme, test=doc
3 years ago
root 4d7046d244 updata released model info, test=doc
3 years ago
liangym e7a35485e4
Merge pull request #1859 from lym0302/update_readme
3 years ago
Hui Zhang 02e7586394 update readme
3 years ago
lym0302 b361a73888 improve server code, test=doc
3 years ago
Hui Zhang 94aaa61726
Merge pull request #1858 from KPatr1ck/cli_version
3 years ago
KP 677898ab96 Add version command in cli.
3 years ago
Hui Zhang 13503613b4
Merge pull request #1853 from Jackwaterveg/develop
3 years ago
root 3a7896fc96 update cli, test=asr
3 years ago
liangym e87495f045
[server] update readme (#1851)
3 years ago
Hui Zhang 37c6106ee0
Merge pull request #1848 from zh794390558/spx
3 years ago
Hui Zhang 8522b82999 format
3 years ago
xiongxinlei b7a77eebca update the time stamp type, test=doc
3 years ago
Honei 43582f5091
Merge branch 'develop' into asr_time
3 years ago
Hui Zhang d99e99ce2c
Merge pull request #1836 from Honei/punc
3 years ago
Hui Zhang 435e86b335
Merge pull request #1835 from Honei/vec_server
3 years ago
xiongxinlei 10da21a77b update the vector cli for server, test=doc
3 years ago
xiongxinlei 2ab96187aa streaming asr server add time stamp, test=doc
3 years ago
xiongxinlei c78653850b join streaming asr and punc server, test=doc
3 years ago
xiongxinlei 3950557e04 update the vector server note, test=doc
3 years ago
xiongxinlei b1dddddbe0 add vector server, test=doc
3 years ago
Jerryuhoo fba0693a20 fix random speaker embedding bug, test=tts
3 years ago
Hui Zhang cdb9a1b20b
Merge pull request #1813 from Honei/v0.3
3 years ago
Honei ff7dbcc2de
Merge branch 'develop' into v0.3
3 years ago
xiongxinlei f7af037cb1 add the note for offline asr, test=doc
3 years ago
xiongxinlei 3f80464926 update the streaming asr readme, test=doc
3 years ago
Hui Zhang fc96130fdc fix speechx core dump when stop immediately after start
3 years ago
xiongxinlei c5fe181405 update the paddlespeech_client asr_online cli, test=doc
3 years ago
huangyuxin 4494f5a1fc add cli models, test=doc
3 years ago
Hui Zhang 903cc67a4d
Merge pull request #1801 from Honei/v0.3
3 years ago
xiongxinlei e844e0e0bb update the streaming output and punc default ip, port, test=doc
3 years ago
huangyuxin 18197cd3a5 renew ds2 model, test=doc
3 years ago
Hui Zhang ebde26030b patch func to var
3 years ago
Honei f72cbc9b6d
Merge branch 'develop' into v0.3
3 years ago
xiongxinlei 9125cb076d update the ws asr response, final_result to result, test=doc
3 years ago
xiongxinlei 7007b0ecac update the asr server api, test=doc
3 years ago
Hui Zhang 5e23025c31 fix speechx ws server to return dummpy partial result, fix hang for ws client
3 years ago
Hui Zhang d7c8c1779f
Merge pull request #1786 from Jackwaterveg/debug
3 years ago
Hui Zhang 9cc7662512
Merge pull request #1782 from lym0302/add_streaming_cli
3 years ago
huangyuxin e145b26355 fix
3 years ago
huangyuxin 4f9e8bfa90 renew ds2 online, test=doc
3 years ago
xiongxinlei 833900a8b4 asr client add punctuatjion server, test=doc
3 years ago
KP abb15ac6e8 Update KWS example.
3 years ago
lym0302 651012616a add info, test=doc
3 years ago
Hui Zhang 33ca17359f
Merge pull request #1776 from Jackwaterveg/ds2
3 years ago
huangyuxin 0df8d80833 remove logfbank from python_speech_features, test=asr
3 years ago
Honei 119143d0f1
Merge pull request #1731 from qingen/cluster
3 years ago
huangyuxin fcdaef6cb4 replace fbank, test=asr
3 years ago
Hui Zhang f11855415c
Merge pull request #1770 from Jackwaterveg/cli
3 years ago
Hui Zhang 3b0004345c
Merge pull request #1772 from Honei/v0.3
3 years ago
Hui Zhang 962a278996
Merge pull request #1558 from KPatr1ck/kws
3 years ago
liangym 0de4d25ab8
Merge pull request #1774 from lym0302/add_streaming_cli
3 years ago
huangyuxin 1e999c27e9 fix exit, test=doc
3 years ago
lym0302 dc52c313fa fix code, test=doc
3 years ago
lym0302 c6e6210964 code format, test=tts
3 years ago
xiongxinlei 9e50448039 update the punc text model, text=doc
3 years ago
lym0302 88adcaa6dc fix code, test=doc
3 years ago
TianYuan f256bb9c0e
Merge pull request #1771 from lym0302/add_streaming_cli
3 years ago
KP caa8eb4d0d Add KWS example.
3 years ago
KP f9761d532c Add KWS example.
3 years ago
KP b60b1dadde Add KWS example.
3 years ago
KP e01abc5099 Add KWS example.
3 years ago
KP 521e222db8 Add mdtc model.
3 years ago
xiongxinlei 2fa1522bdd update the punc yaml to application.yaml, test=doc
3 years ago
xiongxinlei ba62b85e9b add text punc server, test=doc
3 years ago
lym0302 c00c31594c updata readme, test=doc
3 years ago
lym0302 70424e1ef9 add streaming tts demos, test=doc
3 years ago
huangyuxin c21c3d220d fix infer, test=doc
3 years ago
Hui Zhang 312fc4e11e
Merge pull request #1766 from Jackwaterveg/fix
3 years ago
TianYuan 0b1b573a3f
Merge pull request #1767 from Jackwaterveg/cli
3 years ago
huangyuxin ad4e04fc82 add conformer_online_aishell, test=doc
3 years ago
huangyuxin 12d2f6ea95 fix conformer_aishell of cli, test=doc
3 years ago
huangyuxin 5912ba53e4 fix log_interval and lr when resume training, test=asr
3 years ago
Hui Zhang 91e24b0480 format code
3 years ago
xiongxinlei 13a37b4892 update the online protocal note, test=doc
3 years ago
xiongxinlei 2f2cb7eaaf update the audio handler note, test=doc
3 years ago
xiongxinlei 7aa5a3df2b fix the streaming asr server bug, server client, test=doc
3 years ago
huangyuxin 19998ea29b add aishell conformer, test=doc
3 years ago
TianYuan 24f0a7d44b
Merge pull request #1733 from lym0302/tts_stream
3 years ago
TianYuan 9121dfc046
Merge pull request #1752 from yt605155624/fix_wavernn
3 years ago
TianYuan 08a4673355 fix wavernn bug, test=tts
3 years ago
Jackwaterveg 85b50c4700
Merge pull request #1741 from Jackwaterveg/debug
3 years ago
lym0302 104c7ff27d code format, test=doc
3 years ago
lym0302 e398fe9c74 remove code, test=doc
3 years ago
Hui Zhang 3561875dd0 Merge branch 'develop' into fix
3 years ago
Hui Zhang c7d9b11529 format
3 years ago
huangyuxin 8e37a7c7f0 remove redundant log, test=doc
3 years ago
xiongxinlei 56751a1ed5 update the server device to paddle.device, test=doc
3 years ago
xiongxinlei 4b76a01c85 update en readme.md, test=doc
3 years ago
xiongxinlei 1a0c2bea5d add streaming asr demo, test=doc
3 years ago
lym0302 4e9db4ff71 add onnx tts engine, test=doc
3 years ago
Jackwaterveg 8d1ee8262e
Merge branch 'develop' into CER
3 years ago
qingen e98845d778 [vec][loss] add GE2E to support unlabeled data training, test=doc fix #1730
3 years ago
qingen 0186f522af
Merge pull request #1725 from qingen/database-search
3 years ago
TianYuan e089268642
Merge pull request #1727 from yt605155624/refactor_syn_util
3 years ago
TianYuan 4646f7cc8d add paddle device set for ort and inference, test=doc
3 years ago
Hui Zhang 523d5bd6d4
Merge pull request #1723 from yt605155624/refactor_syn_util
3 years ago
qingen 7e8f9f5336 [vec][layer] add GRL to domain adaptation, test=doc fix #1724
3 years ago
TianYuan c74fa9ada8 restructure syn_utils.py, test=tts
3 years ago
qingen 26d5dded7c
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
qingen 6a7245657f [vec][loss] add FocalLoss to deal with class imbalances, test=doc fix #1721
3 years ago
qingen 9382ad8a16
Merge pull request #1719 from qingen/cluster
3 years ago
Hui Zhang cf9a590fa5
Merge pull request #1704 from Honei/server
3 years ago
xiongxinlei ac9fcf7f4a fix the asr infernece model, paddle.no_grad, test=doc
3 years ago
xiongxinlei ff4ddd229e fix the unuseful code, test=doc
3 years ago
xiongxinlei 9c03280ca6 remove debug info, test=doc
3 years ago
xiongxinlei 48fa84bee9 fix the asr online client bug, return None, test=doc
3 years ago
qingen 00febff734 [vec][loss] update docstring, test=doc fix #1717
3 years ago
xiongxinlei babac27a79 fix ds2 online edge bug, test=doc
3 years ago
liangym ab656aab57
Merge pull request #1713 from lym0302/tts_stream
3 years ago
xiongxinlei dcab04a799 merge develop to server
3 years ago
xiongxinlei f56dba0ca7 fix the code format, test=doc
3 years ago
Honei 55122cfc86
Merge branch 'develop' into server
3 years ago
TianYuan 7c0ec3c249
Merge pull request #1716 from yt605155624/update_cli
3 years ago
xiongxinlei 380afbbc5d add ds2 model multi session, test=doc
3 years ago
qingen 166757703f [vec][loss] add NCE Loss from RNNLM, test=doc fix #1717
3 years ago
lym0302 9e41ac8550 code format, test=doc
3 years ago
qingen 880829fe89
Merge pull request #1681 from qingen/cluster
3 years ago
TianYuan a44f5c099e update cli, test=doc
3 years ago
lym0302 40dde22fc4 code format, test=doc
3 years ago
huangyuxin 6e80618e3d add ds2
3 years ago
xiongxinlei 5acb0b5252 fix the websocket chunk edge bug, test=doc
3 years ago
Hui Zhang b78bc6375b
Merge pull request #1712 from yt605155624/add_cnndecoder_onnx
3 years ago
xiongxinlei 05a8a4b5fc add connection stability, test=doc
3 years ago
lym0302 00a6236fe2 remove test code, test=doc
3 years ago
lym0302 9c0ceaacb6 add streaming am infer, test=doc
3 years ago
xiongxinlei 68731c61f4 add multi session result, test=doc
3 years ago
xiongxinlei 10e825d9b2 check chunk window process, test=doc
3 years ago
qingen 159d8fd628
Merge branch 'develop' into cluster
3 years ago
xiongxinlei d2640c1406 add mult sesssion process, test=doc
3 years ago
TianYuan dafe7c3657 add fastspeech2 cnndecoder onnx model, test=tts
3 years ago
qingen deb3ba070b [vec] update mata info, test=doc
3 years ago
xiongxinlei 97d31f9aac update the attention_rescoring method, test=doc
3 years ago
xiongxinlei 0c5dbbee5b add conformer ctc prefix beam search decoding method, test=doc
3 years ago
Honei 9d20a10b5a
Merge branch 'develop' into server
3 years ago
Hui Zhang 0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
3 years ago
xiongxinlei 3ce4301665 add asr websocket server note, test=doc
3 years ago
xiongxinlei efc269b75f remove unuseful code, test=doc
3 years ago
xiongxinlei 89b102a7dd fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei d21ccd0287 add conformer online server, test=doc
3 years ago
Hui Zhang c7b987c55d format
3 years ago
Hui Zhang ec469179bf
Merge pull request #1696 from qingen/database-search
3 years ago
Hui Zhang 72933abc70
Merge pull request #1701 from WilliamZhang06/web
3 years ago
xiongxinlei af484fc980 convert websockert results to str from bytest, test=doc
3 years ago
WilliamZhang06 39895f6a25 added online asr doc and online asr command line, test=doc
3 years ago
qingen 240520c0ca
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
TianYuan 8bebf81199 [doc]fix typo, test=doc
3 years ago
TianYuan 98f67870ea
Merge pull request #1693 from yt605155624/fix_ss_NHWC
3 years ago
buchongyu 48358055d0 fix the spelling of the word "hack"
3 years ago
qingen d3f8715b0a Merge branch 'database-search' of github.com:qingen/PaddleSpeech into database-search
3 years ago
qingen 89a0ec9018 [vec][server] vpr demo support, test=doc fix #1695
3 years ago
TianYuan 8b801ca18b change NLC to NCL in speedyspeech, test=tts
3 years ago
WilliamZhang06 1dc02c7295 added online web client, test=doc
3 years ago
Hui Zhang 1759116bd7
Revert "[WebSocket] fixed online model md5 error , test=doc"
3 years ago
xiongxinlei d1935d8552 add vector necessary note, test=doc
3 years ago
lym0302 9d0224460b code format, test=doc
3 years ago
lym0302 4b111146dc code format, test=doc
3 years ago
qingen 0d8e2deb61
Merge branch 'PaddlePaddle:develop' into cluster
3 years ago
Honei 48e0177767
Merge pull request #1630 from Honei/vox12
3 years ago
qingen fc72295334
Merge pull request #1651 from ccrrong/ami
3 years ago
xiongxinlei 4af007c3fc fix vector ips log bug, test=doc
3 years ago
lym0302 82992b3ed6 add test code, test=doc
3 years ago
qingen 8d9bd9a93a [vec][score] update Copyright, test=doc fix #1667
3 years ago
xiongxinlei 567286add3 wrap the embedding mean and std norm, test=doc
3 years ago
Hui Zhang d65b63b28d
Merge pull request #1652 from lym0302/tts_stream
3 years ago
qingen 44c6623448 [vec][score] update plda model, test=doc fix #1667
3 years ago
ccrrong bc53f726fe convert dataset format to paddlespeech, test=doc
3 years ago
Hui Zhang 2f97b81346
Merge pull request #1682 from WilliamZhang06/ws-develop
3 years ago
root 9dacfb405f fixed online model md5 error , test=doc
3 years ago
qingen 6446f72cab [vec][score] add plda model, test=doc fix #1667
3 years ago
qingen 84576d6956 [vec][score] add plda model, test=doc fix #1667
3 years ago
lym0302 1a3c811f04 code format, test=doc
3 years ago
TianYuan 0d6f5868ea
Merge pull request #1665 from yt605155624/add_onnx
3 years ago
Honei f500fa8bde
Merge pull request #1646 from Honei/develop
3 years ago
TianYuan 0282d45c62 remove fill_constant_batch_size_like in static model of speedyspeech, test=tts
3 years ago
TianYuan c765fca6b4 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_onnx
3 years ago
TianYuan 124eb6af8f update notes, test=doc
3 years ago
TianYuan e0d222e674 update notes, test=doc
3 years ago
Hui Zhang 1843bed458
Merge pull request #1666 from Jackwaterveg/cli
3 years ago
xiongxinlei a8244dc5b0 update the note, test=doc
3 years ago
Jackwaterveg c852776bc6
test=doc
3 years ago
TianYuan f264b912fc add warmup for frontend, test=doc
3 years ago
Jackwaterveg 4922e697e1
update cli, test = asr
3 years ago
Jackwaterveg 1c05d03806
test=asr
3 years ago
xiongxinlei 9b5f7f71ac add part ecapa-tdnn note, test=doc
3 years ago
Hui Zhang 6eed542c08
Merge pull request #1660 from yt605155624/fix_pre
3 years ago
Honei 83310b6379
Merge branch 'develop' into develop
3 years ago
huangyuxin faf21f033f add duration limitation for asr
3 years ago
TianYuan 7aecb2c4bb add onnx inference for fastspeech2 + hifigan/mb_melgan, test=tts
3 years ago
xiongxinlei d064c8196e update the speaker verification model, test=doc
3 years ago
xiongxinlei e72912adb9 update the speaker verification model, test=doc
3 years ago
TianYuan a8f5990869 fix preprocess bug, test=tts
3 years ago
lym0302 759a9e61e4 update server cli, test=doc
3 years ago
lym0302 603e565ab1 add stream tts server, test=doc
3 years ago
ccrrong 378fe5909f add ami diarization pipeline, test=doc
3 years ago
xiongxinlei 48b8cc8937 add score method, test=doc
3 years ago
xiongxinlei ebfe3e6b13 test.py update the CSVDataset, test=doc
3 years ago
xiongxinlei acebfad7b7 change the vector csv.spk_id to csv.label, test=doc
3 years ago
xiongxinlei 57c11dcab0 add some annotations, test=doc
3 years ago
xiongxinlei 30b5b3cb9e add vector csv dataset format, test=doc
3 years ago
TianYuan e366fb6b2f
Merge pull request #1643 from Jackwaterveg/check
3 years ago
huangyuxin ca860e3d2f supplement note
3 years ago
TianYuan 828ee14404 add license and reference for some models, test=doc
3 years ago
xiongxinlei 5b05300e53 train process add new voxceleb and rirs dataset, test=doc
3 years ago
xiongxinlei 965f486dd5 add voxceleb and rirs noise dataset
3 years ago
Hui Zhang 36df70cbe6
Merge pull request #1638 from zh794390558/spx_refactor
3 years ago
TianYuan 5bff096715
Merge pull request #1634 from yt605155624/cnn_decoder
3 years ago
TianYuan 3aec266ca5 add chunk size and pad size in args, test=doc
3 years ago
Hui Zhang cb39777a60 format code
3 years ago
TianYuan 4d7cd0e063 add streaming synthesize, test=tts
3 years ago
liangym 602b0b0da3
Merge pull request #1632 from lym0302/develop
3 years ago
Hui Zhang 61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
3 years ago
WilliamZhang06 2ec8d608bf fixed comments, test=doc
3 years ago
liangym 21c4132eda
Update paddlespeech_client.py
3 years ago
TianYuan 005aa4066c Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into cnn_decoder
3 years ago
TianYuan 0fc79f474d add CNNDecoder, test=tts
3 years ago
WilliamZhang06 d847fe29cf added online asr engine , test=doc
3 years ago
TianYuan 318edec303
Merge pull request #1613 from yt605155624/restructure_expand
3 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
3 years ago
huangyuxin f47146af49 add docstring, test=asr
3 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
3 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
Honei d60856b1ed
Merge pull request #1614 from Honei/vox12
3 years ago
xiongxinlei ed7113f320 change the vector output to numpy.array
3 years ago
TianYuan bc5ae43d3a restructure expand in length_regulator.py for paddle2onnx, test=tts
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
Hui Zhang caee809513
Merge pull request #1605 from Honei/vox12
3 years ago
xiongxinlei 5ae57206f3 add paddlespeech vector modules __init__.py
3 years ago
xiongxinlei 2c9dc0c89b add some vector cli comments, test=doc
3 years ago
xiongxinlei ef1bc5e815 vector cli output dim info, test=doc
3 years ago
xiongxinlei d5142e5e15 add vector cli annotation, test=doc
3 years ago
xiongxinlei ad2caf2ccb add speaker verification demo and doc, test=doc
3 years ago
TianYuan 3cc0ec950e
Merge pull request #1604 from lym0302/add_readme
3 years ago
lym0302 829f1e332e update readme, test=doc
3 years ago
xiongxinlei 0f78d25f76 add vector cli batch and pipeline test demo, test=doc
3 years ago
Honei 305bacdcf2
Merge branch 'develop' into vox12
3 years ago
xiongxinlei 0bb67d8b8e add vector cli unit test, test=doc
3 years ago
KP b6e976a860
Merge pull request #1602 from yt605155624/fix_dtype
3 years ago
xiongxinlei 62cbce6915 add vectorwrapper to extract audio embedding
3 years ago
TianYuan 8938483529
Merge pull request #1601 from yt605155624/add_ljspeech_hifigan
3 years ago
TianYuan 5347dbad3f fix dtype of window of stft, test=tts
3 years ago
TianYuan 342b487383 update readme for ljspeech hifigan, test=tts
3 years ago
Hui Zhang 4051e7b762 fix compliance test bug, and format
3 years ago
TianYuan 26ef47810d
Merge pull request #1593 from windstamp/npu_dev_20220322
3 years ago
zhangkeliang 59b3de6a6d [NPU] test TransformerTTS with NPU
3 years ago
Jackwaterveg fcc1762048
Merge pull request #1577 from Jackwaterveg/change_init
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
Hui Zhang b5315657ff
Merge pull request #1509 from qingen/cluster
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
TianYuan e5e8b8a129
Merge pull request #1587 from yt605155624/add_vctk_hifigan
3 years ago
TianYuan 6469568d2a update readme for vctk hifigan, test=tts
3 years ago
huangyuxin a4f5a68074 fix some format, test=asr
3 years ago
xiongxinlei d85d1deef5 exec pre-commit in paddlespeech vector, test=doc
3 years ago
xiongxinlei 9874fb7d75 add some comments in code
3 years ago
huangyuxin e991d82ae7 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into change_init
3 years ago
huangyuxin d53e1163a6 update the code, test=asr
3 years ago
xiongxinlei b9eafddd94 change - to _ to distinguish field
3 years ago
xiongxinlei 9c6735f921 add vector voxceleb12 base mode url, test=doc
3 years ago
xiongxinlei d28ccfa96b add vector cli component, test=doc
3 years ago
KP 831cadacc7 Add paddleaudio doc.
3 years ago
TianYuan 5ab2601759 update readme for aishell3 hifigan, test=tts
3 years ago
Hui Zhang 6abc5d9f7e format
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
qingen 0f7ede11ef Merge branch 'cluster' of github.com:qingen/PaddleSpeech into cluster
3 years ago
qingen d16ce21d47 [wip][vec] update cluster of diarization, test=doc #1304
3 years ago
xiongxinlei 506d26a957 change the code style to s2t code style, test=doc
3 years ago
xiongxinlei 311fa87a11 add some comments to the code
3 years ago
Hui Zhang 90deeca06f
Merge pull request #1554 from lym0302/develop
3 years ago
lym0302 89457b273a modify, test=doc
3 years ago
xiongxinlei 8ed5c287a3 add vox2 data into VoxCeleb class
3 years ago
lym0302 77bad44e8b modify readme, test=doc
3 years ago
lym0302 8ef92a9495 modify, test=doc
3 years ago
lym0302 89dbda58f6 add cls static model, test=doc
3 years ago
Hui Zhang 40ab05a462
Merge pull request #1552 from yt605155624/format_syn
3 years ago
lym0302 5187df847f modify server demo, test=doc
3 years ago
xiongxinlei 584a2c0e39 add ecapa-tdnn config yaml file
3 years ago
lym0302 0a6602c708 modify application.yaml, test=doc
3 years ago
TianYuan 544c372b50 fix cr, test=tts
3 years ago
lym0302 99fa7a8205 add server cls, test=doc
3 years ago
TianYuan fe8bf2a38c format synthesize, test=tts
3 years ago
xiongxinlei 993d6783d7 remove unused code, test=doc
3 years ago
xiongxinlei 0e87037f2c refactor to compilance paddleaudio
3 years ago
xiongxinlei 4473405f82 merge develop to vox12, test=doc
3 years ago
Honei 0dee8f40e9
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
xiongxinlei 60d73bb7bd add state 0 to prepare the voxcele data and augment data
3 years ago
xiongxinlei 14efbf5b15 check extract embedding result, test=doc
3 years ago
xiongxinlei 386ef3f161 add voxceleb augment unit test, test=doc
3 years ago
Hui Zhang 5147163592
Merge pull request #1544 from yt605155624/add_vctk_hifigan
3 years ago
TianYuan 81d964f0a0 add vctk hifigan, test=tts
3 years ago
xiongxinlei 2d89c80e6f add waveform augment pipeline, test=doc
3 years ago
lym0302 3b304544f6 modify yaml, test=doc
3 years ago
xiongxinlei ac4967e204 optimize the data prepare process
3 years ago
xiongxinlei 016ed6d69c repair the code according to the part comment, test=doc
3 years ago
Hui Zhang 2886ab9373
Merge pull request #1530 from lym0302/server_cli
3 years ago
xiongxinlei 1f74af110b add training log info and comment, test=doc
3 years ago
lym0302 e50c1b3b1d add server test, test=doc
3 years ago
xiongxinlei 4648059b5f add training process for sid, test=doc
3 years ago
xiongxinlei 7668f61422 add sid dataloader for training, test=doc
3 years ago
xiongxinlei 6af2bc3d5b add sid loss wraper for voxceleb, test=doc
3 years ago
xiongxinlei 57c4f4a68c add sid learning rate and training model
3 years ago
TianYuan 4d2f2191a8 fix gbk encode bug
3 years ago
Honei 1395b5f5fa
Merge branch 'PaddlePaddle:develop' into develop
3 years ago
TianYuan 175c39b4a4
Merge pull request #1511 from yt605155624/pre_fix_for_streaming
3 years ago
Hui Zhang 5ba4907c44
Merge pull request #1514 from lym0302/server_cli
3 years ago
lym0302 85d4a31e04 update application.yaml, test=doc
3 years ago
Jerryuhoo c116a3a926 fix Speedyspeech multi-speaker inference, test=tts
3 years ago
lym0302 ab04488738 update server cli, test=doc
3 years ago
TianYuan cb07bd2a94 add rtf for synthesize, add more vocoder for synthesize_e2e.sh, test=tts
3 years ago
Hui Zhang 26d413ce8f
Merge pull request #1510 from lym0302/paddlespeech_stats
3 years ago
lym0302 72c0cda30c add paddlespeech_server stats, test=doc
3 years ago
Hui Zhang e8f2d8f11b
Merge pull request #1507 from zh794390558/cli
3 years ago
Hui Zhang 2517df92a0
Merge pull request #1508 from lym0302/paddlespeech_stats
3 years ago
TianYuan b6d33a7fb4
Merge pull request #1506 from yt605155624/fix_frontend
3 years ago
lym0302 395c923dee modified text sr to lang, test=doc
3 years ago
Hui Zhang 75098698d8 format,test=doc
3 years ago
TianYuan 66a8beb27f update text frontend, test=tts
3 years ago
lym0302 96abb33b5b add __call__, test=doc
3 years ago
lym0302 5f1728f855 rm server related, test=doc
3 years ago
xiongxinlei 70d3b01c0d remove invalid code
3 years ago
xiongxinlei d7da629302 add kaldi feats egs dataset
3 years ago
xiongxinlei 6f7e9656fe add kaldi feats ark dataset
3 years ago
lym0302 35357e775e update, test=doc
3 years ago
lym0302 e5aa24fa5a resolve setup.py conflicts, test=doc
3 years ago
lym0302 fe6be4a65e Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into paddlespeech_stats
3 years ago
lym0302 f8375764b9 add paddlespeech stats, test=doc
3 years ago
Hui Zhang 8d474c2658
Merge pull request #1482 from lym0302/servercli_update
3 years ago
lym0302 162361d878 format code, test=doc
3 years ago
lym0302 434708cff4 set device cpu, test=doc
3 years ago
lym0302 920b2c808c paras required, test=doc
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
lym0302 6b2dd16845 update server cli, test=doc
3 years ago
WilliamZhang06 78c9b7342c deleted wav file , test=doc
3 years ago
WilliamZhang06 a6ec3a26f1
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06 8b4602f738 added isinstance code, test=doc
3 years ago
lym0302 bb60561c66 update util, test=doc
3 years ago
WilliamZhang06 147018a8b4 added cli changed code, test=doc
3 years ago
lym0302 332009142b add server demo, test=doc
3 years ago
WilliamZhang06 7ebe904e20 fixed overload , test=doc
3 years ago
Hui Zhang 60c0877e7a
Merge pull request #1472 from KPatr1ck/cli_batch
3 years ago
WilliamZhang06 b8f16ac9b0
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06 da3ea7bb40 added engine type and asr inference , test=doc
3 years ago
Hui Zhang 49f80afe6a
Merge pull request #1381 from PaddlePaddle/server
3 years ago
lym0302 b508c4d0cb add readme, test=doc
3 years ago
KP d36a4ccfc8 Add cli logger control.
3 years ago
KP 94ed5969fa Add cli logger control.
3 years ago
lym0302 42cbe313c2 improve cli code, test=doc
3 years ago
lym0302 2bf4b4521f add cli, test=doc
3 years ago
lym0302 8fd117e4da add cli, test=doc
3 years ago
lym0302 80b83b7434 add cli, test=doc
3 years ago
KP 7814fba07f Update batch input.
3 years ago
KP 05288fe1c3 Update batch input and stdin input.
3 years ago
KP 1818b058aa Support batch input in cls task.
3 years ago
WilliamZhang06 35e3be9ac8 Merge remote-tracking branch 'remote/develop' into server
3 years ago
TianYuan ae521d3700
Update infer.py
3 years ago
lym0302 07158b2f12 move dir, test=doc
3 years ago
lym0302 76391275fc move dir, test=doc
3 years ago
TianYuan 67ec6242c3 fix ci for waveflow, test=tts
3 years ago
TianYuan f51097618b Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
3 years ago
TianYuan fc8c0e3ea2 fix ci for waveflow, test=tts
3 years ago
huangyuxin 95d5274aef fix sortagrad, test=asr
3 years ago
Hui Zhang 718c849f68
Merge pull request #1445 from yt605155624/update_train
3 years ago
Hui Zhang f3ec985aaf
Merge pull request #1439 from Jackwaterveg/tipc
3 years ago
TianYuan 4ac7db185e init for all works in train.py when ngpu>1, test=tts
3 years ago
Jackwaterveg 426bae3de1
Merge pull request #1440 from yt605155624/merge_datasets
3 years ago
TianYuan 2cec8f6c76 update tts cli, test=doc
3 years ago
TianYuan 9699c00769 change the docstring style from numpydoc to google, test=tts
3 years ago
huangyuxin aefe9e93a7 add tipc benchmark of conformer
3 years ago
TianYuan 683679bec7 merge data and datasets, test=tts
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
TianYuan 30085ac229 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
3 years ago
TianYuan 25347bb6a3 rename tacotron2, test=tts
3 years ago
huangyuxin 9a55783aa0 fix resample
3 years ago
Hui Zhang dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
3 years ago
TianYuan 0747600c95
[TTS]add ljspeech new tacotron2 (#1416)
3 years ago
TianYuan 348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts (#1419)
3 years ago
huangyuxin f428ec4431 change log of cli/asr/infer
3 years ago
TianYuan 1b0c034134 update wavernn, test=tts
3 years ago
TianYuan 89e69ee10e
[TTS]fix tacotron2 dygraph to static (#1414)
3 years ago
huangyuxin 2a42421a63 cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Hui Zhang 4128f4d61f
fix __version__ error in develop (#1398)
3 years ago
TianYuan 001afee644 fix wavernn dygraph to static , test=tts
3 years ago
TianYuan 2844f388dc
[doc ]add tacotron2 readme (#1385)
3 years ago
TianYuan 2071774d81 add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan 1cc7905d51 rm csmsc.py, test=tts
3 years ago
TianYuan 4c3e57a23c align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg f49cf838a8
Update u2.py (#1378)
3 years ago
TianYuan fb0acd40a2 add wavernn, test=tts
3 years ago
Jackwaterveg d7222c0453
[ASR] Support CTC decoder online (#821)
3 years ago
Jerryuhoo f515416c4a fix missing model choice, test=doc
3 years ago
Jerryuhoo a22080130b Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang 97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
3 years ago
huangyuxin 3845804cc9 Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan 96323816e9 fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan 1bf1a876ae Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan 3fd7a7790b add typehit for updater and evaluator, test=tts
3 years ago
huangyuxin 4e31247633 refacto the code
3 years ago
TianYuan 41d24337cb fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan 1a9e59612a fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin 565a63c5ef refactor the setup in paddleaudio
3 years ago
huangyuxin eb91ce84f9 refactor the version
3 years ago
Hui Zhang 4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
3 years ago
Hui Zhang d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
3 years ago
huangyuxin ab759b16de Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin 38edfd1a89 Add Deepspeech2 online and offline in cli
3 years ago
TianYuan d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts (#1349)
3 years ago
TianYuan 9c7f0762b0 update racotron2 and transformer tts, test=tts
3 years ago
huangyuxin 8028f33b7f synchronize the version
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
Junkun 43aad7a018 beam search with optimality guarantees
3 years ago
Jackwaterveg 26524031d2
Merge pull request #1343 from Jackwaterveg/fix
3 years ago
huangyuxin 5e7e8a3e24 fix the u2 export, test=asr
3 years ago
TianYuan a1867c20c3
fix slice bug of speedyspeech expand, test=tts (#1337)
3 years ago
Hui Zhang ec1c88ae1a
[s2t] remove nltk (#1332)
3 years ago
TianYuan 7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan caa391f461
fix speedyspeech inference, test=tts (#1322)
3 years ago
Jackwaterveg 0c4895cd0b
mv the ctcdecoders to third_part (#1313)
3 years ago
TianYuan 8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
3 years ago
Jerryuhoo 111a452378 Fix the code format, test=tts
3 years ago
TianYuan 89e988a69e add csmsc tacotron2, test=tts
3 years ago
TianYuan c088b9a304 add csmsc tacotron2
3 years ago
huangyuxin fe1dc9d211 refactor the cli/st, test=st
3 years ago
TianYuan 27bb76bdb9 fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo be99807d61 Add durations to gen_gta_mel.py inference
3 years ago
KP 52a8b2f320
Add ECAPA_TDNN. (#1301)
3 years ago
Jerryuhoo fcc34e3e95 [tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
3 years ago
Jackwaterveg 010aa65b2b
[cli] asr - support English, decode_metod and unified config (#1297)
3 years ago
KP c09466ebbe
Add ECAPA_TDNN. (#1295)
3 years ago
TianYuan fb238d83f4
update vctk voc1, test=tts (#1294)
3 years ago
TianYuan 73dc0e2535 fix_ning
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago