Commit Graph

1060 Commits (5c1867a71ef89176999d08aee98312b5d2434a1b)

Author SHA1 Message Date
TianYuan b9ade18055 add onnxruntime infer for cli
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 923b0b873e fix import kws.exps.mdtc
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago
TianYuan b0b3222f9a
Merge pull request #2213 from yt605155624/fix_name_bug
2 years ago
TianYuan 354601d0e9 fix readme, test=doc
2 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
2 years ago
TianYuan 510e240c5c achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan 00e9853f66 add mix tts cli, test=tts
2 years ago
0x45f 4e7106d9e2 Support dy2st
2 years ago
TianYuan 1f128a0817
Merge pull request #2117 from yt605155624/ernie_sat_trainer
2 years ago
TianYuan 1bf78fa5c7 updatte batch_fn train.py, test=doc
2 years ago
TianYuan 9d4161ce5f update config, test=doc
2 years ago
lym0302 e1f8695456 add mix tts, test=tts
2 years ago
Betterman-qs e2dc204d4d update engine_warmup.py, test=tts
2 years ago
Betterman-qs cf1b873528 update engine_warmup.py, test=tts
2 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
2 years ago
0x45f e21cceea51 Remove blank line
2 years ago
0x45f e6ac8881f1 Fix comments
2 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
Hui Zhang d098e027ca
Merge pull request #2155 from Jackwaterveg/develop_dev
2 years ago
0x45f 294b7b00bd Supprot dy2st for conformer
2 years ago
TianYuan 97965f4c37 fix mlm_prob, test=tts
2 years ago
huangyuxin 7c9208765f fix audio,test=doc
2 years ago
huangyuxin 75997d8277 merge
2 years ago
TianYuan 72fa8176ca fix for mix_lang
2 years ago
TianYuan 5503c8bd6b add ernie_sat synthesize script for metadata.jsonl, test=tts
2 years ago
TianYuan f4ac0c79d9
Merge pull request #2143 from lym0302/mix_front
2 years ago
Jackwaterveg ae7a73bc11
Merge pull request #2138 from zh794390558/demos
2 years ago
lym0302 207bb5d93b add mix frontend, test=tts
2 years ago
Hui Zhang e62cbc464e
Merge pull request #2124 from zh794390558/new_api
2 years ago
Hui Zhang 8376f3d40d
Merge pull request #2128 from zh794390558/endpoint
2 years ago
Hui Zhang caaa5cd502 more cli for speech demos
2 years ago
Hui Zhang 1edf120506 fix comment error
2 years ago
Hui Zhang d142d3a7c0 add docstring
2 years ago
Hui Zhang f8450c39e5 rename n_v_s to n_v_b, n_v_ns to n_v_nb
2 years ago
Hui Zhang f4b11b19e5 rename time_s and time_ns to time_b and time_nb
2 years ago
liangym 45f51651bf
Merge pull request #2129 from lym0302/onnx_gpu
2 years ago
lym0302 3d5ed00c60 specify id, test=doc
2 years ago
Hui Zhang 98eed53e6d more accuracy decoding somthing
2 years ago
TianYuan 028742b69a update lr scheduler
2 years ago
TianYuan 94688264c7 add ernie sat model file and config
2 years ago
Hui Zhang e81849277e att cache for streaming asr
2 years ago
Hui Zhang 5ca05fea20 cli batch process support \t
2 years ago
Hui Zhang fb40602d94 refactor attention cache
2 years ago
liangym e153495519
Merge pull request #2122 from yt605155624/rm_server_log
2 years ago
TianYuan 6bbe6de1ec add stream_play_tts.py, test=doc
2 years ago
lym0302 d66d6a05c7 Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into develop
2 years ago
lym0302 5b06b76ebc change sr, test=doc
2 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
2 years ago
huangyuxin 92d1d08b9a fix scripts
2 years ago
TianYuan f6d1c545ac fromat doc_string
2 years ago
TianYuan 4b1f82d312 log redundancy in server
2 years ago
TianYuan e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
2 years ago
TianYuan 496e2dd14b fix Pillow's version
2 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
TianYuan f76bd9fe51
Merge pull request #2109 from raycool/fix_log
2 years ago
TianYuan e10eaa397e
Merge pull request #2100 from Jackwaterveg/develop_dev
2 years ago
huangyuxin 98cfdc4c05 fix nxpu
2 years ago
huguanghui ddf14662ca fix log issue #2070
2 years ago
huguanghui 20a9a67925 fix log issue #2070
2 years ago
TianYuan cf846f9ebc rm extra log
2 years ago
KP 527744d5f0 Fix unnecessary download present in issue #2067.
2 years ago
KP adc7c9b4aa Fix unnecessary download present in issue #2067.
2 years ago
huangyuxin 7463df89c5 fix nxpu
2 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
huangyuxin 429221dc03 adopt multi machine traiing
2 years ago
huangyuxin ac1b301657 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
TianYuan d8a0ba5913
Merge pull request #2085 from yt605155624/fix_tts_cli_log
2 years ago
Jackwaterveg 6598216b2f
Merge branch 'develop' into webdataset
2 years ago
TianYuan c0f126ecd9 fix int32 warning in tts, test=tts
2 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
2 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
TianYuan 5ff885f116 add tts static/onnx models in pretrained_models.py
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
贾晓 0fa3fdb9ee
Merge pull request #2068 from yt605155624/p_norm
2 years ago
TianYuan 7743c6a1ff add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
2 years ago
KP b230dfbdec Add kws cli and demo.
2 years ago
huangyuxin 8f5e61090b new feature: Add webdataset in audio
2 years ago
TianYuan 46ff848d66
Merge pull request #2056 from lym0302/develop
2 years ago
Hui Zhang e04cd18846
Merge pull request #2050 from zh794390558/onnx_quant
2 years ago
lym0302 7d4f320836 fix_model_init, test=doc
2 years ago
TianYuan d1aa83a239
Merge pull request #2052 from yt605155624/ernie_sat
2 years ago
Hui Zhang 54a777055a
Merge pull request #2039 from iftaken/dev-hym
2 years ago
Hui Zhang d20adb5c89
Merge pull request #2048 from KPatr1ck/import_bug
2 years ago
TianYuan 79658a5f20 add ernie sat inference, test=tts
2 years ago
TianYuan 02734141ce
Merge pull request #2040 from yt605155624/add_blank
2 years ago
Hui Zhang d95b0cd9b2 add release and resource
2 years ago
Hui Zhang 3cf1f1f0b5 support onnx quantize
2 years ago
Jackwaterveg 6dfe7273e6
Merge pull request #2045 from zh794390558/wenetspeech_onnx
2 years ago
KP b452be3d8d Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP 220bcffac8 Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP fc5f0b14e0 Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP fe345409bb Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
huangyuxin c552c0877f fix ds2 server config
2 years ago
Hui Zhang d21e6d8adb fix window ms config
2 years ago
Hui Zhang 59a78f2a46 ds2 wenetspeech to onnx and support streaming asr server
2 years ago
huangyuxin 704b5f8bc4 fix win len in ds2 server
2 years ago
Hui Zhang 0f8e9cdd32 add init file
2 years ago
iftaken 357b177232 rename readme and fixed conflict
2 years ago
TianYuan 1731976e4e add blank between characters for vits, test=tts
2 years ago
Hui Zhang 73d702fd6a
Merge branch 'develop' into asr_ol
2 years ago
Hui Zhang 27f2833bf7 format
2 years ago
iftaken cb50999096 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into dev-hym
2 years ago
Hui Zhang 5e03d753ac add ds2 steaming asr onnx
2 years ago
Hui Zhang cd0b82ff1b fix pretrain model error
2 years ago
Hui Zhang 9106daa2a3 code format
2 years ago
Hui Zhang 42d28b961c fix pretrian model error
2 years ago
iftaken 474373bceb Merge branch 'develop' into dev-hym
2 years ago
Hui Zhang 4851d1d3a2 Merge branch 'onnx' into asr_ol
2 years ago
Hui Zhang d66bfefd01 update aishell ds2 streaming onnx model
2 years ago
Hui Zhang ff3b1ff817 opt onnx script
2 years ago
Hui Zhang 3cee7db021 onnx ds2 straming asr
2 years ago
Hui Zhang b4c6a52beb
Merge pull request #2034 from zh794390558/onnx
2 years ago
Hui Zhang c8574c7e35 ds2 inference as sepearte engine for streaming asr
2 years ago
Hui Zhang b9e3e49305 refactor stream asr and fix ds2 stream bug
2 years ago
Jackwaterveg bca014fd92
Merge pull request #2032 from PaddlePaddle/audio_refactoring
2 years ago
Hui Zhang 1628e19bce
Merge branch 'develop' into onnx
2 years ago
KP 4aaa8effe8 Refactor paddleaudio to paddlespeech.audio
2 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
2 years ago
liangym e0c0804ce3
Merge pull request #2025 from Jackwaterveg/fix
2 years ago
huangyuxin 2b5bc6df39 fix cli, test=doc
2 years ago
Jackwaterveg b4e868ac0f
Merge pull request #2020 from Jackwaterveg/develop_dev
2 years ago
Hui Zhang 28c1794b9b format
2 years ago
huangyuxin 678cd15354 update model, test=doc
2 years ago
Jackwaterveg 05fd9a0e2c
Merge pull request #2019 from Jackwaterveg/develop_dev
2 years ago
huangyuxin 06c9eee339 update reademe, add conf file, updata test_cli
2 years ago
Hui Zhang 1f5f34a815
Merge pull request #2016 from Jackwaterveg/develop_dev
2 years ago
Hui Zhang 262f42d49f do not reset result for web ws api
2 years ago
huangyuxin 6ebe476532 support editing num_decode_left_chunks in cli and server
2 years ago
Hui Zhang dfdf450b22 fix #2013; and format
2 years ago
Hui Zhang 69a6da4c16 ctc endpoint work
2 years ago
Hui Zhang 8f9b7bba48 refactor asr online server
2 years ago
lym0302 90b7f88eb5 fix hifigan pad value
2 years ago
liangym 919c8d0607 Merge branch 'PaddlePaddle:develop' into update_engine
2 years ago
Hui Zhang 82c1f4c508
Merge pull request #1997 from Jackwaterveg/develop_dev
2 years ago
TianYuan 84d13dea18
Merge pull request #2002 from KPatr1ck/resource
2 years ago
KP 613b689017 Fix tts_online server issue. test=doc
2 years ago
liangym 8b1c1ec43f
Merge branch 'PaddlePaddle:develop' into update_engine
2 years ago
TianYuan 1a6df85f97
Merge pull request #2000 from yt605155624/fix_chunk
2 years ago
huangyuxin e48e1d5e81 fix tiny and local script, test=asr
2 years ago
KP f77b8ede74 Fix server issues.
2 years ago
TianYuan 004ab8d0c0 reneame chunk to block in streaming tts, test=tts
2 years ago
liangym 4a11257dcb
Merge branch 'develop' into update_engine
2 years ago
KP 46690b1b3c Fix windows issue in paddlespeech.resource
2 years ago
huangyuxin 47dd61e5b2 refactor ds2, cli, server
2 years ago
Hui Zhang 0fa32e4aae
Merge pull request #1917 from KPatr1ck/resource
2 years ago
KP 6436f343bb Fix asr_inference server engine.
2 years ago
TianYuan aa3d151d1d
Merge pull request #1994 from Jackwaterveg/develop
2 years ago
huangyuxin 10819e0fa2 not install ctc on win, test=asr
2 years ago
lym0302 d48c4d686a update engine, test=doc
3 years ago
KP 6a08221525 Add paddlespeech.resource.
3 years ago
KP 1e066fab9e Add paddlespeech.resource.
3 years ago
KP dcea088c66 Add paddlespeech.resource.
3 years ago
KP 7766d7344d Add paddlespeech.resource.
3 years ago
KP fa6e44e4ff Add paddlespeech.resource.
3 years ago
TianYuan b6ad4260eb fix bug in tts cli, test=tts
3 years ago
Hui Zhang 42fba661c9 more detail of copyright
3 years ago
Hui Zhang 3d88ac4e68
Merge pull request #1950 from Jackwaterveg/develop
3 years ago
TianYuan 7bc54cbbe6
Merge pull request #1957 from yt605155624/vits_doc
3 years ago
KP 6c57c2bf8e Dynamic cli commands registration.
3 years ago
TianYuan f9f014d159 add VITS readme, test=tts
3 years ago
Hui Zhang 6f7917b7f2 fix streaming asr
3 years ago
Hui Zhang f07f57a3a8
Merge pull request #1945 from PaddlePaddle/asr_line
3 years ago
Hui Zhang 8f8239ad3b
Merge pull request #1954 from Honei/acs_server
3 years ago
xiongxinlei be8a78a9d1 fix the vector model type error, test=doc
3 years ago
xiongxinlei a5605978fa update the acs note, test=doc
3 years ago
xiongxinlei a83374a787 update the vector readme, test=doc
3 years ago
xiongxinlei 7afbdbefad update the vector model, test=doc
3 years ago
huangyuxin b23bde8ec5 tensor.shape => paddle.shape(tensor)
3 years ago
huangyuxin 4c09927f61 fix
3 years ago
huangyuxin e1888f9ae6 remove size,test=asr
3 years ago
Zhangjingyu06 acb19cf465 deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06 b0eaeccd67 deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06 1e91f7da35 deepspeech2 modify for kunlun
3 years ago
huangyuxin 1cdd41bd03 fix pad_sequence, test=asr
3 years ago
TianYuan 5ee3cc0c31
Merge pull request #1855 from yt605155624/add_vits
3 years ago
Hui Zhang c15278ed80 format
3 years ago
TianYuan 327509951f rm unused comment, test=tts
3 years ago
TianYuan 433ebf2594
Merge pull request #1940 from yt605155624/rm_fluid
3 years ago
Hui Zhang 943272385a refactor asr online
3 years ago
TianYuan c1b512c58a rm fluid in tts, test=tts
3 years ago
huangyuxin ea71fddbde fix condition of wenetspeech
3 years ago
Jackwaterveg 3638320f3b
fix self.max_len
3 years ago
huangyuxin 008c812f63 fix cli/asr
3 years ago
TianYuan df3f975ea5 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_vits
3 years ago
TianYuan 58028509c3 replace dynamic_import
3 years ago
iftaken 2938d3e49b Merge branch 'develop' into dev-hym
3 years ago
TianYuan 8db06444c5 add vits trainer and synthesize
3 years ago
Honei bde7093578
Merge pull request #1906 from Honei/acs_server
3 years ago
Hui Zhang 4b3f6c615e
Merge pull request #1913 from Honei/timelimit
3 years ago
xiongxinlei b1ef434983 update the max len compute method, test=doc
3 years ago
xiongxinlei 0ea39f837b add asr time limt configuration, test=doc
3 years ago
xiongxinlei 25ee960571 refair the acs note, test=doc
3 years ago
TianYuan e61757dbf7 fix yao, test=tts
3 years ago
xiongxinlei 3535079434 update the acs engine doc, test=doc
3 years ago
xiongxinlei d94ab22e92 acs server, test=doc
3 years ago
xiongxinlei f57fff24fb update the init flag, test=doc
3 years ago
xiongxinlei 67939d0d66 add check asr server model type, test=doc
3 years ago
lym0302 c5d2224d6d fix cors, test=doc
3 years ago
iftaken c7dd207038 fixs CORS Error
3 years ago
Hui Zhang a11dc53c1b
Merge pull request #1888 from Jackwaterveg/develop
3 years ago
root 88501fc32a fix server doc and decode_method
3 years ago
Jerryuhoo 167aaa65b9 normalize wav max value to 1 in preprocess, test=tts
3 years ago
liangym 22b67ed051
Merge pull request #1882 from lym0302/streaming_tts_server
3 years ago
Hui Zhang f86317893e
Merge pull request #1884 from Honei/develop
3 years ago
lym0302 be21aed09b trans remove file way, test=doc
3 years ago
lym0302 b1f9b8016d add start and end request on ws tts, test=doc
3 years ago
xiongxinlei 347af638e2 changet vector train.py local_rank to rank, test=doc
3 years ago
lym0302 d4f863dc97 improve, test=doc
3 years ago
pollyyan 018dda6ee9
Merge pull request #1879 from QingshuChen/develop
3 years ago
Hui Zhang c23a97e242
Merge pull request #1877 from Jackwaterveg/develop
3 years ago
Hui Zhang 5b053cde6a
Merge pull request #1878 from Honei/develop
3 years ago
xiongxinlei 06bea5f03d update the vector and text readme, test=doc
3 years ago
QingshuChen e55177c3db speedyspeech support kunlun
3 years ago
root 9f389a7a33 support cpu, test=asr
3 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
3 years ago
TianYuan 4b7786f2ed add vits network scripts, test=tts
3 years ago
KP 19d015b60a Add RFT for asr task.
3 years ago
KP da08f1c1af Add RFT for asr task.
3 years ago
Hui Zhang 12ae137c83 update tts_api for ws
3 years ago
Hui Zhang 175c67b75e asr socket to asr api
3 years ago
Hui Zhang 7be6b0e8cf unify name style & frame with abs timestamp
3 years ago
Hui Zhang 15b25199c2
Merge pull request #1864 from zh794390558/doc
3 years ago
xiongxinlei bb0db29d7e update the streaming asr readme, test=doc
3 years ago
root 4d7046d244 updata released model info, test=doc
3 years ago
liangym e7a35485e4
Merge pull request #1859 from lym0302/update_readme
3 years ago
Hui Zhang 02e7586394 update readme
3 years ago
lym0302 b361a73888 improve server code, test=doc
3 years ago
Hui Zhang 94aaa61726
Merge pull request #1858 from KPatr1ck/cli_version
3 years ago
KP 677898ab96 Add version command in cli.
3 years ago
Hui Zhang 13503613b4
Merge pull request #1853 from Jackwaterveg/develop
3 years ago
root 3a7896fc96 update cli, test=asr
3 years ago
liangym e87495f045
[server] update readme (#1851)
3 years ago
Hui Zhang 37c6106ee0
Merge pull request #1848 from zh794390558/spx
3 years ago
Hui Zhang 8522b82999 format
3 years ago
xiongxinlei b7a77eebca update the time stamp type, test=doc
3 years ago
Honei 43582f5091
Merge branch 'develop' into asr_time
3 years ago
Hui Zhang d99e99ce2c
Merge pull request #1836 from Honei/punc
3 years ago
Hui Zhang 435e86b335
Merge pull request #1835 from Honei/vec_server
3 years ago
xiongxinlei 10da21a77b update the vector cli for server, test=doc
3 years ago
xiongxinlei 2ab96187aa streaming asr server add time stamp, test=doc
3 years ago
xiongxinlei c78653850b join streaming asr and punc server, test=doc
3 years ago
xiongxinlei 3950557e04 update the vector server note, test=doc
3 years ago
xiongxinlei b1dddddbe0 add vector server, test=doc
3 years ago
Jerryuhoo fba0693a20 fix random speaker embedding bug, test=tts
3 years ago
Hui Zhang cdb9a1b20b
Merge pull request #1813 from Honei/v0.3
3 years ago