Commit Graph

1072 Commits (d35dda002012d84b742dfef86b917fd7d3a40b37)

Author SHA1 Message Date
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
艾梦 ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
2 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
2 years ago
TianYuan 5d5888af8e
fix tone, update readme (#2335)
2 years ago
贾晓 0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
Hui Zhang 58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang 1dfca4ef73 fix multigpu training
2 years ago
Hui Zhang 94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang) f5367f5efb
[TTS]fix bug of tone modify (#2323)
2 years ago
Zhao Yuting c28064fec2
Update asr_engine.py (#2302)
2 years ago
TianYuan 7b864e8f38
clean old ernie sat inference scripts (#2316)
2 years ago
David An (An Hongliang) c7163abffa
add thanks into readme, append data for chinese unit (#2312)
2 years ago
彭震东 c9de22eaa8
[TN] Update quantifiers (#2308)
2 years ago
TianYuan d1c70a7809
fix g2pw model (#2304)
2 years ago
liangym 043b21d3b4
fix mix frontend, test=tts (#2299)
2 years ago
David An (An Hongliang) 25b96405df
add chinese words correct phonic,test=tts (#2300)
2 years ago
TianYuan c1d4551055
add ernie sat synthesize_e2e, test=tts (#2287)
2 years ago
李子 5a58a27492
[TTS]指定G2PW的传入数据类型 , test=tts (#2288)
2 years ago
TianYuan 3f9339edff
Update polyphonic.yaml
2 years ago
TianYuan f9a6970a62
Merge pull request #2263 from oyjxer/pc
2 years ago
lym0302 677e0961a8 fix point bug, test=tts
2 years ago
TianYuan 4a59702d60
Merge pull request #2255 from lym0302/develop
2 years ago
TianYuan 0baec4325a fix stats bugs
2 years ago
TianYuan f7780658db fix tone sand_hi bugs for Chinese frontend
2 years ago
pangchao04 b9be2bd64a add ernie-sat sampler
2 years ago
lym0302 f8f73e41f0 fix point bug, test=tts
2 years ago
TianYuan 5de2c2dab5 format g2pw
2 years ago
TianYuan 5d515f3f3f update mix tts
2 years ago
TianYuan a75b2a5bab
Merge pull request #2230 from BarryKCL/develop_g2pW
2 years ago
TianYuan db89cfe829
Merge pull request #2234 from lym0302/mix_example
2 years ago
TianYuan 8dbefc0165 fix preprocess bug, add hifigan_csmsc decoder, update readme
2 years ago
BarryKCL a84b40ef79 update g2pW dict
2 years ago
Zhao Yuting d02e04d532
Update audio_handler.py
2 years ago
BarryKCL 6593c24968 set window_size None
2 years ago
BarryKCL 5e63ac1e60 Fix a bug in g2pW
2 years ago
TianYuan 0eb598b876
Merge pull request #2235 from david-95/hongliang-dev
2 years ago
david.95 0df7fc8fbf remove comment
2 years ago
david.95 7ba74f175f remove comment
2 years ago
david.95 f52a87b8d0 remove useless fix, test=tts
2 years ago
david.95 a48e4f249f add filter for double punctuation, revise comment ;
2 years ago
BarryKCL aecf8fd384 add onnxruntime sess_options
2 years ago
lym0302 368e3e1b59 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
2 years ago
lym0302 894556f871 add zh_en mix example, test=tts
2 years ago
david.95 1edd474bcb add filter for double punctuation in sentences; add homonym, test=tts
2 years ago
BarryKCL 61dd92e49c update
2 years ago
BarryKCL de0f99150a change G2PWModel download
2 years ago
BarryKCL 744ea44279 add comment
2 years ago
BarryKCL 7b0f2a796d change transformers to paddlenlp.transformers
2 years ago
BarryKCL e60a63fbdd Rollback "get_input_ids"
2 years ago
BarryKCL ab2a1219c8 Add g2pW to Chinese frontend
2 years ago
TianYuan 2f9bdf2306
Merge pull request #2222 from yt605155624/add_onnx_cli
2 years ago
TianYuan c3d47441cf fix fs bug in inference.py (change fixed 24000 to variable for ljspeech)
2 years ago
TianYuan 8da993bbf8 fix fs bug
2 years ago
TianYuan 788a3062d0 fix onnx am_ckpt from list to item in prtrained_mdoels.py
2 years ago
TianYuan c6b25c05f4 change logger.debug to logger.info for streaming asr
2 years ago
Hui Zhang c1fbfe928e add test
2 years ago
TianYuan cd662a08e0 fix for load specified model files
2 years ago
TianYuan b9ade18055 add onnxruntime infer for cli
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 923b0b873e fix import kws.exps.mdtc
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago
TianYuan b0b3222f9a
Merge pull request #2213 from yt605155624/fix_name_bug
2 years ago
TianYuan 354601d0e9 fix readme, test=doc
2 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
2 years ago
TianYuan 510e240c5c achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan 00e9853f66 add mix tts cli, test=tts
2 years ago
0x45f 4e7106d9e2 Support dy2st
2 years ago
TianYuan 1f128a0817
Merge pull request #2117 from yt605155624/ernie_sat_trainer
2 years ago
TianYuan 1bf78fa5c7 updatte batch_fn train.py, test=doc
2 years ago
TianYuan 9d4161ce5f update config, test=doc
2 years ago
lym0302 e1f8695456 add mix tts, test=tts
2 years ago
Betterman-qs e2dc204d4d update engine_warmup.py, test=tts
2 years ago
Betterman-qs cf1b873528 update engine_warmup.py, test=tts
2 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
2 years ago
0x45f e21cceea51 Remove blank line
2 years ago
0x45f e6ac8881f1 Fix comments
2 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
Hui Zhang d098e027ca
Merge pull request #2155 from Jackwaterveg/develop_dev
2 years ago
0x45f 294b7b00bd Supprot dy2st for conformer
2 years ago
TianYuan 97965f4c37 fix mlm_prob, test=tts
2 years ago
huangyuxin 7c9208765f fix audio,test=doc
2 years ago
huangyuxin 75997d8277 merge
2 years ago
TianYuan 72fa8176ca fix for mix_lang
2 years ago
TianYuan 5503c8bd6b add ernie_sat synthesize script for metadata.jsonl, test=tts
2 years ago
TianYuan f4ac0c79d9
Merge pull request #2143 from lym0302/mix_front
2 years ago
Jackwaterveg ae7a73bc11
Merge pull request #2138 from zh794390558/demos
2 years ago
lym0302 207bb5d93b add mix frontend, test=tts
2 years ago
Hui Zhang e62cbc464e
Merge pull request #2124 from zh794390558/new_api
2 years ago
Hui Zhang 8376f3d40d
Merge pull request #2128 from zh794390558/endpoint
2 years ago
Hui Zhang caaa5cd502 more cli for speech demos
2 years ago
Hui Zhang 1edf120506 fix comment error
2 years ago
Hui Zhang d142d3a7c0 add docstring
2 years ago
Hui Zhang f8450c39e5 rename n_v_s to n_v_b, n_v_ns to n_v_nb
2 years ago
Hui Zhang f4b11b19e5 rename time_s and time_ns to time_b and time_nb
2 years ago
liangym 45f51651bf
Merge pull request #2129 from lym0302/onnx_gpu
2 years ago
lym0302 3d5ed00c60 specify id, test=doc
2 years ago
Hui Zhang 98eed53e6d more accuracy decoding somthing
2 years ago
TianYuan 028742b69a update lr scheduler
3 years ago
TianYuan 94688264c7 add ernie sat model file and config
3 years ago
Hui Zhang e81849277e att cache for streaming asr
3 years ago
Hui Zhang 5ca05fea20 cli batch process support \t
3 years ago
Hui Zhang fb40602d94 refactor attention cache
3 years ago
liangym e153495519
Merge pull request #2122 from yt605155624/rm_server_log
3 years ago
TianYuan 6bbe6de1ec add stream_play_tts.py, test=doc
3 years ago
lym0302 d66d6a05c7 Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into develop
3 years ago
lym0302 5b06b76ebc change sr, test=doc
3 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
3 years ago
huangyuxin 92d1d08b9a fix scripts
3 years ago
TianYuan f6d1c545ac fromat doc_string
3 years ago
TianYuan 4b1f82d312 log redundancy in server
3 years ago
TianYuan e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
3 years ago
TianYuan 496e2dd14b fix Pillow's version
3 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
3 years ago
TianYuan f76bd9fe51
Merge pull request #2109 from raycool/fix_log
3 years ago
TianYuan e10eaa397e
Merge pull request #2100 from Jackwaterveg/develop_dev
3 years ago
huangyuxin 98cfdc4c05 fix nxpu
3 years ago
huguanghui ddf14662ca fix log issue #2070
3 years ago
huguanghui 20a9a67925 fix log issue #2070
3 years ago
TianYuan cf846f9ebc rm extra log
3 years ago
KP 527744d5f0 Fix unnecessary download present in issue #2067.
3 years ago
KP adc7c9b4aa Fix unnecessary download present in issue #2067.
3 years ago
huangyuxin 7463df89c5 fix nxpu
3 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
3 years ago
huangyuxin 429221dc03 adopt multi machine traiing
3 years ago
huangyuxin ac1b301657 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
3 years ago
TianYuan d8a0ba5913
Merge pull request #2085 from yt605155624/fix_tts_cli_log
3 years ago
Jackwaterveg 6598216b2f
Merge branch 'develop' into webdataset
3 years ago
TianYuan c0f126ecd9 fix int32 warning in tts, test=tts
3 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
3 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
3 years ago
huangyuxin 0c7abc1f17 add training scripts
3 years ago
TianYuan 5ff885f116 add tts static/onnx models in pretrained_models.py
3 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
3 years ago
贾晓 0fa3fdb9ee
Merge pull request #2068 from yt605155624/p_norm
3 years ago
TianYuan 7743c6a1ff add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
3 years ago
KP b230dfbdec Add kws cli and demo.
3 years ago
huangyuxin 8f5e61090b new feature: Add webdataset in audio
3 years ago
TianYuan 46ff848d66
Merge pull request #2056 from lym0302/develop
3 years ago
Hui Zhang e04cd18846
Merge pull request #2050 from zh794390558/onnx_quant
3 years ago
lym0302 7d4f320836 fix_model_init, test=doc
3 years ago
TianYuan d1aa83a239
Merge pull request #2052 from yt605155624/ernie_sat
3 years ago
Hui Zhang 54a777055a
Merge pull request #2039 from iftaken/dev-hym
3 years ago
Hui Zhang d20adb5c89
Merge pull request #2048 from KPatr1ck/import_bug
3 years ago
TianYuan 79658a5f20 add ernie sat inference, test=tts
3 years ago
TianYuan 02734141ce
Merge pull request #2040 from yt605155624/add_blank
3 years ago
Hui Zhang d95b0cd9b2 add release and resource
3 years ago
Hui Zhang 3cf1f1f0b5 support onnx quantize
3 years ago
Jackwaterveg 6dfe7273e6
Merge pull request #2045 from zh794390558/wenetspeech_onnx
3 years ago
KP b452be3d8d Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
3 years ago
KP 220bcffac8 Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
3 years ago
KP fc5f0b14e0 Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
3 years ago
KP fe345409bb Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
3 years ago
huangyuxin c552c0877f fix ds2 server config
3 years ago
Hui Zhang d21e6d8adb fix window ms config
3 years ago
Hui Zhang 59a78f2a46 ds2 wenetspeech to onnx and support streaming asr server
3 years ago
huangyuxin 704b5f8bc4 fix win len in ds2 server
3 years ago
Hui Zhang 0f8e9cdd32 add init file
3 years ago
iftaken 357b177232 rename readme and fixed conflict
3 years ago
TianYuan 1731976e4e add blank between characters for vits, test=tts
3 years ago
Hui Zhang 73d702fd6a
Merge branch 'develop' into asr_ol
3 years ago
Hui Zhang 27f2833bf7 format
3 years ago
iftaken cb50999096 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into dev-hym
3 years ago
Hui Zhang 5e03d753ac add ds2 steaming asr onnx
3 years ago
Hui Zhang cd0b82ff1b fix pretrain model error
3 years ago
Hui Zhang 9106daa2a3 code format
3 years ago
Hui Zhang 42d28b961c fix pretrian model error
3 years ago
iftaken 474373bceb Merge branch 'develop' into dev-hym
3 years ago
Hui Zhang 4851d1d3a2 Merge branch 'onnx' into asr_ol
3 years ago
Hui Zhang d66bfefd01 update aishell ds2 streaming onnx model
3 years ago
Hui Zhang ff3b1ff817 opt onnx script
3 years ago
Hui Zhang 3cee7db021 onnx ds2 straming asr
3 years ago
Hui Zhang b4c6a52beb
Merge pull request #2034 from zh794390558/onnx
3 years ago
Hui Zhang c8574c7e35 ds2 inference as sepearte engine for streaming asr
3 years ago
Hui Zhang b9e3e49305 refactor stream asr and fix ds2 stream bug
3 years ago
Jackwaterveg bca014fd92
Merge pull request #2032 from PaddlePaddle/audio_refactoring
3 years ago
Hui Zhang 1628e19bce
Merge branch 'develop' into onnx
3 years ago
KP 4aaa8effe8 Refactor paddleaudio to paddlespeech.audio
3 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
3 years ago
liangym e0c0804ce3
Merge pull request #2025 from Jackwaterveg/fix
3 years ago
huangyuxin 2b5bc6df39 fix cli, test=doc
3 years ago
Jackwaterveg b4e868ac0f
Merge pull request #2020 from Jackwaterveg/develop_dev
3 years ago
Hui Zhang 28c1794b9b format
3 years ago
huangyuxin 678cd15354 update model, test=doc
3 years ago
Jackwaterveg 05fd9a0e2c
Merge pull request #2019 from Jackwaterveg/develop_dev
3 years ago
huangyuxin 06c9eee339 update reademe, add conf file, updata test_cli
3 years ago
Hui Zhang 1f5f34a815
Merge pull request #2016 from Jackwaterveg/develop_dev
3 years ago
Hui Zhang 262f42d49f do not reset result for web ws api
3 years ago
huangyuxin 6ebe476532 support editing num_decode_left_chunks in cli and server
3 years ago
Hui Zhang dfdf450b22 fix #2013; and format
3 years ago
Hui Zhang 69a6da4c16 ctc endpoint work
3 years ago
Hui Zhang 8f9b7bba48 refactor asr online server
3 years ago
lym0302 90b7f88eb5 fix hifigan pad value
3 years ago
liangym 919c8d0607 Merge branch 'PaddlePaddle:develop' into update_engine
3 years ago