Commit Graph

1215 Commits (290d161d8ac9d44bd0cb31e9b521bd37ade5c326)

Author SHA1 Message Date
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
2 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang e367242765 update dependency of paddle
2 years ago
tianhao zhang 5a66a14659 fix u2pp model version number
2 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
2 years ago
Zth9730 c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
2 years ago
Hui Zhang c98b5dd173 fix masked_fill, which produces NaN in training
2 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
2 years ago
Hui Zhang 1f4f98b171 fix bug
2 years ago
liangym 0359c3f6b5
Fix mix front (#2493)
2 years ago
Hui Zhang e86337a423 fix bug
2 years ago
Hui Zhang 925abcca23 format
2 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
2 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
2 years ago
tianhao zhang 5b5167b586 support u2pp cli and server, optimize code of u2pp decode, test=asr
2 years ago
YangZhou 3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
2 years ago
ZapBird 7a13b35fe6
For BytesIO input, make sure to seek back to the start position so that repeated reads work correctly, e.g. in the __call__ function. (#2484)
2 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimize code of u2pp decode, test=asr
2 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
2 years ago
Zhao Yuting 304dc2603c
Update text_engine.py
2 years ago
Zhao Yuting 8c945c073d
Update application.yaml
2 years ago
Zhao Yuting b9693a0e8e
Update text_engine.py
2 years ago
Zhao Yuting 8ecf6796f3
Update text_engine.py
2 years ago
Hui Zhang afda7ed7d1 remove useless code
2 years ago
YangZhou 4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
2 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
2 years ago
Zhao Yuting d2da7f50d2
Update text_engine.py
2 years ago
Zhao Yuting 82f731c153
Update application.yaml
2 years ago
Hui Zhang feb27e2a84 fuse linear kv
2 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
2 years ago
Hui Zhang 46088c0a16 eliminate attn transpose
2 years ago
Hui Zhang f9e3eaa024 transpose in matmul
2 years ago
Hui Zhang 3d7ca93861 bool type slice
2 years ago
Hui Zhang c2c8a662b1 refactor reshape
2 years ago
Hui Zhang 6de81d74d9 eliminate cast dtype for bool op
2 years ago
Hui Zhang 8e7a315e00 remove comment
2 years ago
Hui Zhang c4a5ae3825 eliminate mul
2 years ago
Hui Zhang b7388ce25a eliminate useless unsqueeze
2 years ago
Hui Zhang 1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
2 years ago
TianYuan 52af86fcc3
fix ERNIE-SAT bug when editing the end of sentences, test=doc (#2432)
2 years ago
tianhao zhang d3e5937591 support bitransformer decoder
2 years ago
Hui Zhang 7382050e21 fix bug on win
2 years ago
TianYuan b14da765e8
frm random spk embedding in voice cloning, test=doc (#2429)
2 years ago
Hui Zhang d25871a7b0 format
2 years ago
Hui Zhang b10512eb0e more config for u2pp
2 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
2 years ago
zhoupc2015 2ae0f66d0d
Solve "unknown format: 3" (#2422)
2 years ago
Hui Zhang 309c8d70d9 add reverse weight
2 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
2 years ago
THUzyt21 bdbacd4249 precommitted
2 years ago
Zhao Yuting d5dec46336
Update README.md
2 years ago
Zhao Yuting 18b71dc136
Update README.md
2 years ago
tianhao zhang 0a95689461 support bitransformer decoder
2 years ago
tianhao zhang 455379b88e support bitransformer decoder
2 years ago
Zhao Yuting a63a0b1350
Update pretrained_models.py
2 years ago
Zhao Yuting 12a11394bd
Update infer.py
2 years ago
Zhao Yuting fb7f04e021
Update README.md
2 years ago
Zhao Yuting 92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting 57dcd0d17f
Update infer.py
2 years ago
Zhao Yuting b627666ce9
Update model_alias.py
2 years ago
Zhao Yuting a02654660a
Update pretrained_models.py
2 years ago
tianhao zhang ecbf324286 support bitransformer decoder, test=asr
2 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
2 years ago
Hui Zhang 53d6baff0b format
2 years ago
Hui Zhang 549d477592 fix code style
2 years ago
Hui Zhang 4d5cfd4003 export param from config
2 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
2 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
2 years ago
TianYuan 5e714ecb4a
[doc]update api docs (#2406)
2 years ago
TianYuan eac362057c
add typehint for g2pw (#2390)
2 years ago
Hui Zhang 0d7d87120b simplify feature pipeline graph
2 years ago
WongLaw 324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts (#2380)
2 years ago
TianYuan 80b180217d
[TTS] fix some bugs of ERNIE-SAT (#2378)
2 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang 07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
2 years ago
Hui Zhang 3a8869fba4 rm to_static decorator; configure jit save for ctc_activation
2 years ago
Hui Zhang 1c9f238ba0 configurable export
2 years ago
Hui Zhang 63aeb747b0 more comment
2 years ago
Hui Zhang a7c6c54e75 fix
2 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang 663e3ab58e fix dp init
2 years ago
tianhao zhang 6745e9dd6b fix dp init
2 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
2 years ago
WongLaw 989b755e8e
Revised must_neural_tone_words, test=doc. (#2370)
2 years ago
tianhao zhang 9560d650db fix dp init
2 years ago
TianYuan 7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
2 years ago
tianhao zhang 82e04d7815 fix trainer
2 years ago
TianYuan f7873773bf
add __init__.py for VITS, test=tts (#2362)
2 years ago
TianYuan 35c6ffa90b Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan e622f42d92 add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan 1c30cff1bf
fix gpus of ernie_sat, test=tts (#2355)
2 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
艾梦 ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
2 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
2 years ago
TianYuan 5d5888af8e
fix tone, update readme (#2335)
2 years ago
贾晓 0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
Hui Zhang 58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang 1dfca4ef73 fix multigpu training
2 years ago
Hui Zhang 94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang) f5367f5efb
[TTS]fix bug of tone modify (#2323)
2 years ago
Zhao Yuting c28064fec2
Update asr_engine.py (#2302)
2 years ago
TianYuan 7b864e8f38
clean old ernie sat inference scripts (#2316)
2 years ago
David An (An Hongliang) c7163abffa
add thanks into readme, append data for chinese unit (#2312)
2 years ago
彭震东 c9de22eaa8
[TN] Update quantifiers (#2308)
2 years ago
TianYuan d1c70a7809
fix g2pw model (#2304)
2 years ago
liangym 043b21d3b4
fix mix frontend, test=tts (#2299)
2 years ago
David An (An Hongliang) 25b96405df
add chinese words correct phonic,test=tts (#2300)
2 years ago
TianYuan c1d4551055
add ernie sat synthesize_e2e, test=tts (#2287)
2 years ago
李子 5a58a27492
[TTS] Specify the input data type for G2PW, test=tts (#2288)
2 years ago
TianYuan 3f9339edff
Update polyphonic.yaml
2 years ago
TianYuan f9a6970a62
Merge pull request #2263 from oyjxer/pc
2 years ago
lym0302 677e0961a8 fix point bug, test=tts
2 years ago
TianYuan 4a59702d60
Merge pull request #2255 from lym0302/develop
2 years ago
TianYuan 0baec4325a fix stats bugs
2 years ago
TianYuan f7780658db fix tone sandhi bugs for Chinese frontend
2 years ago
pangchao04 b9be2bd64a add ernie-sat sampler
2 years ago
lym0302 f8f73e41f0 fix point bug, test=tts
2 years ago
TianYuan 5de2c2dab5 format g2pw
2 years ago
TianYuan 5d515f3f3f update mix tts
2 years ago
TianYuan a75b2a5bab
Merge pull request #2230 from BarryKCL/develop_g2pW
2 years ago
TianYuan db89cfe829
Merge pull request #2234 from lym0302/mix_example
2 years ago
TianYuan 8dbefc0165 fix preprocess bug, add hifigan_csmsc decoder, update readme
2 years ago
BarryKCL a84b40ef79 update g2pW dict
2 years ago
Zhao Yuting d02e04d532
Update audio_handler.py
2 years ago
BarryKCL 6593c24968 set window_size None
2 years ago
BarryKCL 5e63ac1e60 Fix a bug in g2pW
2 years ago
TianYuan 0eb598b876
Merge pull request #2235 from david-95/hongliang-dev
2 years ago
david.95 0df7fc8fbf remove comment
2 years ago
david.95 7ba74f175f remove comment
2 years ago
david.95 f52a87b8d0 remove useless fix, test=tts
2 years ago
david.95 a48e4f249f add filter for double punctuation, revise comment
2 years ago
BarryKCL aecf8fd384 add onnxruntime sess_options
2 years ago
lym0302 368e3e1b59 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
2 years ago
lym0302 894556f871 add zh_en mix example, test=tts
2 years ago
david.95 1edd474bcb add filter for double punctuation in sentences; add homonym, test=tts
2 years ago
BarryKCL 61dd92e49c update
2 years ago
BarryKCL de0f99150a change G2PWModel download
2 years ago
BarryKCL 744ea44279 add comment
2 years ago
BarryKCL 7b0f2a796d change transformers to paddlenlp.transformers
2 years ago
BarryKCL e60a63fbdd Rollback "get_input_ids"
2 years ago
BarryKCL ab2a1219c8 Add g2pW to Chinese frontend
2 years ago
TianYuan 2f9bdf2306
Merge pull request #2222 from yt605155624/add_onnx_cli
2 years ago
TianYuan c3d47441cf fix fs bug in inference.py (change fixed 24000 to variable for ljspeech)
2 years ago
TianYuan 8da993bbf8 fix fs bug
2 years ago
TianYuan 788a3062d0 fix onnx am_ckpt from list to item in pretrained_models.py
2 years ago
TianYuan c6b25c05f4 change logger.debug to logger.info for streaming asr
2 years ago
Hui Zhang c1fbfe928e add test
2 years ago
TianYuan cd662a08e0 fix for load specified model files
2 years ago
TianYuan b9ade18055 add onnxruntime infer for cli
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 923b0b873e fix import kws.exps.mdtc
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago
TianYuan b0b3222f9a
Merge pull request #2213 from yt605155624/fix_name_bug
2 years ago
TianYuan 354601d0e9 fix readme, test=doc
2 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
2 years ago
TianYuan 510e240c5c change default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan 00e9853f66 add mix tts cli, test=tts
2 years ago
0x45f 4e7106d9e2 Support dy2st
2 years ago
TianYuan 1f128a0817
Merge pull request #2117 from yt605155624/ernie_sat_trainer
2 years ago
TianYuan 1bf78fa5c7 update batch_fn train.py, test=doc
2 years ago
TianYuan 9d4161ce5f update config, test=doc
2 years ago
lym0302 e1f8695456 add mix tts, test=tts
2 years ago
Betterman-qs e2dc204d4d update engine_warmup.py, test=tts
2 years ago
Betterman-qs cf1b873528 update engine_warmup.py, test=tts
2 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
2 years ago
0x45f e21cceea51 Remove blank line
2 years ago
0x45f e6ac8881f1 Fix comments
2 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
Hui Zhang d098e027ca
Merge pull request #2155 from Jackwaterveg/develop_dev
2 years ago
0x45f 294b7b00bd Support dy2st for conformer
2 years ago
TianYuan 97965f4c37 fix mlm_prob, test=tts
2 years ago
huangyuxin 7c9208765f fix audio,test=doc
2 years ago
huangyuxin 75997d8277 merge
2 years ago
TianYuan 72fa8176ca fix for mix_lang
2 years ago
TianYuan 5503c8bd6b add ernie_sat synthesize script for metadata.jsonl, test=tts
2 years ago
TianYuan f4ac0c79d9
Merge pull request #2143 from lym0302/mix_front
2 years ago
Jackwaterveg ae7a73bc11
Merge pull request #2138 from zh794390558/demos
2 years ago
lym0302 207bb5d93b add mix frontend, test=tts
2 years ago
Hui Zhang e62cbc464e
Merge pull request #2124 from zh794390558/new_api
2 years ago
Hui Zhang 8376f3d40d
Merge pull request #2128 from zh794390558/endpoint
2 years ago
Hui Zhang caaa5cd502 more cli for speech demos
2 years ago
Hui Zhang 1edf120506 fix comment error
2 years ago
Hui Zhang d142d3a7c0 add docstring
2 years ago
Hui Zhang f8450c39e5 rename n_v_s to n_v_b, n_v_ns to n_v_nb
2 years ago
Hui Zhang f4b11b19e5 rename time_s and time_ns to time_b and time_nb
2 years ago
liangym 45f51651bf
Merge pull request #2129 from lym0302/onnx_gpu
2 years ago
lym0302 3d5ed00c60 specify id, test=doc
2 years ago
Hui Zhang 98eed53e6d more accuracy decoding something
2 years ago
TianYuan 028742b69a update lr scheduler
2 years ago
TianYuan 94688264c7 add ernie sat model file and config
2 years ago
Hui Zhang e81849277e att cache for streaming asr
2 years ago
Hui Zhang 5ca05fea20 cli batch process support \t
2 years ago
Hui Zhang fb40602d94 refactor attention cache
2 years ago
liangym e153495519
Merge pull request #2122 from yt605155624/rm_server_log
2 years ago
TianYuan 6bbe6de1ec add stream_play_tts.py, test=doc
2 years ago
lym0302 d66d6a05c7 Merge branch 'develop' of https://github.com/lym0302/PaddleSpeech into develop
2 years ago
lym0302 5b06b76ebc change sr, test=doc
2 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
2 years ago
huangyuxin 92d1d08b9a fix scripts
2 years ago
TianYuan f6d1c545ac format doc_string
2 years ago
TianYuan 4b1f82d312 log redundancy in server
2 years ago
TianYuan e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
2 years ago
TianYuan 496e2dd14b fix Pillow's version
2 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
TianYuan f76bd9fe51
Merge pull request #2109 from raycool/fix_log
2 years ago
TianYuan e10eaa397e
Merge pull request #2100 from Jackwaterveg/develop_dev
2 years ago
huangyuxin 98cfdc4c05 fix nxpu
2 years ago
huguanghui ddf14662ca fix log issue #2070
2 years ago
huguanghui 20a9a67925 fix log issue #2070
2 years ago
TianYuan cf846f9ebc rm extra log
2 years ago
KP 527744d5f0 Fix unnecessary download present in issue #2067.
2 years ago
KP adc7c9b4aa Fix unnecessary download present in issue #2067.
2 years ago
huangyuxin 7463df89c5 fix nxpu
2 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
huangyuxin 429221dc03 adopt multi machine training
2 years ago
huangyuxin ac1b301657 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
TianYuan d8a0ba5913
Merge pull request #2085 from yt605155624/fix_tts_cli_log
2 years ago
Jackwaterveg 6598216b2f
Merge branch 'develop' into webdataset
2 years ago
TianYuan c0f126ecd9 fix int32 warning in tts, test=tts
2 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
2 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
TianYuan 5ff885f116 add tts static/onnx models in pretrained_models.py
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
贾晓 0fa3fdb9ee
Merge pull request #2068 from yt605155624/p_norm
2 years ago
TianYuan 7743c6a1ff add onnx models for aishell3/ljspeech/vctk's tts3/voc1/voc5, test=tts
2 years ago
KP b230dfbdec Add kws cli and demo.
2 years ago
huangyuxin 8f5e61090b new feature: Add webdataset in audio
2 years ago
TianYuan 46ff848d66
Merge pull request #2056 from lym0302/develop
2 years ago
Hui Zhang e04cd18846
Merge pull request #2050 from zh794390558/onnx_quant
2 years ago
lym0302 7d4f320836 fix_model_init, test=doc
2 years ago
TianYuan d1aa83a239
Merge pull request #2052 from yt605155624/ernie_sat
2 years ago
Hui Zhang 54a777055a
Merge pull request #2039 from iftaken/dev-hym
2 years ago
Hui Zhang d20adb5c89
Merge pull request #2048 from KPatr1ck/import_bug
2 years ago
TianYuan 79658a5f20 add ernie sat inference, test=tts
2 years ago
TianYuan 02734141ce
Merge pull request #2040 from yt605155624/add_blank
2 years ago
Hui Zhang d95b0cd9b2 add release and resource
2 years ago
Hui Zhang 3cf1f1f0b5 support onnx quantize
2 years ago
Jackwaterveg 6dfe7273e6
Merge pull request #2045 from zh794390558/wenetspeech_onnx
2 years ago
KP b452be3d8d Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago
KP 220bcffac8 Fix circular import error in paddlespeech.cli.utils and paddlespeech.audio
2 years ago