Commit Graph

1278 Commits (develop)

Author SHA1 Message Date
liangym 96d76c83ad
multi-spk tts static model (#2779)
3 years ago
HuangLiangJie 2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths (#2770)
3 years ago
TianYuan 6725bcd823
revise paddlenlp's version (#2767)
3 years ago
TianYuan 979bbd9dcb
add mkldnn and trt config for paddleInference (#2748)
3 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
3 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
3 years ago
TianYuan 3f6afc4834
[TTS]Add slim for TTS (#2729)
3 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
3 years ago
HuangLiangJie a874d8f325
Add prosody prediction in synthesize_e2e, test=tts (#2693)
3 years ago
TianYuan 62357d876c
[TTS]rm paddlelite in setup.py (#2713)
3 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
3 years ago
David An (An Hongliang) bd01bc155d
add greek char and fix issue2571 (#2683)
3 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
3 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
3 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
3 years ago
TianYuan 0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference (#2667)
3 years ago
David An (An Hongliang) 1c3d2cb89e
add double byte char for zh normalization (#2661)
3 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
3 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
3 years ago
kFoodie dc9d3baf51
Update onnx_api.py (#2664)
3 years ago
liangym 25b6bf9668
[tts] Add male voice for tts (#2660)
3 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
3 years ago
HuangLiangJie b7312e9f0b
Revised TN qualifier for measure notation, test=tts (#2629)
3 years ago
Zth9730 e6d20888c5
支持0维Tensor需要的修改 (#2621)
3 years ago
David An (An Hongliang) 8a5fe83e1d
add ssml sentences.txt (#2620)
3 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
3 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
3 years ago
liangym e18170228c
[tts] add adversarial loss (#2588)
3 years ago
TianYuan 9aab706cba
fix frontend bug, test=tts (#2606)
3 years ago
WongLaw e348aa825d Added Rhythm Prediction, test=tts
3 years ago
WongLaw b96fb1d57e Added Rythm Prediction, test=tts
3 years ago
WongLaw d27364d141 Added Text Rhythm Prediction, test=tts
3 years ago
HuangLiangJie 872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
3 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
3 years ago
WongLaw 72bbabbf79 Revised structure of rhythm prediction, test=tts
3 years ago
david.95 ed0138c6e3 add condition check if a ssml input and filter space line, test=tts
3 years ago
David An (An Hongliang) 21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
TianYuan 63c80121e2 fix uvicorn's bug
3 years ago
TianYuan 2a60c3d854
Merge pull request #2554 from dahu1/develop
3 years ago
david.95 3ac7ac253f fix review issue,test=tts
3 years ago
David An (An Hongliang) 0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
Zth9730 68134c8436
fix u2pp model (#2549)
3 years ago
dahu1 cb76e66401 1.token配置不写死,2.text显示不乱码, test=asr
3 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
3 years ago
tianhao zhang 1ea828c30e fix attention val bug
3 years ago
David An (An Hongliang) 103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
TianYuan 2d71577e75
fix g2p (#2539)
3 years ago
david.95 f295d2d445 remove useless code
3 years ago
david.95 89e9ea69eb modify __init__
3 years ago
david.95 1067088deb modify __init__
3 years ago
david.95 f56cc08b18 add license content, test=tts
3 years ago
david.95 29508f400b to fix CI issue, test=tts
3 years ago
david.95 60801d8f14 Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
3 years ago
David An (An Hongliang) ce21f9bc41
Merge branch 'PaddlePaddle:develop' into hongliang1014
3 years ago
david.95 278c7a41a8 add module define to fix ci, test=tts
3 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
3 years ago
david.95 13a7fa9808 enable chinese words' pinyin specified in text of ssml formats, test=tts
3 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
3 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
3 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
3 years ago
Hui Zhang c6f9764ed6
Merge pull request #2510 from Zth9730/u2pp_jit_export
3 years ago
tianhao zhang 19180d359d format wav2vec2 demo
3 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
3 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
3 years ago
tianhao zhang e367242765 update dependency of paddle
3 years ago
tianhao zhang 5a66a14659 fix u2pp model version number
3 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
3 years ago
Zth9730 c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
3 years ago
Hui Zhang c98b5dd173 fix masked_fill which will nan in trainning
3 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
3 years ago
Hui Zhang 1f4f98b171 fix bug
3 years ago
liangym 0359c3f6b5
Fix mix front (#2493)
3 years ago
Hui Zhang e86337a423 fix bug
3 years ago
Hui Zhang 925abcca23 format
3 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
3 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
3 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
3 years ago
tianhao zhang 5b5167b586 support u2pp cli and server, optimiz code of u2pp decode, test=asr
3 years ago
YangZhou 3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
3 years ago
ZapBird 7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 (#2484)
3 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimiz code of u2pp decode, test=asr
3 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
3 years ago
Zhao Yuting 304dc2603c
Update text_engine.py
3 years ago
Zhao Yuting 8c945c073d
Update application.yaml
3 years ago
Zhao Yuting b9693a0e8e
Update text_engine.py
3 years ago
Zhao Yuting 8ecf6796f3
Update text_engine.py
3 years ago
Hui Zhang afda7ed7d1 remove useless code
3 years ago
YangZhou 4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
3 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
3 years ago
Zhao Yuting d2da7f50d2
Update text_engine.py
3 years ago
Zhao Yuting 82f731c153
Update application.yaml
3 years ago
Hui Zhang feb27e2a84 fuse linear kv
3 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
3 years ago
Hui Zhang 46088c0a16 elimiate attn transpose
3 years ago
Hui Zhang f9e3eaa024 transpose in matmul
3 years ago
Hui Zhang 3d7ca93861 bool type slice
3 years ago
Hui Zhang c2c8a662b1 refactor reshape
3 years ago
Hui Zhang 6de81d74d9 elimiete cast dtype for bool op
3 years ago
Hui Zhang 8e7a315e00 remove comment
3 years ago
Hui Zhang c4a5ae3825 eliminate mul
3 years ago
Hui Zhang b7388ce25a eliminate useless unsqueese
3 years ago
Hui Zhang 1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
3 years ago
TianYuan 52af86fcc3
fix ERNIE-SAT bug when edit the end of sentenses, test=doc (#2432)
3 years ago
tianhao zhang d3e5937591 support bitransformer decoder
3 years ago
Hui Zhang 7382050e21 fix bug on win
3 years ago
TianYuan b14da765e8
frm random spk embedding in voice cloning, test=doc (#2429)
3 years ago
Hui Zhang d25871a7b0 format
3 years ago
Hui Zhang b10512eb0e more config or u2pp
3 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
3 years ago
zhoupc2015 2ae0f66d0d
Solve "unknown format: 3" (#2422)
3 years ago
Hui Zhang 309c8d70d9 add reverse weight
3 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
3 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
3 years ago
THUzyt21 bdbacd4249 precomited
3 years ago
Zhao Yuting d5dec46336
Update README.md
3 years ago
Zhao Yuting 18b71dc136
Update README.md
3 years ago
tianhao zhang 0a95689461 support bitransformer decoder
3 years ago
tianhao zhang 455379b88e support bitransformer decoder
3 years ago
Zhao Yuting a63a0b1350
Update pretrained_models.py
3 years ago
Zhao Yuting 12a11394bd
Update infer.py
3 years ago
Zhao Yuting fb7f04e021
Update README.md
3 years ago
Zhao Yuting 92d09d5cce
Update README_cn.md
3 years ago
Zhao Yuting 57dcd0d17f
Update infer.py
3 years ago
Zhao Yuting b627666ce9
Update model_alias.py
3 years ago
Zhao Yuting a02654660a
Update pretrained_models.py
3 years ago
tianhao zhang ecbf324286 support bitransformer decoder, test=asr
3 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
3 years ago
Hui Zhang 53d6baff0b format
3 years ago
Hui Zhang 549d477592 fix code style
3 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
3 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
3 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
3 years ago
TianYuan 5e714ecb4a
[doc]update api docs (#2406)
3 years ago
TianYuan eac362057c
add typehint for g2pw (#2390)
3 years ago
Hui Zhang 0d7d87120b simplify feature pipeline graph
3 years ago
WongLaw 324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts (#2380)
3 years ago
TianYuan 80b180217d
[TTS] fix some bugs of ERNIE-SAT (#2378)
3 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
3 years ago
Hui Zhang 07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
3 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
3 years ago
Hui Zhang 1c9f238ba0 configurable export
3 years ago
Hui Zhang 63aeb747b0 more comment
3 years ago
Hui Zhang a7c6c54e75 fix
3 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
3 years ago
tianhao zhang 663e3ab58e fix dp init
3 years ago
tianhao zhang 6745e9dd6b fix dp init
3 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
3 years ago
WongLaw 989b755e8e
Revised must_neural_tone_words, test=doc. (#2370)
3 years ago
tianhao zhang 9560d650db fix dp init
3 years ago
TianYuan 7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
3 years ago
tianhao zhang 82e04d7815 fix trianer
3 years ago
TianYuan f7873773bf
uadd __init__.py for VITS, test=tts (#2362)
3 years ago
TianYuan 35c6ffa90b Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
3 years ago
TianYuan e622f42d92 add aishell3 voice cloning with ECAPA-TDNN spk encoder
3 years ago
TianYuan 1c30cff1bf
fix gpus of ernie_sat, test=tts (#2355)
3 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
3 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
3 years ago
艾梦 ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
3 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
3 years ago
TianYuan 5d5888af8e
fix tone, update readme (#2335)
3 years ago
贾晓 0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
3 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
3 years ago
tianhao zhang ed2819d7af fix format test=asr
3 years ago
Hui Zhang 58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
3 years ago
tianhao zhang 1dfca4ef73 fix multigpu training
3 years ago
Hui Zhang 94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
3 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
3 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
3 years ago
David An (An Hongliang) f5367f5efb
[TTS]fix bug of tone modify (#2323)
3 years ago
Zhao Yuting c28064fec2
Update asr_engine.py (#2302)
3 years ago
TianYuan 7b864e8f38
clean old ernie sat inference scripts (#2316)
3 years ago
David An (An Hongliang) c7163abffa
add thanks into readme, append data for chinese unit (#2312)
3 years ago
彭震东 c9de22eaa8
[TN] Update quantifiers (#2308)
3 years ago
TianYuan d1c70a7809
fix g2pw model (#2304)
3 years ago
liangym 043b21d3b4
fix mix frontend, test=tts (#2299)
3 years ago
David An (An Hongliang) 25b96405df
add chinese words correct phonic,test=tts (#2300)
3 years ago
TianYuan c1d4551055
add ernie sat synthesize_e2e, test=tts (#2287)
3 years ago
李子 5a58a27492
[TTS]指定G2PW的传入数据类型 , test=tts (#2288)
3 years ago
TianYuan 3f9339edff
Update polyphonic.yaml
3 years ago
TianYuan f9a6970a62
Merge pull request #2263 from oyjxer/pc
3 years ago
lym0302 677e0961a8 fix point bug, test=tts
3 years ago
TianYuan 4a59702d60
Merge pull request #2255 from lym0302/develop
3 years ago
TianYuan 0baec4325a fix stats bugs
3 years ago
TianYuan f7780658db fix tone sand_hi bugs for Chinese frontend
3 years ago
pangchao04 b9be2bd64a add ernie-sat sampler
3 years ago
lym0302 f8f73e41f0 fix point bug, test=tts
3 years ago
TianYuan 5de2c2dab5 format g2pw
3 years ago
TianYuan 5d515f3f3f update mix tts
3 years ago
TianYuan a75b2a5bab
Merge pull request #2230 from BarryKCL/develop_g2pW
3 years ago
TianYuan db89cfe829
Merge pull request #2234 from lym0302/mix_example
3 years ago
TianYuan 8dbefc0165 fix preprocess bug, add hifigan_csmsc decoder, update readme
3 years ago
BarryKCL a84b40ef79 update g2pW dict
3 years ago
Zhao Yuting d02e04d532
Update audio_handler.py
3 years ago
BarryKCL 6593c24968 set window_size None
3 years ago
BarryKCL 5e63ac1e60 Fix a bug in g2pW
3 years ago
TianYuan 0eb598b876
Merge pull request #2235 from david-95/hongliang-dev
3 years ago
david.95 0df7fc8fbf remove comment
3 years ago
david.95 7ba74f175f remove comment
3 years ago
david.95 f52a87b8d0 remove useless fix, test=tts
3 years ago
david.95 a48e4f249f add filter for double punctuation, revise comment ;
3 years ago
BarryKCL aecf8fd384 add onnxruntime sess_options
3 years ago
lym0302 368e3e1b59 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
3 years ago
lym0302 894556f871 add zh_en mix example, test=tts
3 years ago
david.95 1edd474bcb add filter for double punctuation in sentences; add homonym, test=tts
3 years ago
BarryKCL 61dd92e49c update
3 years ago
BarryKCL de0f99150a change G2PWModel download
3 years ago
BarryKCL 744ea44279 add comment
3 years ago
BarryKCL 7b0f2a796d change transformers to paddlenlp.transformers
3 years ago
BarryKCL e60a63fbdd Rollback "get_input_ids"
3 years ago
BarryKCL ab2a1219c8 Add g2pW to Chinese frontend
3 years ago
TianYuan 2f9bdf2306
Merge pull request #2222 from yt605155624/add_onnx_cli
3 years ago
TianYuan c3d47441cf fix fs bug in inference.py (change fixed 24000 to variable for ljspeech)
3 years ago
TianYuan 8da993bbf8 fix fs bug
3 years ago
TianYuan 788a3062d0 fix onnx am_ckpt from list to item in prtrained_mdoels.py
3 years ago
TianYuan c6b25c05f4 change logger.debug to logger.info for streaming asr
3 years ago
Hui Zhang c1fbfe928e add test
3 years ago
TianYuan cd662a08e0 fix for load specified model files
3 years ago
TianYuan b9ade18055 add onnxruntime infer for cli
3 years ago
Hui Zhang 05bc258833 update docstring
3 years ago
Hui Zhang 6149daa221 export ctc_activation
3 years ago
huangyuxin 923b0b873e fix import kws.exps.mdtc
3 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
3 years ago
TianYuan b0b3222f9a
Merge pull request #2213 from yt605155624/fix_name_bug
3 years ago
TianYuan 354601d0e9 fix readme, test=doc
3 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
3 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
3 years ago
TianYuan 510e240c5c achange default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
3 years ago
TianYuan 00e9853f66 add mix tts cli, test=tts
3 years ago
0x45f 4e7106d9e2 Support dy2st
3 years ago
TianYuan 1f128a0817
Merge pull request #2117 from yt605155624/ernie_sat_trainer
3 years ago
TianYuan 1bf78fa5c7 updatte batch_fn train.py, test=doc
3 years ago
TianYuan 9d4161ce5f update config, test=doc
3 years ago
lym0302 e1f8695456 add mix tts, test=tts
3 years ago
Betterman-qs e2dc204d4d update engine_warmup.py, test=tts
3 years ago
Betterman-qs cf1b873528 update engine_warmup.py, test=tts
3 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
3 years ago
0x45f e21cceea51 Remove blank line
3 years ago
0x45f e6ac8881f1 Fix comments
3 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
3 years ago
Hui Zhang d098e027ca
Merge pull request #2155 from Jackwaterveg/develop_dev
3 years ago
0x45f 294b7b00bd Supprot dy2st for conformer
3 years ago
TianYuan 97965f4c37 fix mlm_prob, test=tts
3 years ago
huangyuxin 7c9208765f fix audio,test=doc
3 years ago
huangyuxin 75997d8277 merge
3 years ago
TianYuan 72fa8176ca fix for mix_lang
3 years ago
TianYuan 5503c8bd6b add ernie_sat synthesize script for metadata.jsonl, test=tts
3 years ago
TianYuan f4ac0c79d9
Merge pull request #2143 from lym0302/mix_front
3 years ago
Jackwaterveg ae7a73bc11
Merge pull request #2138 from zh794390558/demos
3 years ago
lym0302 207bb5d93b add mix frontend, test=tts
3 years ago