Commit Graph

1248 Commits (ee4f15826bb4556c6407e8882012776191782a23)

Author SHA1 Message Date
HuangLiangJie acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts (#2907)
2 years ago
zxcd 047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916)
2 years ago
艾梦 bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result (#2902)
2 years ago
zxcd 6728db5b59
[ASR]Whisper remove audio duration limit, test=asr (#2900)
2 years ago
zxcd f6b624ddc8
add encoding=utf8 for text cli. (#2896)
2 years ago
章宏彬 c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment (#2860)
2 years ago
TianYuan a283f8a57e
[TTS]fix open encoding (#2865)
2 years ago
艾梦 a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param (#2868)
2 years ago
QuanZ9 ac3ed3c5a8
Update zh_frontend.py (#2863)
2 years ago
zxcd 64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). (#2859)
2 years ago
zxcd 31c2c226ca
clean fluid elementwise_max and square api. (#2852)
2 years ago
HuangLiangJie 140aed4b54
[TTS]VITS init sampler reverse, test=tts (#2843)
2 years ago
艾梦 57b9d4bca4
add diffusion module for training diffsinger (#2832)
2 years ago
TianYuan 1fd38c0e8b
fix o (#2831)
2 years ago
晋东毅 742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese (#2830)
2 years ago
cxumol a99244d86e
fix: whisper language choice, test=asr (#2828)
2 years ago
zxcd ad40dafa85
fix some bug. (#2825)
2 years ago
HuangLiangJie faa2f86651
[TTS]update VITS init method (#2809)
2 years ago
zxcd 88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' (#2816)
2 years ago
HuangLiangJie 964211a81b
Change optimizer for vits, test=tts (#2791)
2 years ago
liangym 96d76c83ad
multi-spk tts static model (#2779)
2 years ago
HuangLiangJie 2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths (#2770)
2 years ago
TianYuan 6725bcd823
revise paddlenlp's version (#2767)
2 years ago
TianYuan 979bbd9dcb
add mkldnn and trt config for paddleInference (#2748)
2 years ago
zxcd a8a240d4ef
remove paddle.fluid (#2740)
2 years ago
YangZhou 12fa8a2d19
[audio]patch:fix tensor_utils error (#2738)
2 years ago
TianYuan 3f6afc4834
[TTS]Add slim for TTS (#2729)
2 years ago
YangZhou 42ff946007
[audio] mv paddlespeech/audio to paddleaudio (#2706)
2 years ago
HuangLiangJie a874d8f325
Add prosody prediction in synthesize_e2e, test=tts (#2693)
2 years ago
TianYuan 62357d876c
[TTS]rm paddlelite in setup.py (#2713)
2 years ago
Zth9730 c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr (#2697)
2 years ago
David An (An Hongliang) bd01bc155d
add greek char and fix issue2571 (#2683)
2 years ago
zxcd 4542684694
[ASR] fix Whisper cli model download path error. test=asr (#2679)
2 years ago
Zth9730 fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc (#2674)
2 years ago
zxcd b71f1428c7
add all whisper model size support, test=asr (#2677)
2 years ago
TianYuan 0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference (#2667)
2 years ago
David An (An Hongliang) 1c3d2cb89e
add double byte char for zh normalization (#2661)
2 years ago
Zth9730 94a487bd81
[ASR] support wav2vec2 command line and demo (#2658)
2 years ago
zxcd b1d3f59bcb
[s2t] add whisper asr large model (#2640)
2 years ago
kFoodie dc9d3baf51
Update onnx_api.py (#2664)
2 years ago
liangym 25b6bf9668
[tts] Add male voice for tts (#2660)
2 years ago
Zth9730 8d3494320d
[ASR] wav2vec2_en, test=asr (#2637)
2 years ago
HuangLiangJie b7312e9f0b
Revised TN qualifier for measure notation, test=tts (#2629)
2 years ago
Zth9730 e6d20888c5
支持0维Tensor需要的修改 (#2621)
2 years ago
David An (An Hongliang) 8a5fe83e1d
add ssml sentences.txt (#2620)
2 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
2 years ago
liangym e18170228c
[tts] add adversarial loss (#2588)
2 years ago
TianYuan 9aab706cba
fix frontend bug, test=tts (#2606)
2 years ago
WongLaw e348aa825d Added Rhythm Prediction, test=tts
2 years ago
WongLaw b96fb1d57e Added Rythm Prediction, test=tts
2 years ago
WongLaw d27364d141 Added Text Rhythm Prediction, test=tts
2 years ago
HuangLiangJie 872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
2 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
2 years ago
WongLaw 72bbabbf79 Revised structure of rhythm prediction, test=tts
2 years ago
david.95 ed0138c6e3 add condition check if a ssml input and filter space line, test=tts
2 years ago
David An (An Hongliang) 21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan 63c80121e2 fix uvicorn's bug
2 years ago
TianYuan 2a60c3d854
Merge pull request #2554 from dahu1/develop
2 years ago
david.95 3ac7ac253f fix review issue,test=tts
2 years ago
David An (An Hongliang) 0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
Zth9730 68134c8436
fix u2pp model (#2549)
2 years ago
dahu1 cb76e66401 1.token配置不写死,2.text显示不乱码, test=asr
2 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
2 years ago
tianhao zhang 1ea828c30e fix attention val bug
2 years ago
David An (An Hongliang) 103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan 2d71577e75
fix g2p (#2539)
2 years ago
david.95 f295d2d445 remove useless code
2 years ago
david.95 89e9ea69eb modify __init__
2 years ago
david.95 1067088deb modify __init__
2 years ago
david.95 f56cc08b18 add license content, test=tts
2 years ago
david.95 29508f400b to fix CI issue, test=tts
2 years ago
david.95 60801d8f14 Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
2 years ago
David An (An Hongliang) ce21f9bc41
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
david.95 278c7a41a8 add module define to fix ci, test=tts
2 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
2 years ago
david.95 13a7fa9808 enable chinese words' pinyin specified in text of ssml formats, test=tts
2 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
2 years ago
Hui Zhang c6f9764ed6
Merge pull request #2510 from Zth9730/u2pp_jit_export
2 years ago
tianhao zhang 19180d359d format wav2vec2 demo
2 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
2 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang e367242765 update dependency of paddle
2 years ago
tianhao zhang 5a66a14659 fix u2pp model version number
2 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
2 years ago
Zth9730 c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
2 years ago
Hui Zhang c98b5dd173 fix masked_fill which will nan in trainning
2 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
2 years ago
Hui Zhang 1f4f98b171 fix bug
2 years ago
liangym 0359c3f6b5
Fix mix front (#2493)
2 years ago
Hui Zhang e86337a423 fix bug
2 years ago
Hui Zhang 925abcca23 format
2 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
2 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
2 years ago
tianhao zhang 5b5167b586 support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
YangZhou 3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
2 years ago
ZapBird 7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 (#2484)
2 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
2 years ago
Zhao Yuting 304dc2603c
Update text_engine.py
2 years ago
Zhao Yuting 8c945c073d
Update application.yaml
2 years ago
Zhao Yuting b9693a0e8e
Update text_engine.py
2 years ago
Zhao Yuting 8ecf6796f3
Update text_engine.py
2 years ago
Hui Zhang afda7ed7d1 remove useless code
2 years ago
YangZhou 4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
2 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
2 years ago
Zhao Yuting d2da7f50d2
Update text_engine.py
2 years ago
Zhao Yuting 82f731c153
Update application.yaml
2 years ago
Hui Zhang feb27e2a84 fuse linear kv
2 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
2 years ago
Hui Zhang 46088c0a16 elimiate attn transpose
2 years ago
Hui Zhang f9e3eaa024 transpose in matmul
2 years ago
Hui Zhang 3d7ca93861 bool type slice
2 years ago
Hui Zhang c2c8a662b1 refactor reshape
2 years ago
Hui Zhang 6de81d74d9 elimiete cast dtype for bool op
2 years ago
Hui Zhang 8e7a315e00 remove comment
2 years ago
Hui Zhang c4a5ae3825 eliminate mul
2 years ago
Hui Zhang b7388ce25a eliminate useless unsqueese
2 years ago
Hui Zhang 1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
2 years ago
TianYuan 52af86fcc3
fix ERNIE-SAT bug when edit the end of sentenses, test=doc (#2432)
2 years ago
tianhao zhang d3e5937591 support bitransformer decoder
2 years ago
Hui Zhang 7382050e21 fix bug on win
2 years ago
TianYuan b14da765e8
frm random spk embedding in voice cloning, test=doc (#2429)
2 years ago
Hui Zhang d25871a7b0 format
2 years ago
Hui Zhang b10512eb0e more config or u2pp
2 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
2 years ago
zhoupc2015 2ae0f66d0d
Solve "unknown format: 3" (#2422)
2 years ago
Hui Zhang 309c8d70d9 add reverse weight
2 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
2 years ago
THUzyt21 bdbacd4249 precomited
2 years ago
Zhao Yuting d5dec46336
Update README.md
2 years ago
Zhao Yuting 18b71dc136
Update README.md
2 years ago
tianhao zhang 0a95689461 support bitransformer decoder
2 years ago
tianhao zhang 455379b88e support bitransformer decoder
2 years ago
Zhao Yuting a63a0b1350
Update pretrained_models.py
2 years ago
Zhao Yuting 12a11394bd
Update infer.py
2 years ago
Zhao Yuting fb7f04e021
Update README.md
2 years ago
Zhao Yuting 92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting 57dcd0d17f
Update infer.py
2 years ago
Zhao Yuting b627666ce9
Update model_alias.py
2 years ago
Zhao Yuting a02654660a
Update pretrained_models.py
2 years ago
tianhao zhang ecbf324286 support bitransformer decoder, test=asr
2 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
2 years ago
Hui Zhang 53d6baff0b format
2 years ago
Hui Zhang 549d477592 fix code style
2 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
2 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
2 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
2 years ago
TianYuan 5e714ecb4a
[doc]update api docs (#2406)
2 years ago
TianYuan eac362057c
add typehint for g2pw (#2390)
2 years ago
Hui Zhang 0d7d87120b simplify feature pipeline graph
2 years ago
WongLaw 324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts (#2380)
2 years ago
TianYuan 80b180217d
[TTS] fix some bugs of ERNIE-SAT (#2378)
2 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang 07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
2 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
2 years ago
Hui Zhang 1c9f238ba0 configurable export
2 years ago
Hui Zhang 63aeb747b0 more comment
2 years ago
Hui Zhang a7c6c54e75 fix
2 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang 663e3ab58e fix dp init
2 years ago
tianhao zhang 6745e9dd6b fix dp init
2 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
2 years ago
WongLaw 989b755e8e
Revised must_neural_tone_words, test=doc. (#2370)
2 years ago
tianhao zhang 9560d650db fix dp init
2 years ago
TianYuan 7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
2 years ago
tianhao zhang 82e04d7815 fix trianer
2 years ago
TianYuan f7873773bf
uadd __init__.py for VITS, test=tts (#2362)
2 years ago
TianYuan 35c6ffa90b Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan e622f42d92 add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan 1c30cff1bf
fix gpus of ernie_sat, test=tts (#2355)
2 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
艾梦 ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
2 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
2 years ago
TianYuan 5d5888af8e
fix tone, update readme (#2335)
2 years ago
贾晓 0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
Hui Zhang 58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang 1dfca4ef73 fix multigpu training
2 years ago
Hui Zhang 94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang) f5367f5efb
[TTS]fix bug of tone modify (#2323)
2 years ago
Zhao Yuting c28064fec2
Update asr_engine.py (#2302)
2 years ago
TianYuan 7b864e8f38
clean old ernie sat inference scripts (#2316)
2 years ago
David An (An Hongliang) c7163abffa
add thanks into readme, append data for chinese unit (#2312)
2 years ago
彭震东 c9de22eaa8
[TN] Update quantifiers (#2308)
2 years ago
TianYuan d1c70a7809
fix g2pw model (#2304)
2 years ago
liangym 043b21d3b4
fix mix frontend, test=tts (#2299)
2 years ago
David An (An Hongliang) 25b96405df
add chinese words correct phonic,test=tts (#2300)
2 years ago
TianYuan c1d4551055
add ernie sat synthesize_e2e, test=tts (#2287)
2 years ago
李子 5a58a27492
[TTS]指定G2PW的传入数据类型 , test=tts (#2288)
2 years ago
TianYuan 3f9339edff
Update polyphonic.yaml
2 years ago