Commit Graph

1205 Commits (72ce8861779cc7fef9eb3277217878fd65375c58)

Author SHA1 Message Date
Zth9730 e6d20888c5
支持0维Tensor需要的修改 (#2621)
2 years ago
David An (An Hongliang) 8a5fe83e1d
add ssml sentences.txt (#2620)
2 years ago
Hui Zhang 2c34481ea0
[s2t] quant with wav scp (#2568)
2 years ago
Zth9730 8d3464c050
[s2t] Update wav2vec2 license (#2600)
2 years ago
liangym e18170228c
[tts] add adversarial loss (#2588)
2 years ago
TianYuan 9aab706cba
fix frontend bug, test=tts (#2606)
2 years ago
WongLaw e348aa825d Added Rhythm Prediction, test=tts
2 years ago
WongLaw b96fb1d57e Added Rythm Prediction, test=tts
2 years ago
WongLaw d27364d141 Added Text Rhythm Prediction, test=tts
2 years ago
HuangLiangJie 872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
2 years ago
YangZhou bbf2401e3e
Merge pull request #2524 from zh794390558/u2
2 years ago
WongLaw 72bbabbf79 Revised structure of rhythm prediction, test=tts
2 years ago
david.95 ed0138c6e3 add condition check if a ssml input and filter space line, test=tts
2 years ago
David An (An Hongliang) 21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan 63c80121e2 fix uvicorn's bug
2 years ago
TianYuan 2a60c3d854
Merge pull request #2554 from dahu1/develop
2 years ago
david.95 3ac7ac253f fix review issue,test=tts
2 years ago
David An (An Hongliang) 0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
Zth9730 68134c8436
fix u2pp model (#2549)
2 years ago
dahu1 cb76e66401 1.token配置不写死,2.text显示不乱码, test=asr
2 years ago
Hui Zhang eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
2 years ago
tianhao zhang 1ea828c30e fix attention val bug
2 years ago
David An (An Hongliang) 103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan 2d71577e75
fix g2p (#2539)
2 years ago
david.95 f295d2d445 remove useless code
2 years ago
david.95 89e9ea69eb modify __init__
2 years ago
david.95 1067088deb modify __init__
2 years ago
david.95 f56cc08b18 add license content, test=tts
2 years ago
david.95 29508f400b to fix CI issue, test=tts
2 years ago
david.95 60801d8f14 Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
2 years ago
David An (An Hongliang) ce21f9bc41
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
david.95 278c7a41a8 add module define to fix ci, test=tts
2 years ago
Hui Zhang 964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 86f65f0b8e fix wav2vec2 report loss bug
2 years ago
david.95 13a7fa9808 enable chinese words' pinyin specified in text of ssml formats, test=tts
2 years ago
Hui Zhang f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
2 years ago
tianhao zhang 2ae94bd277 freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang 3d994f5c23 format wav2vec2 demo
2 years ago
Hui Zhang c6f9764ed6
Merge pull request #2510 from Zth9730/u2pp_jit_export
2 years ago
tianhao zhang 19180d359d format wav2vec2 demo
2 years ago
tianhao zhang 6e429f0513 support wav2vec2ASR on librispeech
2 years ago
Hui Zhang 290c23b9d7 add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang e367242765 update dependency of paddle
2 years ago
tianhao zhang 5a66a14659 fix u2pp model version number
2 years ago
tianhao zhang cda440e6f0 use reverse_weight in decode.yaml
2 years ago
Zth9730 c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
2 years ago
Hui Zhang c98b5dd173 fix masked_fill which will nan in trainning
2 years ago
Hui Zhang 9277fcb8a8 fix attn can not train
2 years ago
Hui Zhang 1f4f98b171 fix bug
2 years ago
liangym 0359c3f6b5
Fix mix front (#2493)
2 years ago
Hui Zhang e86337a423 fix bug
2 years ago
Hui Zhang 925abcca23 format
2 years ago
Hui Zhang 2a75405e9a Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang 3ed24474d2 wenetspeech asr1 quant
2 years ago
Hui Zhang 467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
2 years ago
tianhao zhang 5b5167b586 support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
YangZhou 3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
2 years ago
ZapBird 7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 (#2484)
2 years ago
tianhao zhang 5bbe6e9897 support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
Hui Zhang bdf876ea7b Merge branch 'develop' into u2pp_export
2 years ago
Zhao Yuting 304dc2603c
Update text_engine.py
2 years ago
Zhao Yuting 8c945c073d
Update application.yaml
2 years ago
Zhao Yuting b9693a0e8e
Update text_engine.py
2 years ago
Zhao Yuting 8ecf6796f3
Update text_engine.py
2 years ago
Hui Zhang afda7ed7d1 remove useless code
2 years ago
YangZhou 4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
2 years ago
Hui Zhang b20bf7d5de masked_fill by multiply, remove while
2 years ago
Zhao Yuting d2da7f50d2
Update text_engine.py
2 years ago
Zhao Yuting 82f731c153
Update application.yaml
2 years ago
Hui Zhang feb27e2a84 fuse linear kv
2 years ago
Hui Zhang 3adb20b468 eliminate shape and slice
2 years ago
Hui Zhang 46088c0a16 elimiate attn transpose
2 years ago
Hui Zhang f9e3eaa024 transpose in matmul
2 years ago
Hui Zhang 3d7ca93861 bool type slice
2 years ago
Hui Zhang c2c8a662b1 refactor reshape
2 years ago
Hui Zhang 6de81d74d9 elimiete cast dtype for bool op
2 years ago
Hui Zhang 8e7a315e00 remove comment
2 years ago
Hui Zhang c4a5ae3825 eliminate mul
2 years ago
Hui Zhang b7388ce25a eliminate useless unsqueese
2 years ago
Hui Zhang 1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
2 years ago
TianYuan 52af86fcc3
fix ERNIE-SAT bug when edit the end of sentenses, test=doc (#2432)
2 years ago
tianhao zhang d3e5937591 support bitransformer decoder
2 years ago
Hui Zhang 7382050e21 fix bug on win
2 years ago
TianYuan b14da765e8
frm random spk embedding in voice cloning, test=doc (#2429)
2 years ago
Hui Zhang d25871a7b0 format
2 years ago
Hui Zhang b10512eb0e more config or u2pp
2 years ago
Hui Zhang 00b2c1c8fb fix forward attention decoder caller
2 years ago
zhoupc2015 2ae0f66d0d
Solve "unknown format: 3" (#2422)
2 years ago
Hui Zhang 309c8d70d9 add reverse weight
2 years ago
Hui Zhang 9b66680ea4 Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang 027535dec1 support bitransformer decoder, test=asr
2 years ago
THUzyt21 bdbacd4249 precomited
2 years ago
Zhao Yuting d5dec46336
Update README.md
2 years ago
Zhao Yuting 18b71dc136
Update README.md
2 years ago
tianhao zhang 0a95689461 support bitransformer decoder
2 years ago
tianhao zhang 455379b88e support bitransformer decoder
2 years ago
Zhao Yuting a63a0b1350
Update pretrained_models.py
2 years ago
Zhao Yuting 12a11394bd
Update infer.py
2 years ago
Zhao Yuting fb7f04e021
Update README.md
2 years ago
Zhao Yuting 92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting 57dcd0d17f
Update infer.py
2 years ago
Zhao Yuting b627666ce9
Update model_alias.py
2 years ago
Zhao Yuting a02654660a
Update pretrained_models.py
2 years ago
tianhao zhang ecbf324286 support bitransformer decoder, test=asr
2 years ago
tianhao zhang 1a56a6e42b add bitransformer decoder, test=asr
2 years ago
Hui Zhang 53d6baff0b format
2 years ago
Hui Zhang 549d477592 fix code style
2 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
2 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
2 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
2 years ago
TianYuan 5e714ecb4a
[doc]update api docs (#2406)
2 years ago
TianYuan eac362057c
add typehint for g2pw (#2390)
2 years ago
Hui Zhang 0d7d87120b simplify feature pipeline graph
2 years ago
WongLaw 324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts (#2380)
2 years ago
TianYuan 80b180217d
[TTS] fix some bugs of ERNIE-SAT (#2378)
2 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang 07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
2 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
2 years ago
Hui Zhang 1c9f238ba0 configurable export
2 years ago
Hui Zhang 63aeb747b0 more comment
2 years ago
Hui Zhang a7c6c54e75 fix
2 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang 663e3ab58e fix dp init
2 years ago
tianhao zhang 6745e9dd6b fix dp init
2 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
2 years ago
WongLaw 989b755e8e
Revised must_neural_tone_words, test=doc. (#2370)
2 years ago
tianhao zhang 9560d650db fix dp init
2 years ago
TianYuan 7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
2 years ago
tianhao zhang 82e04d7815 fix trianer
2 years ago
TianYuan f7873773bf
uadd __init__.py for VITS, test=tts (#2362)
2 years ago
TianYuan 35c6ffa90b Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan e622f42d92 add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan 1c30cff1bf
fix gpus of ernie_sat, test=tts (#2355)
2 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
艾梦 ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 (#2268)
2 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
2 years ago
TianYuan 5d5888af8e
fix tone, update readme (#2335)
2 years ago
贾晓 0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
Hui Zhang 58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang 1dfca4ef73 fix multigpu training
2 years ago
Hui Zhang 94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang) f5367f5efb
[TTS]fix bug of tone modify (#2323)
2 years ago
Zhao Yuting c28064fec2
Update asr_engine.py (#2302)
2 years ago
TianYuan 7b864e8f38
clean old ernie sat inference scripts (#2316)
2 years ago
David An (An Hongliang) c7163abffa
add thanks into readme, append data for chinese unit (#2312)
2 years ago
彭震东 c9de22eaa8
[TN] Update quantifiers (#2308)
2 years ago
TianYuan d1c70a7809
fix g2pw model (#2304)
2 years ago
liangym 043b21d3b4
fix mix frontend, test=tts (#2299)
2 years ago
David An (An Hongliang) 25b96405df
add chinese words correct phonic,test=tts (#2300)
2 years ago
TianYuan c1d4551055
add ernie sat synthesize_e2e, test=tts (#2287)
2 years ago
李子 5a58a27492
[TTS]指定G2PW的传入数据类型 , test=tts (#2288)
2 years ago
TianYuan 3f9339edff
Update polyphonic.yaml
2 years ago
TianYuan f9a6970a62
Merge pull request #2263 from oyjxer/pc
2 years ago
lym0302 677e0961a8 fix point bug, test=tts
2 years ago
TianYuan 4a59702d60
Merge pull request #2255 from lym0302/develop
2 years ago
TianYuan 0baec4325a fix stats bugs
2 years ago
TianYuan f7780658db fix tone sand_hi bugs for Chinese frontend
2 years ago
pangchao04 b9be2bd64a add ernie-sat sampler
2 years ago
lym0302 f8f73e41f0 fix point bug, test=tts
2 years ago
TianYuan 5de2c2dab5 format g2pw
2 years ago
TianYuan 5d515f3f3f update mix tts
2 years ago
TianYuan a75b2a5bab
Merge pull request #2230 from BarryKCL/develop_g2pW
2 years ago
TianYuan db89cfe829
Merge pull request #2234 from lym0302/mix_example
2 years ago
TianYuan 8dbefc0165 fix preprocess bug, add hifigan_csmsc decoder, update readme
2 years ago
BarryKCL a84b40ef79 update g2pW dict
2 years ago
Zhao Yuting d02e04d532
Update audio_handler.py
2 years ago
BarryKCL 6593c24968 set window_size None
2 years ago
BarryKCL 5e63ac1e60 Fix a bug in g2pW
2 years ago
TianYuan 0eb598b876
Merge pull request #2235 from david-95/hongliang-dev
2 years ago
david.95 0df7fc8fbf remove comment
2 years ago
david.95 7ba74f175f remove comment
2 years ago
david.95 f52a87b8d0 remove useless fix, test=tts
2 years ago
david.95 a48e4f249f add filter for double punctuation, revise comment ;
2 years ago
BarryKCL aecf8fd384 add onnxruntime sess_options
2 years ago
lym0302 368e3e1b59 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into mix_example
2 years ago
lym0302 894556f871 add zh_en mix example, test=tts
2 years ago
david.95 1edd474bcb add filter for double punctuation in sentences; add homonym, test=tts
2 years ago
BarryKCL 61dd92e49c update
2 years ago
BarryKCL de0f99150a change G2PWModel download
2 years ago
BarryKCL 744ea44279 add comment
2 years ago
BarryKCL 7b0f2a796d change transformers to paddlenlp.transformers
2 years ago
BarryKCL e60a63fbdd Rollback "get_input_ids"
2 years ago
BarryKCL ab2a1219c8 Add g2pW to Chinese frontend
2 years ago
TianYuan 2f9bdf2306
Merge pull request #2222 from yt605155624/add_onnx_cli
2 years ago
TianYuan c3d47441cf fix fs bug in inference.py (change fixed 24000 to variable for ljspeech)
2 years ago
TianYuan 8da993bbf8 fix fs bug
2 years ago
TianYuan 788a3062d0 fix onnx am_ckpt from list to item in prtrained_mdoels.py
2 years ago
TianYuan c6b25c05f4 change logger.debug to logger.info for streaming asr
2 years ago
Hui Zhang c1fbfe928e add test
2 years ago
TianYuan cd662a08e0 fix for load specified model files
2 years ago
TianYuan b9ade18055 add onnxruntime infer for cli
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 923b0b873e fix import kws.exps.mdtc
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago