HuangLiangJie
acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts ( #2907 )
2 years ago
zxcd
047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
2 years ago
艾梦
bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result ( #2902 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
* add diffusion noise clip to optimize sample result
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
zxcd
f6b624ddc8
add encoding=utf8 for text cli. ( #2896 )
2 years ago
章宏彬
c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment ( #2860 )
...
* Avoid using variable "attn_loss" before assignment
* Update tacotron2_updater.py
---------
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
a283f8a57e
[TTS]fix open encoding ( #2865 )
2 years ago
艾梦
a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param ( #2868 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
2 years ago
QuanZ9
ac3ed3c5a8
Update zh_frontend.py ( #2863 )
2 years ago
zxcd
64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). ( #2859 )
2 years ago
zxcd
31c2c226ca
clean fluid elementwise_max and square api. ( #2852 )
2 years ago
HuangLiangJie
140aed4b54
[TTS]VITS init sampler reverse, test=tts ( #2843 )
2 years ago
艾梦
57b9d4bca4
add diffusion module for training diffsinger ( #2832 )
2 years ago
TianYuan
1fd38c0e8b
fix o ( #2831 )
2 years ago
晋东毅
742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese ( #2830 )
...
* 添加.history
* [tts]添加中英混合语音合成时对中文SSML的支持
2 years ago
cxumol
a99244d86e
fix: whisper language choice, test=asr ( #2828 )
2 years ago
zxcd
ad40dafa85
fix some bug. ( #2825 )
2 years ago
HuangLiangJie
faa2f86651
[TTS]update VITS init method ( #2809 )
2 years ago
zxcd
88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' ( #2816 )
...
* add asr code-switch cli and demo.
* fix some model named problem.
2 years ago
HuangLiangJie
964211a81b
Change optimizer for vits, test=tts ( #2791 )
2 years ago
liangym
96d76c83ad
multi-spk tts static model ( #2779 )
...
* updata readme, test=doc
* update yaml and readme, test=tts
* fix batch_size, test=tts
* update readme, test=doc
* chmod, test=tts
* add multi-spk tts static model infer on server, test=tts
2 years ago
HuangLiangJie
2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths ( #2770 )
2 years ago
TianYuan
6725bcd823
revise paddlenlp's version ( #2767 )
2 years ago
TianYuan
979bbd9dcb
add mkldnn and trt config for paddleInference ( #2748 )
2 years ago
zxcd
a8a240d4ef
remove paddle.fluid ( #2740 )
2 years ago
YangZhou
12fa8a2d19
[audio]patch:fix tensor_utils error ( #2738 )
...
* fix tensor utils
2 years ago
TianYuan
3f6afc4834
[TTS]Add slim for TTS ( #2729 )
2 years ago
YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
...
* split paddlespeech/audio to paddleaudio.
* add sox io ,sox effect, kaldi native fbank to paddleaudio.
2 years ago
HuangLiangJie
a874d8f325
Add prosody prediction in synthesize_e2e, test=tts ( #2693 )
2 years ago
TianYuan
62357d876c
[TTS]rm paddlelite in setup.py ( #2713 )
...
* rm paddlelite in setup.py
* fix setup.py
2 years ago
Zth9730
c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr ( #2697 )
...
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
2 years ago
David An (An Hongliang)
bd01bc155d
add greek char and fix issue2571 ( #2683 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
...
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
2 years ago
Zth9730
fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc ( #2674 )
...
* fix wav2vec2 demos, test=doc
* fix wav2vec2 demos, test=doc
* fix enc_dropout and nor.py, test=asr
2 years ago
zxcd
b71f1428c7
add all whisper model size support, test=asr ( #2677 )
...
* add all whisper model size support
* add choices in parser.
2 years ago
TianYuan
0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference ( #2667 )
...
* Add export2lite, test=tts
* add tts paddlelite x86 inference, test=tts
* update released_model.md, test=tts
* add paddlelite in setup.py
* update
2 years ago
David An (An Hongliang)
1c3d2cb89e
add double byte char for zh normalization ( #2661 )
2 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
2 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
2 years ago
kFoodie
dc9d3baf51
Update onnx_api.py ( #2664 )
2 years ago
liangym
25b6bf9668
[tts] Add male voice for tts ( #2660 )
2 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
...
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
2 years ago
HuangLiangJie
b7312e9f0b
Revised TN qualifier for measure notation, test=tts ( #2629 )
2 years ago
Zth9730
e6d20888c5
支持0维Tensor需要的修改 ( #2621 )
2 years ago
David An (An Hongliang)
8a5fe83e1d
add ssml sentences.txt ( #2620 )
2 years ago
Hui Zhang
2c34481ea0
[s2t] quant with wav scp ( #2568 )
...
* add quant hint
* add paddleslim
* using paddleslim 2.3.4 and paddle 2.4
2 years ago
Zth9730
8d3464c050
[s2t] Update wav2vec2 license ( #2600 )
2 years ago
liangym
e18170228c
[tts] add adversarial loss ( #2588 )
2 years ago
TianYuan
9aab706cba
fix frontend bug, test=tts ( #2606 )
2 years ago
WongLaw
e348aa825d
Added Rhythm Prediction, test=tts
2 years ago
WongLaw
b96fb1d57e
Added Rythm Prediction, test=tts
2 years ago
WongLaw
d27364d141
Added Text Rhythm Prediction, test=tts
2 years ago
HuangLiangJie
872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
2 years ago
YangZhou
bbf2401e3e
Merge pull request #2524 from zh794390558/u2
...
[speechx] add u2/u2pp asr inference
2 years ago
WongLaw
72bbabbf79
Revised structure of rhythm prediction, test=tts
2 years ago
david.95
ed0138c6e3
add condition check if a ssml input and filter space line, test=tts
2 years ago
David An (An Hongliang)
21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan
63c80121e2
fix uvicorn's bug
2 years ago
TianYuan
2a60c3d854
Merge pull request #2554 from dahu1/develop
...
标点恢复代码更新,test=asr
2 years ago
david.95
3ac7ac253f
fix review issue,test=tts
2 years ago
David An (An Hongliang)
0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
Zth9730
68134c8436
fix u2pp model ( #2549 )
2 years ago
dahu1
cb76e66401
1.token配置不写死,2.text显示不乱码, test=asr
2 years ago
Hui Zhang
eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
...
[s2t] fix attention eval bug, do not compose kv in infer
2 years ago
tianhao zhang
1ea828c30e
fix attention val bug
2 years ago
David An (An Hongliang)
103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan
2d71577e75
fix g2p ( #2539 )
2 years ago
david.95
f295d2d445
remove useless code
2 years ago
david.95
89e9ea69eb
modify __init__
2 years ago
david.95
1067088deb
modify __init__
2 years ago
david.95
f56cc08b18
add license content, test=tts
2 years ago
david.95
29508f400b
to fix CI issue, test=tts
2 years ago
david.95
60801d8f14
Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
2 years ago
David An (An Hongliang)
ce21f9bc41
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
david.95
278c7a41a8
add module define to fix ci, test=tts
2 years ago
Hui Zhang
964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
...
[s2t] fix wav2vec2 report loss bug
2 years ago
tianhao zhang
86f65f0b8e
fix wav2vec2 report loss bug
2 years ago
david.95
13a7fa9808
enable chinese words' pinyin specified in text of ssml formats, test=tts
2 years ago
Hui Zhang
f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
...
[ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech
2 years ago
tianhao zhang
2ae94bd277
freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang
3d994f5c23
format wav2vec2 demo
2 years ago
Hui Zhang
c6f9764ed6
Merge pull request #2510 from Zth9730/u2pp_jit_export
...
[s2t] use reverse_weight in decode.yaml
2 years ago
tianhao zhang
19180d359d
format wav2vec2 demo
2 years ago
tianhao zhang
6e429f0513
support wav2vec2ASR on librispeech
2 years ago
Hui Zhang
290c23b9d7
add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang
e367242765
update dependency of paddle
2 years ago
tianhao zhang
5a66a14659
fix u2pp model version number
2 years ago
tianhao zhang
cda440e6f0
use reverse_weight in decode.yaml
2 years ago
Zth9730
c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
...
[s2t] streaming conformer u2 and u2pp jit export
2 years ago
Hui Zhang
c98b5dd173
fix masked_fill which will nan in trainning
2 years ago
Hui Zhang
9277fcb8a8
fix attn can not train
2 years ago
Hui Zhang
1f4f98b171
fix bug
2 years ago
liangym
0359c3f6b5
Fix mix front ( #2493 )
...
* update mix frontend, test=tts
2 years ago
Hui Zhang
e86337a423
fix bug
2 years ago
Hui Zhang
925abcca23
format
2 years ago
Hui Zhang
2a75405e9a
Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang
3ed24474d2
wenetspeech asr1 quant
2 years ago
Hui Zhang
467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
...
[ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)
2 years ago
tianhao zhang
5b5167b586
support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
YangZhou
3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
...
[Server]Deploy text model in server
2 years ago
ZapBird
7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 ( #2484 )
...
* BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。
__call__函数的参数audio_file为BytesIO类型时执行到self.preprocess(model, audio_file)会报错,需要判断audio_file为BytesIO类型时执行audio_file.seek(0)。
2 years ago
tianhao zhang
5bbe6e9897
support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
Hui Zhang
bdf876ea7b
Merge branch 'develop' into u2pp_export
2 years ago
Zhao Yuting
304dc2603c
Update text_engine.py
2 years ago
Zhao Yuting
8c945c073d
Update application.yaml
2 years ago
Zhao Yuting
b9693a0e8e
Update text_engine.py
2 years ago
Zhao Yuting
8ecf6796f3
Update text_engine.py
2 years ago
Hui Zhang
afda7ed7d1
remove useless code
2 years ago
YangZhou
4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
...
[CLI]Deploy fast text model for cli
2 years ago
Hui Zhang
b20bf7d5de
masked_fill by multiply, remove while
2 years ago
Zhao Yuting
d2da7f50d2
Update text_engine.py
...
precommihted already
2 years ago
Zhao Yuting
82f731c153
Update application.yaml
...
change model
2 years ago
Hui Zhang
feb27e2a84
fuse linear kv
2 years ago
Hui Zhang
3adb20b468
eliminate shape and slice
2 years ago
Hui Zhang
46088c0a16
elimiate attn transpose
2 years ago
Hui Zhang
f9e3eaa024
transpose in matmul
2 years ago
Hui Zhang
3d7ca93861
bool type slice
2 years ago
Hui Zhang
c2c8a662b1
refactor reshape
2 years ago
Hui Zhang
6de81d74d9
elimiete cast dtype for bool op
2 years ago
Hui Zhang
8e7a315e00
remove comment
2 years ago
Hui Zhang
c4a5ae3825
eliminate mul
2 years ago
Hui Zhang
b7388ce25a
eliminate useless unsqueese
2 years ago
Hui Zhang
1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
...
[s2t] support bitransformer decoder
2 years ago
TianYuan
52af86fcc3
fix ERNIE-SAT bug when edit the end of sentenses, test=doc ( #2432 )
2 years ago
tianhao zhang
d3e5937591
support bitransformer decoder
2 years ago
Hui Zhang
7382050e21
fix bug on win
2 years ago
TianYuan
b14da765e8
frm random spk embedding in voice cloning, test=doc ( #2429 )
2 years ago
Hui Zhang
d25871a7b0
format
2 years ago
Hui Zhang
b10512eb0e
more config or u2pp
2 years ago
Hui Zhang
00b2c1c8fb
fix forward attention decoder caller
2 years ago
zhoupc2015
2ae0f66d0d
Solve "unknown format: 3" ( #2422 )
...
* Solve execute the following code with return wav:
iob = io.BytesIO(wav)
wave.open(iob, 'rb')
will throw an "unknown format: 3" exception
2 years ago
Hui Zhang
309c8d70d9
add reverse weight
2 years ago
Hui Zhang
9b66680ea4
Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang
027535dec1
support bitransformer decoder, test=asr
2 years ago
THUzyt21
bdbacd4249
precomited
2 years ago
Zhao Yuting
d5dec46336
Update README.md
2 years ago
Zhao Yuting
18b71dc136
Update README.md
2 years ago
tianhao zhang
0a95689461
support bitransformer decoder
2 years ago
tianhao zhang
455379b88e
support bitransformer decoder
2 years ago
Zhao Yuting
a63a0b1350
Update pretrained_models.py
2 years ago
Zhao Yuting
12a11394bd
Update infer.py
...
add a new faster model to infer in cli
2 years ago
Zhao Yuting
fb7f04e021
Update README.md
2 years ago
Zhao Yuting
92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting
57dcd0d17f
Update infer.py
...
change the infer in order to implement the new faster model for text
2 years ago
Zhao Yuting
b627666ce9
Update model_alias.py
...
Add a new model for faster text process in cli
2 years ago
Zhao Yuting
a02654660a
Update pretrained_models.py
...
Add a new model for faster text process
2 years ago
tianhao zhang
ecbf324286
support bitransformer decoder, test=asr
2 years ago
tianhao zhang
1a56a6e42b
add bitransformer decoder, test=asr
2 years ago
Hui Zhang
53d6baff0b
format
2 years ago
Hui Zhang
549d477592
fix code style
2 years ago
Hui Zhang
4d5cfd4003
export param from cnofig
2 years ago
Hui Zhang
e3298c79ce
Merge branch 'develop' into u2_export
2 years ago
Hui Zhang
260752aa2a
using forward_attention_decoder
2 years ago
TianYuan
5e714ecb4a
[doc]update api docs ( #2406 )
...
* update apt docs, test=doc
2 years ago
TianYuan
eac362057c
add typehint for g2pw ( #2390 )
2 years ago
Hui Zhang
0d7d87120b
simplify feature pipeline graph
2 years ago
WongLaw
324b166c52
Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine, test=tts ( #2380 )
...
* Removed useless spk_id in speech_server and streaming_tts_server from demos, and support bilingual server engine.
2 years ago
TianYuan
80b180217d
[TTS] fix some bugs of ERNIE-SAT ( #2378 )
...
* fix ernie_sat, test=tts
* fix for comments, test=tts
2 years ago
Hui Zhang
8690a00bd8
add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang
07f566e0a5
Merge pull request #2372 from Zth9730/fix_dp_init
...
[s2t] DataParallel init method changed, fixed conformer could not multi-gpu training and don't affect dy2st
2 years ago
Hui Zhang
3a8869fba4
rm to_static decarator; configure jit save for ctc_activation
2 years ago
Hui Zhang
1c9f238ba0
configurable export
2 years ago
Hui Zhang
63aeb747b0
more comment
2 years ago
Hui Zhang
a7c6c54e75
fix
2 years ago
Hui Zhang
d638325c46
do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang
663e3ab58e
fix dp init
2 years ago
tianhao zhang
6745e9dd6b
fix dp init
2 years ago
tianhao zhang
598eb1a5ef
Merge branch 'develop' into fix_dp_init
2 years ago
WongLaw
989b755e8e
Revised must_neural_tone_words, test=doc. ( #2370 )
...
* Revised must_neural_tone_words.
2 years ago
tianhao zhang
9560d650db
fix dp init
2 years ago
TianYuan
7e4f3b029c
Merge pull request #2359 from yt605155624/add_vc2
...
[TTS]add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
tianhao zhang
82e04d7815
fix trianer
2 years ago
TianYuan
f7873773bf
uadd __init__.py for VITS, test=tts ( #2362 )
2 years ago
TianYuan
35c6ffa90b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan
e622f42d92
add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan
1c30cff1bf
fix gpus of ernie_sat, test=tts ( #2355 )
2 years ago
Hui Zhang
2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
...
[s2t] fix deepspeech2 decode_wav
2 years ago
tianhao zhang
ab92e2c98c
fix deepspeech2 decode_wav
2 years ago
艾梦
ea9ee93739
[TTS]Update VITS to support VITS and its voice cloning training on AIShell-3 ( #2268 )
...
* code for training vits voice clone on aishell3.
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
795eb7bd10
format paddlespeech with pre-commit ( #2331 )
2 years ago
TianYuan
5d5888af8e
fix tone, update readme ( #2335 )
2 years ago
贾晓
0b544ee84e
Merge pull request #2336 from Zth9730/fix_multigpu_train
...
[s2t] fix format test=asr
2 years ago
tianhao zhang
cdcb1a5316
s2t: fix encoder.py
2 years ago
tianhao zhang
ed2819d7af
fix format test=asr
2 years ago
Hui Zhang
58ab7e8d10
Merge pull request #2334 from Zth9730/fix_multigpu_train
...
[s2t] fix asr_engine.py
2 years ago
tianhao zhang
1dfca4ef73
fix multigpu training
2 years ago
Hui Zhang
94e750c4c4
Merge pull request #2327 from Zth9730/fix_multigpu_train
...
[s2t] fix conformer/transformer multi-gpu training, maybe impact dy2st
2 years ago
tianhao zhang
ed80b0e2c3
fix multigpu training test=asr
2 years ago
tianhao zhang
733ec7f2bc
fix conformer multi-gpu training test=asr
2 years ago
David An (An Hongliang)
f5367f5efb
[TTS]fix bug of tone modify ( #2323 )
...
* add special tone modifed case
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
Zhao Yuting
c28064fec2
Update asr_engine.py ( #2302 )
...
* Update asr_engine.py
* Update asr_engine.py
* Update application.yaml
must add parameter "num_decoding_left_chunks" so as to modify this in other scenarios.
* Update asr_engine.py
* Update application.yaml
* Update application.yaml
* Update asr_engine.py
2 years ago
TianYuan
7b864e8f38
clean old ernie sat inference scripts ( #2316 )
2 years ago
David An (An Hongliang)
c7163abffa
add thanks into readme, append data for chinese unit ( #2312 )
...
* add chinese words correct phonic,test=tts
* added thanks into readme. add data of unit, test=tts
* added thanks into readme. add data of unit, test=tts
* modify data of unit, test=tts
* modify thanks, test=tts
2 years ago
彭震东
c9de22eaa8
[TN] Update quantifiers ( #2308 )
2 years ago
TianYuan
d1c70a7809
fix g2pw model ( #2304 )
2 years ago
liangym
043b21d3b4
fix mix frontend, test=tts ( #2299 )
2 years ago
David An (An Hongliang)
25b96405df
add chinese words correct phonic,test=tts ( #2300 )
2 years ago
TianYuan
c1d4551055
add ernie sat synthesize_e2e, test=tts ( #2287 )
2 years ago
李子
5a58a27492
[TTS]指定G2PW的传入数据类型 , test=tts ( #2288 )
...
* fix ONNXRuntimeError Specify data type (int64),test=tts
* Tactron2→Tacotron2 ,test=doc
2 years ago
TianYuan
3f9339edff
Update polyphonic.yaml
2 years ago