zxcd
9d8660b2f6
add new aishell model for better CER.
1 year ago
WongLaw
305375c310
VITS learning rate revised, test=tts
1 year ago
WongLaw
fdeb9b88a7
VITS learning rate revised, test=tts
1 year ago
TianYuan
fc670339d1
[TTS]Fix losses of StarGAN v2 VC ( #3184 )
1 year ago
Hui Zhang
df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset ( #3189 )
...
* move s2t data preprocess into paddlespeech.dataset
* avg model, compute wer, format rsl into paddlespeech.dataset
* fix format rsl
* fix avg ckpts
1 year ago
Shuangchi He
8c7859d3bc
Fix some typos. ( #3178 )
...
Signed-off-by: Yulv-git <yulvchi@qq.com>
1 year ago
Hui Zhang
35d874c532
[s2t] mv dataset into paddlespeech.dataset ( #3183 )
...
* mv dataset into paddlespeech.dataset
* add aidatatang
* fix import
1 year ago
WongLaw
47e31f46cb
VITS learning rate revised, test=tts
1 year ago
WongLaw
414de3747c
VITS learning rate revised, test=tts
1 year ago
TianYuan
3ad55a31e7
[TTS]StarGANv2 VC fix some trainer bugs, add add reset_parameters ( #3182 )
1 year ago
PiaoYang
5a0103b2ae
[BUG] Fix progress bar unit. ( #3177 )
1 year ago
ljhzxc
dc56c3a10e
[TTS] [黑客松]Add JETS ( #3109 )
1 year ago
TianYuan
bd0d69ca74
[TTS]add StarGANv2VC preprocess ( #3163 )
1 year ago
zxcd
a1e5f27003
mv scaler.unscale_ blow grad_clip.
1 year ago
zxcd
7399d560e7
fix scaler save and load.
1 year ago
zxcd
2f4414a5f8
fix scaler save
1 year ago
zxcd
fbd27aab41
add amp for U2 conformer.
1 year ago
TianYuan
c7d24ba42c
fix some preprocess bugs ( #3155 )
2 years ago
longRookie
df37798598
[TTS]【Hackathon + No.190】 + 模型复现:iSTFTNet ( #3006 )
...
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* add iSTFTNet.md
* modify the format of iSTFTNet.md
* modify iSTFT.yaml and hifigan.py
* Format code using pre-commit
* modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv
* update iSTFTNet_csmsc_ckpt.zip download link
* modify iSTFTNet.md
* modify hifigan.py and iSTFT.yaml
* modify iSTFTNet.md
2 years ago
TianYuan
72aa19c32c
[TTS]add starganv2 vc trainer ( #3143 )
...
* add starganv2 vc trainer
* fix StarGANv2VCUpdater and losses
* fix StarGANv2VCEvaluator
* add some typehint
2 years ago
TianYuan
54ef90fcec
[TTS]Fix VITS lite infer ( #3098 )
2 years ago
liangym
e83b491c34
rm unused dep, test=tts ( #3097 )
2 years ago
TianYuan
6894a2a77d
[TTS]fix elementwise_floordiv's fill_constant ( #3075 )
...
* fix elementwise_floordiv's fill_constant
* add float converter for min_value in attention
2 years ago
TianYuan
0a2e367ff4
[TTS]clean starganv2 vc model code and add docstring ( #2987 )
...
* clean code
* add docstring
2 years ago
liangym
880c172db7
[TTS] add svs frontend ( #3062 )
2 years ago
TianYuan
d5720e4e7b
fix input dtype of elementwise_mul op from bool to int64 ( #3054 )
2 years ago
夜雨飘零
31a4562ae8
[ASR]add squeezeformer model ( #2755 )
...
* add squeezeformer model
* change CodeStyle, test=asr
* change CodeStyle, test=asr
* fix subsample rate error, test=asr
* merge classes as required, test=asr
* change CodeStyle, test=asr
* fix missing code, test=asr
* split code to new file, test=asr
* remove rel_shift, test=asr
2 years ago
zxcd
9bf5471613
optional tokenizer and fix some doc. ( #3042 )
2 years ago
TianYuan
706a68bde9
fix dtype diff of last expand_v2 op of VITS ( #3041 )
2 years ago
liangym
348064de0d
[TTS] add opencpop HIFIGAN example ( #3038 )
...
* add opencpop voc, test=tts
* soft link
* add opencpop hifigan, test=tts
* update
2 years ago
zxcd
4e9bca177a
[ASR] change optimizer and fix import error, test=asr ( #3023 )
...
* mv dataio.py to s2t.io.speechbrain.dataio
mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio
* remove transformers import.
* change optimizer same with released model
* add paddlenlp version in RESULT.md.
* fix run.sh
* fix data.sh step_num.
* add adadelta optimizer config.
* fix wav2vec2 test_wav.sh run error.
* add tokenizer config.
2 years ago
liangym
435fc5cc19
[TTS] add opencpop PWGAN example ( #3031 )
...
* add opencpop voc, test=tts
* soft link
2 years ago
TianYuan
271112ca69
fix vits reduce_sum's input/output dtype, test=tts ( #3028 )
2 years ago
liangym
1afd14acd9
[TTS]add Diffsinger with opencpop dataset ( #3005 )
2 years ago
MistEO
319c805968
[TTS] Support set device id for tts prediction, test=tts ( #3019 )
2 years ago
zxcd
3145325b4e
[ASR] add wav2vec2 aishell model result, test=asr ( #3012 )
...
* Create RESULT.md
* add wav2vec2ASR-large-aishell1 finetune model.
* update model link and add readme.
* fix released model info.
2 years ago
zxcd
5186319f48
fix load model schedule error, config optional. ( #3008 )
2 years ago
TianYuan
528ae58a67
[TTS]remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D ( #3002 )
...
* remove pad op in static model by replace F.pad with nn.Pad1D and nn.Pad2D
* fix variable names
* add note
2 years ago
JiehangXie
59cabdc967
[TTS]Cli Cantonese onnx, test=tts ( #2990 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
mooncake
c02bc087f6
rearrange-encoder-infer-param ( #2983 )
2 years ago
TianYuan
f7fd111647
[TTS]add StarGANv2-VC model scripts ( #2842 )
2 years ago
HuangLiangJie
c8196d45ae
[TTS]Canton CLI, test=tts ( #2977 )
2 years ago
TianYuan
ad239eb444
[TTS]add VITS inference ( #2972 )
2 years ago
TianYuan
84f751f529
[TTS]vits dygraph to static ( #2883 )
...
Co-authored-by: 0x45f <wangzhen45@baidu.com>
2 years ago
HuangLiangJie
11bc392617
[TTS]Canton phonetic fix, test=tts ( #2950 )
2 years ago
TianYuan
c8d5a01bdb
[TTS]fix dygraph to static for tacotron2, test=doc ( #2426 )
...
* fix dygraph to static for tacotron2, test=doc
* Fix dy2st error for taco2
* Update attentions.py
---------
Co-authored-by: 0x45f <wangzhen45@baidu.com>
2 years ago
liangym
d9b041e999
[TTS]Cli male onnx ( #2945 )
2 years ago
zxcd
dcf8ef04e0
[ASR] Remove fluid api and useless import, test=asr ( #2944 )
...
* remove fluid api and useless import.
* fix variable name
2 years ago
JiehangXie
a5c0bffd2a
add Cantonese test examples ( #2937 )
2 years ago
zxcd
a8a353d0ac
[ASR] add python simple adadelta optimizer, test=asr ( #2925 )
...
* add simple adeadelta optimizer.
* remove useless log
* remove useless and fluid import.
* add framework.dygraph_only back
2 years ago
HuangLiangJie
1af9bd47d9
[TTS]Cantonese FastSpeech2 e2e infer, test=tts ( #2927 )
2 years ago
zxcd
004a4d6096
[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr ( #2929 )
...
* rm transformers import and modify variable name consistent with infer.py
* add condition ctc_prefix_beam_search decode.
2 years ago
zxcd
17a7ebddfa
fix dist_sampler AttributeError ( #2918 )
2 years ago
HuangLiangJie
acfa057dc7
[TTS]Cantonese FastSpeech2 Training, test=tts ( #2907 )
2 years ago
zxcd
047092de8e
add wav2vev2_zh aishell recipe, and speechbrain dataloader. ( #2916 )
2 years ago
艾梦
bcd8e309ec
[TTS]Add diffusion noise clip to optimize sample result ( #2902 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
* add diffusion noise clip to optimize sample result
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
zxcd
f6b624ddc8
add encoding=utf8 for text cli. ( #2896 )
2 years ago
章宏彬
c764710aa1
[TTS]Avoid using variable "attn_loss" before assignment ( #2860 )
...
* Avoid using variable "attn_loss" before assignment
* Update tacotron2_updater.py
---------
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
TianYuan
a283f8a57e
[TTS]fix open encoding ( #2865 )
2 years ago
艾梦
a55fd2e556
[TTS]Fix diffusion wavenet denoiser final conv init param ( #2868 )
...
* add diffusion module for training diffsinger
* add wavenet denoiser final conv initializer
2 years ago
QuanZ9
ac3ed3c5a8
Update zh_frontend.py ( #2863 )
2 years ago
zxcd
64aeb6dccc
remove some fluid api (elementwise_div elementwise_mul sqrt reduce_sum). ( #2859 )
2 years ago
zxcd
31c2c226ca
clean fluid elementwise_max and square api. ( #2852 )
2 years ago
HuangLiangJie
140aed4b54
[TTS]VITS init sampler reverse, test=tts ( #2843 )
2 years ago
艾梦
57b9d4bca4
add diffusion module for training diffsinger ( #2832 )
2 years ago
TianYuan
1fd38c0e8b
fix o ( #2831 )
2 years ago
晋东毅
742523fb38
[tts]For mixed Chinese and English speech synthesis, add SSML support for Chinese ( #2830 )
...
* 添加.history
* [tts]添加中英混合语音合成时对中文SSML的支持
2 years ago
cxumol
a99244d86e
fix: whisper language choice, test=asr ( #2828 )
2 years ago
zxcd
ad40dafa85
fix some bug. ( #2825 )
2 years ago
HuangLiangJie
faa2f86651
[TTS]update VITS init method ( #2809 )
2 years ago
zxcd
88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' ( #2816 )
...
* add asr code-switch cli and demo.
* fix some model named problem.
2 years ago
HuangLiangJie
964211a81b
Change optimizer for vits, test=tts ( #2791 )
2 years ago
liangym
96d76c83ad
multi-spk tts static model ( #2779 )
...
* updata readme, test=doc
* update yaml and readme, test=tts
* fix batch_size, test=tts
* update readme, test=doc
* chmod, test=tts
* add multi-spk tts static model infer on server, test=tts
2 years ago
HuangLiangJie
2e51e0da90
[TTS]Fix attention bugs and sort VITS data with feats_lengths ( #2770 )
2 years ago
TianYuan
6725bcd823
revise paddlenlp's version ( #2767 )
2 years ago
TianYuan
979bbd9dcb
add mkldnn and trt config for paddleInference ( #2748 )
2 years ago
zxcd
a8a240d4ef
remove paddle.fluid ( #2740 )
2 years ago
YangZhou
12fa8a2d19
[audio]patch:fix tensor_utils error ( #2738 )
...
* fix tensor utils
2 years ago
TianYuan
3f6afc4834
[TTS]Add slim for TTS ( #2729 )
2 years ago
YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
...
* split paddlespeech/audio to paddleaudio.
* add sox io ,sox effect, kaldi native fbank to paddleaudio.
2 years ago
HuangLiangJie
a874d8f325
Add prosody prediction in synthesize_e2e, test=tts ( #2693 )
2 years ago
TianYuan
62357d876c
[TTS]rm paddlelite in setup.py ( #2713 )
...
* rm paddlelite in setup.py
* fix setup.py
2 years ago
Zth9730
c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr ( #2697 )
...
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
2 years ago
David An (An Hongliang)
bd01bc155d
add greek char and fix issue2571 ( #2683 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
...
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
2 years ago
Zth9730
fc02cd0540
[doc] update wav2vec2 demos README.md, test=doc ( #2674 )
...
* fix wav2vec2 demos, test=doc
* fix wav2vec2 demos, test=doc
* fix enc_dropout and nor.py, test=asr
2 years ago
zxcd
b71f1428c7
add all whisper model size support, test=asr ( #2677 )
...
* add all whisper model size support
* add choices in parser.
2 years ago
TianYuan
0b4cf2211d
[TTS]Add TTS Paddle-Lite x86 inference ( #2667 )
...
* Add export2lite, test=tts
* add tts paddlelite x86 inference, test=tts
* update released_model.md, test=tts
* add paddlelite in setup.py
* update
2 years ago
David An (An Hongliang)
1c3d2cb89e
add double byte char for zh normalization ( #2661 )
2 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr
2 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
2 years ago
kFoodie
dc9d3baf51
Update onnx_api.py ( #2664 )
2 years ago
liangym
25b6bf9668
[tts] Add male voice for tts ( #2660 )
2 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
...
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
2 years ago
HuangLiangJie
b7312e9f0b
Revised TN qualifier for measure notation, test=tts ( #2629 )
2 years ago
Zth9730
e6d20888c5
支持0维Tensor需要的修改 ( #2621 )
2 years ago
David An (An Hongliang)
8a5fe83e1d
add ssml sentences.txt ( #2620 )
2 years ago
Hui Zhang
2c34481ea0
[s2t] quant with wav scp ( #2568 )
...
* add quant hint
* add paddleslim
* using paddleslim 2.3.4 and paddle 2.4
2 years ago
Zth9730
8d3464c050
[s2t] Update wav2vec2 license ( #2600 )
2 years ago