YangZhou
dd71e9a0e4
fix copyright, test=doc
2 years ago
Hui Zhang
225737d4e3
[s2t] fix cli args to config ( #3194 )
...
* fix cli args to config
* fix train cli
2 years ago
Hui Zhang
e3dcfa8815
Merge pull request #3186 from PaddlePaddle/vits_pr
...
[TTS]update lr schedulers from per iter to per epoch for VITS
2 years ago
YangZhou
2e62ac8bfc
Update README.md in engine
2 years ago
zxcd
bc365cbb52
Merge branch 'develop' into amp
2 years ago
zxcd
f3d567f93b
add readme
2 years ago
zxcd
9d8660b2f6
add new aishell model for better CER.
2 years ago
Hui Zhang
7cab869d63
Merge pull request #3197 from PaddlePaddle/speechx
...
[engine] merge speechx
2 years ago
YangZhou
ce4af0e765
Merge branch 'speechx' of github.com:PaddlePaddle/PaddleSpeech into speechx
2 years ago
YangZhou
a9027f18d0
merge dev
2 years ago
YangZhou
500f283dd6
Revert "[engine] merge develop into speechx ( #3198 )" ( #3199 )
...
This reverts commit 6e0044be58
.
2 years ago
YangZhou
6e0044be58
[engine] merge develop into speechx ( #3198 )
...
*merge develop into speechx
2 years ago
YangZhou
8c2196ea0c
[engine] add wfst recognizer in example ( #3173 )
...
* update wfst script
* add skip blank
2 years ago
WongLaw
305375c310
VITS learning rate revised, test=tts
2 years ago
WongLaw
fdeb9b88a7
VITS learning rate revised, test=tts
2 years ago
TianYuan
fc670339d1
[TTS]Fix losses of StarGAN v2 VC ( #3184 )
2 years ago
ljhzxc
84cc5fc98f
Update pretrained model in README ( #3193 )
2 years ago
Hui Zhang
df3be4acae
[s2t] move s2t data preprocess into paddlespeech.dataset ( #3189 )
...
* move s2t data preprocess into paddlespeech.dataset
* avg model, compute wer, format rsl into paddlespeech.dataset
* fix format rsl
* fix avg ckpts
2 years ago
Shuangchi He
8c7859d3bc
Fix some typos. ( #3178 )
...
Signed-off-by: Yulv-git <yulvchi@qq.com>
2 years ago
Hui Zhang
35d874c532
[s2t] mv dataset into paddlespeech.dataset ( #3183 )
...
* mv dataset into paddlespeech.dataset
* add aidatatang
* fix import
2 years ago
masimeng1994
5e2251afda
[Engine] rename cls && add cls && vad android demo ( #3188 )
...
* [Engine] rename cls to AudioClassification
* [Engine] add android && vad demo
2 years ago
WongLaw
47e31f46cb
VITS learning rate revised, test=tts
2 years ago
WongLaw
414de3747c
VITS learning rate revised, test=tts
2 years ago
TianYuan
3ad55a31e7
[TTS]StarGANv2 VC fix some trainer bugs, add add reset_parameters ( #3182 )
2 years ago
PiaoYang
9cf8c1985a
Update quick_start_cn.md ( #3176 )
2 years ago
PiaoYang
5a0103b2ae
[BUG] Fix progress bar unit. ( #3177 )
2 years ago
PiaoYang
6ffd3e4f7f
Update quick_start.md ( #3175 )
2 years ago
ljhzxc
dc56c3a10e
[TTS] [黑客松]Add JETS ( #3109 )
2 years ago
TianYuan
bd0d69ca74
[TTS]add StarGANv2VC preprocess ( #3163 )
2 years ago
YangZhou
b05ead51d7
[engine]add recognizer api && clean params && make a shared decoder resource ( #3165 )
2 years ago
zxcd
a1e5f27003
mv scaler.unscale_ blow grad_clip.
2 years ago
zxcd
7399d560e7
fix scaler save and load.
2 years ago
zxcd
2f4414a5f8
fix scaler save
2 years ago
zxcd
fbd27aab41
add amp for U2 conformer.
2 years ago
TianYuan
c7d24ba42c
fix some preprocess bugs ( #3155 )
2 years ago
masimeng1994
11ce08b260
[engine] replace onnx with fastdeploy ( #3150 )
...
* onnxruntime change to fastdeploy
2 years ago
lemondy
9c387577fd
fix example/aishell local/train.sh if condition bug, test=asr ( #3146 )
2 years ago
twoDogy
c0cc850776
add function for generating srt file ( #3123 )
...
* add function for generating srt file
在原来websocket_client.py的基础上,增加了由wav或mp3格式的音频文件生成对应srt格式字幕文件的功能
* add function for generating srt file
在原来websocket_client.py的基础上,增加了由wav或mp3格式的音频文件生成对应srt格式字幕文件的功能
* keep origin websocket_client.py
恢复原本的websocket_client.py文件
* add generating subtitle function into README
* add generate subtitle funciton into README
* add subtitle generation function
* add subtitle generation function
2 years ago
masimeng1994
d03ebe872a
add vad interface GetVadResult ( #3140 )
...
* add vad interface GetVadResult
* fix comment
2 years ago
longRookie
df37798598
[TTS]【Hackathon + No.190】 + 模型复现:iSTFTNet ( #3006 )
...
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* iSTFTNet implementation based on hifigan, not affect the function and execution of HIFIGAN
* modify the comment in iSTFT.yaml
* add the comments in hifigan
* add iSTFTNet.md
* modify the format of iSTFTNet.md
* modify iSTFT.yaml and hifigan.py
* Format code using pre-commit
* modify hifigan.py,delete the unused self.istft_layer_id , move the self.output_conv behind else, change conv_post to output_conv
* update iSTFTNet_csmsc_ckpt.zip download link
* modify iSTFTNet.md
* modify hifigan.py and iSTFT.yaml
* modify iSTFTNet.md
2 years ago
TianYuan
72aa19c32c
[TTS]add starganv2 vc trainer ( #3143 )
...
* add starganv2 vc trainer
* fix StarGANv2VCUpdater and losses
* fix StarGANv2VCEvaluator
* add some typehint
2 years ago
YangZhou
f35a87ab89
[Engine] recognizer controller refactor ( #3139 )
...
* refactor recognizer_controller
* clean frontend file
2 years ago
TianYuan
54ef90fcec
[TTS]Fix VITS lite infer ( #3098 )
2 years ago
TianYuan
d3d86f59aa
Update bug-report-tts.md ( #3120 )
2 years ago
masimeng1994
591b957b96
[runtime] fix linux && android cmake bug ( #3112 )
...
* support vad ios compile
* support onnx model recognize
* add build ios sh
2 years ago
masimeng1994
9e5a39cae0
[runtime] support onnx runtime && support ios compile ( #3101 )
...
* support vad ios compile
* support onnx model recognize
* add build ios sh
2 years ago
liangym
e83b491c34
rm unused dep, test=tts ( #3097 )
2 years ago
YangZhou
767f6dd4e2
[engine] add recognizer_controller && fix build bugs ( #3086 )
...
* fix asr compile
2 years ago
liangym
793effa122
[TTS] update svs_music_score.md ( #3085 )
2 years ago
TianYuan
cc02b007fe
fix paddle2onnx's install version, install the newest paddle2onnx in run.sh ( #3084 )
2 years ago