jiamingkong
8432e8626f
Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options
1 year ago
zoooo0820
17f2944a17
fix error in tts/st
2 years ago
zxcd
caca8e2f12
[ASR] fix asr 0-d tensor. ( #3214 )
...
* fix asr infer.py
* add readme.
2 years ago
TianHao Zhang
12e3e76092
[ASR] Support Hubert, finetuned on the librispeech dataset ( #3088 )
...
* librispeech hubert, test=asr
* librispeech hubert, test=asr
* hubert decode
* review
* copyright, notes, example related
* hubert cli
* pre-commit format
* fix conflicts
* fix conflicts
* doc related
* doc and train config
* librispeech.py
* support hubert cli
2 years ago
PiaoYang
5a0103b2ae
[BUG] Fix progress bar unit. ( #3177 )
2 years ago
JiehangXie
59cabdc967
[TTS]Cli Cantonese onnx, test=tts ( #2990 )
...
Co-authored-by: TianYuan <white-sky@qq.com>
2 years ago
HuangLiangJie
c8196d45ae
[TTS]Canton CLI, test=tts ( #2977 )
2 years ago
liangym
d9b041e999
[TTS]Cli male onnx ( #2945 )
2 years ago
zxcd
6728db5b59
[ASR]Whisper remove audio duration limit, test=asr ( #2900 )
2 years ago
zxcd
f6b624ddc8
add encoding=utf8 for text cli. ( #2896 )
2 years ago
TianYuan
a283f8a57e
[TTS]fix open encoding ( #2865 )
2 years ago
zxcd
88fe26f17c
[ASR] add asr code-switch cli and demo, test='asr' ( #2816 )
...
* add asr code-switch cli and demo.
* fix some model naming problems.
2 years ago
YangZhou
42ff946007
[audio] mv paddlespeech/audio to paddleaudio ( #2706 )
...
* split paddlespeech/audio to paddleaudio.
* add sox io, sox effect, kaldi native fbank to paddleaudio.
2 years ago
Zth9730
c67bf7b4ef
[ASR] support wav2vec2-zh cli, test=asr ( #2697 )
...
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
* support wav2vec2-zh cli, test=asr
2 years ago
zxcd
4542684694
[ASR] fix Whisper cli model download path error. test=asr ( #2679 )
...
* add all whisper model size support
* add choices in parser.
* fix Whisper cli model download path error.
* fix resource download path.
* fix code style
2 years ago
zxcd
b71f1428c7
add all whisper model size support, test=asr ( #2677 )
...
* add all whisper model size support
* add choices in parser.
2 years ago
Zth9730
94a487bd81
[ASR] support wav2vec2 command line and demo ( #2658 )
...
* wav2vec2_cli
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* Update RESULTS.md
* Update RESULTS.md
* Update base_commands.py
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
* wav2vec2 demo update: support different optimizer and lr_scheduler, align model, update input type, test=asr
2 years ago
zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problems raised in review comments.
* fix yapf
2 years ago
liangym
25b6bf9668
[tts] Add male voice for tts ( #2660 )
2 years ago
Zth9730
68134c8436
fix u2pp model ( #2549 )
2 years ago
Hui Zhang
467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
...
[ASR] support u2pp based cli and server, optimize code of u2pp decode (reversed_weight parameter)
2 years ago
ZapBird
7a13b35fe6
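For BytesIO input, seek back to the start position so that repeated reads work correctly, e.g. in the __call__ function. ( #2484 )
...
* For BytesIO input, seek back to the start position so that repeated reads work correctly, e.g. in the __call__ function.
When the audio_file argument of __call__ is a BytesIO, executing self.preprocess(model, audio_file) raises an error, so audio_file.seek(0) must be called when audio_file is a BytesIO.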
2 years ago
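The BytesIO fix in commit 7a13b35fe6 relies on a general property of Python file-like objects: a read leaves the stream position at the end, so a second read returns nothing unless the position is reset. The sketch below illustrates that behaviour in isolation. The helper names `read_without_rewind` and `read_with_rewind` are hypothetical and stand in for any step that consumes the input; this is not the project's actual `preprocess` code.

```python
import io


def read_without_rewind(audio_file):
    # Hypothetical helper: reads from wherever the stream position currently is.
    return audio_file.read()


def read_with_rewind(audio_file):
    # Hypothetical helper: rewinds in-memory buffers before reading,
    # mirroring the isinstance check + seek(0) described in the commit message.
    if isinstance(audio_file, io.BytesIO):
        audio_file.seek(0)
    return audio_file.read()


payload = b"fake-audio-bytes"  # placeholder data, not a real audio file

# Without rewinding, the second read starts at end-of-stream and gets nothing.
buf = io.BytesIO(payload)
first_plain = read_without_rewind(buf)
second_plain = read_without_rewind(buf)

# With rewinding, every read sees the full payload.
buf = io.BytesIO(payload)
first_rewound = read_with_rewind(buf)
second_rewound = read_with_rewind(buf)
```

Here `second_plain` is empty while `second_rewound` equals the original payload, which is why repeated calls on the same buffer need the rewind.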
tianhao zhang
5bbe6e9897
support u2pp cli and server, optimize code of u2pp decode, test=asr
2 years ago
THUzyt21
bdbacd4249
pre-committed
2 years ago
Zhao Yuting
d5dec46336
Update README.md
2 years ago
Zhao Yuting
18b71dc136
Update README.md
2 years ago
Zhao Yuting
12a11394bd
Update infer.py
...
add a new faster model to infer in cli
2 years ago
Zhao Yuting
fb7f04e021
Update README.md
2 years ago
Zhao Yuting
92d09d5cce
Update README_cn.md
2 years ago
Zhao Yuting
57dcd0d17f
Update infer.py
...
change the infer in order to implement the new faster model for text
2 years ago
TianYuan
35c6ffa90b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_vc2
2 years ago
TianYuan
e622f42d92
add aishell3 voice cloning with ECAPA-TDNN spk encoder
2 years ago
TianYuan
795eb7bd10
format paddlespeech with pre-commit ( #2331 )
2 years ago
TianYuan
0baec4325a
fix stats bugs
2 years ago
TianYuan
5d515f3f3f
update mix tts
2 years ago
TianYuan
8da993bbf8
fix fs bug
2 years ago
TianYuan
788a3062d0
fix onnx am_ckpt from list to item in pretrained_models.py
2 years ago
TianYuan
cd662a08e0
fix for load specified model files
2 years ago
TianYuan
b9ade18055
add onnxruntime infer for cli
2 years ago
TianYuan
510e240c5c
change default voc in cli from pwgan_csmsc to hifigan_csmsc, test=tts
2 years ago
TianYuan
00e9853f66
add mix tts cli, test=tts
2 years ago
Hui Zhang
5ca05fea20
cli batch process support \t
2 years ago
huangyuxin
05d41523ad
Merge branch 'develop' into webdataset
2 years ago
TianYuan
4b1f82d312
log redundancy in server
2 years ago
TianYuan
bc93bffbb4
replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
TianYuan
cf846f9ebc
rm extra log
2 years ago
KP
adc7c9b4aa
Fix unnecessary download reported in issue #2067.
2 years ago
huangyuxin
6ec6921255
Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg
6598216b2f
Merge branch 'develop' into webdataset
2 years ago
huangyuxin
aa12b9ab52
replace s2t.transform with audio.transform
2 years ago