zxcd
b1d3f59bcb
[s2t] add whisper asr large model ( #2640 )
...
* add whisper asr large model decoding, test=asr
* fix code style.
* fix json code style.
* remove resource and fix code style.
* fix yapf
* add cli and demos, fix some code style.
* fix some problem by comment.
* fix yapf
2 years ago
kFoodie
dc9d3baf51
Update onnx_api.py ( #2664 )
2 years ago
liangym
25b6bf9668
[tts] Add male voice for tts ( #2660 )
2 years ago
Zth9730
8d3494320d
[ASR] wav2vec2_en, test=asr ( #2637 )
...
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
* wav2vec2_en, test=asr
2 years ago
HuangLiangJie
b7312e9f0b
Revised TN qualifier for measure notation, test=tts ( #2629 )
2 years ago
Zth9730
e6d20888c5
支持0维Tensor需要的修改 ( #2621 )
2 years ago
David An (An Hongliang)
8a5fe83e1d
add ssml sentences.txt ( #2620 )
2 years ago
Hui Zhang
2c34481ea0
[s2t] quant with wav scp ( #2568 )
...
* add quant hint
* add paddleslim
* using paddleslim 2.3.4 and paddle 2.4
2 years ago
Zth9730
8d3464c050
[s2t] Update wav2vec2 license ( #2600 )
2 years ago
liangym
e18170228c
[tts] add adversarial loss ( #2588 )
2 years ago
TianYuan
9aab706cba
fix frontend bug, test=tts ( #2606 )
2 years ago
WongLaw
e348aa825d
Added Rhythm Prediction, test=tts
2 years ago
WongLaw
b96fb1d57e
Added Rythm Prediction, test=tts
2 years ago
WongLaw
d27364d141
Added Text Rhythm Prediction, test=tts
2 years ago
HuangLiangJie
872be9c8ce
Merge branch 'PaddlePaddle:develop' into rhy
2 years ago
YangZhou
bbf2401e3e
Merge pull request #2524 from zh794390558/u2
...
[speechx] add u2/u2pp asr inference
2 years ago
WongLaw
72bbabbf79
Revised structure of rhythm prediction, test=tts
2 years ago
david.95
ed0138c6e3
add condition check if a ssml input and filter space line, test=tts
2 years ago
David An (An Hongliang)
21cce0e0bb
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan
63c80121e2
fix uvicorn's bug
2 years ago
TianYuan
2a60c3d854
Merge pull request #2554 from dahu1/develop
...
标点恢复代码更新,test=asr
2 years ago
david.95
3ac7ac253f
fix review issue,test=tts
2 years ago
David An (An Hongliang)
0476e645aa
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
Zth9730
68134c8436
fix u2pp model ( #2549 )
2 years ago
dahu1
cb76e66401
1.token配置不写死,2.text显示不乱码, test=asr
2 years ago
Hui Zhang
eac545e1db
Merge pull request #2544 from Zth9730/fix_attention
...
[s2t] fix attention eval bug, do not compose kv in infer
2 years ago
tianhao zhang
1ea828c30e
fix attention val bug
2 years ago
David An (An Hongliang)
103e46f819
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
TianYuan
2d71577e75
fix g2p ( #2539 )
2 years ago
david.95
f295d2d445
remove useless code
2 years ago
david.95
89e9ea69eb
modify __init__
2 years ago
david.95
1067088deb
modify __init__
2 years ago
david.95
f56cc08b18
add license content, test=tts
2 years ago
david.95
29508f400b
to fix CI issue, test=tts
2 years ago
david.95
60801d8f14
Merge branch 'hongliang1014' of https://github.com/david-95/PaddleSpeech into hongliang1014
2 years ago
David An (An Hongliang)
ce21f9bc41
Merge branch 'PaddlePaddle:develop' into hongliang1014
2 years ago
david.95
278c7a41a8
add module define to fix ci, test=tts
2 years ago
Hui Zhang
964c22c677
Merge pull request #2532 from Zth9730/wav2vec2.0
...
[s2t] fix wav2vec2 report loss bug
2 years ago
tianhao zhang
86f65f0b8e
fix wav2vec2 report loss bug
2 years ago
david.95
13a7fa9808
enable chinese words' pinyin specified in text of ssml formats, test=tts
2 years ago
Hui Zhang
f1ca564731
Merge pull request #2518 from Zth9730/wav2vec2.0
...
[ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech
2 years ago
tianhao zhang
2ae94bd277
freeze wav2vec2=True, change loss report and update README.md
2 years ago
tianhao zhang
3d994f5c23
format wav2vec2 demo
2 years ago
Hui Zhang
c6f9764ed6
Merge pull request #2510 from Zth9730/u2pp_jit_export
...
[s2t] use reverse_weight in decode.yaml
2 years ago
tianhao zhang
19180d359d
format wav2vec2 demo
2 years ago
tianhao zhang
6e429f0513
support wav2vec2ASR on librispeech
2 years ago
Hui Zhang
290c23b9d7
add u2 nnet, u2 nnet main, codelab, and can compile
2 years ago
tianhao zhang
e367242765
update dependency of paddle
2 years ago
tianhao zhang
5a66a14659
fix u2pp model version number
2 years ago
tianhao zhang
cda440e6f0
use reverse_weight in decode.yaml
2 years ago
Zth9730
c9b0c96b7b
Merge pull request #2502 from zh794390558/u2pp_export
...
[s2t] streaming conformer u2 and u2pp jit export
2 years ago
Hui Zhang
c98b5dd173
fix masked_fill which will nan in trainning
2 years ago
Hui Zhang
9277fcb8a8
fix attn can not train
2 years ago
Hui Zhang
1f4f98b171
fix bug
2 years ago
liangym
0359c3f6b5
Fix mix front ( #2493 )
...
* update mix frontend, test=tts
2 years ago
Hui Zhang
e86337a423
fix bug
2 years ago
Hui Zhang
925abcca23
format
2 years ago
Hui Zhang
2a75405e9a
Merge branch 'develop' into u2pp_export
2 years ago
Hui Zhang
3ed24474d2
wenetspeech asr1 quant
2 years ago
Hui Zhang
467cfd4e75
Merge pull request #2489 from Zth9730/u2++_server
...
[ASR] support u2pp based cli and server, optimiz code of u2pp decode (reversed_weight parameter)
2 years ago
tianhao zhang
5b5167b586
support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
YangZhou
3507829a6d
Merge pull request #2464 from THUzyt21/Deploy-text-model-in-server
...
[Server]Deploy text model in server
2 years ago
ZapBird
7a13b35fe6
BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。 ( #2484 )
...
* BytesIO类型时,要保证切到初始位置,这样多次读取才能够正常。比如__call__函数。
__call__函数的参数audio_file为BytesIO类型时执行到self.preprocess(model, audio_file)会报错,需要判断audio_file为BytesIO类型时执行audio_file.seek(0)。
2 years ago
tianhao zhang
5bbe6e9897
support u2pp cli and server, optimiz code of u2pp decode, test=asr
2 years ago
Hui Zhang
bdf876ea7b
Merge branch 'develop' into u2pp_export
2 years ago
Zhao Yuting
304dc2603c
Update text_engine.py
2 years ago
Zhao Yuting
8c945c073d
Update application.yaml
2 years ago
Zhao Yuting
b9693a0e8e
Update text_engine.py
2 years ago
Zhao Yuting
8ecf6796f3
Update text_engine.py
2 years ago
Hui Zhang
afda7ed7d1
remove useless code
2 years ago
YangZhou
4841f94298
Merge pull request #2421 from THUzyt21/Deploy-fast-text-model-for-cli
...
[CLI]Deploy fast text model for cli
2 years ago
Hui Zhang
b20bf7d5de
masked_fill by multiply, remove while
2 years ago
Zhao Yuting
d2da7f50d2
Update text_engine.py
...
precommihted already
2 years ago
Zhao Yuting
82f731c153
Update application.yaml
...
change model
2 years ago
Hui Zhang
feb27e2a84
fuse linear kv
2 years ago
Hui Zhang
3adb20b468
eliminate shape and slice
2 years ago
Hui Zhang
46088c0a16
elimiate attn transpose
2 years ago
Hui Zhang
f9e3eaa024
transpose in matmul
2 years ago
Hui Zhang
3d7ca93861
bool type slice
2 years ago
Hui Zhang
c2c8a662b1
refactor reshape
2 years ago
Hui Zhang
6de81d74d9
elimiete cast dtype for bool op
2 years ago
Hui Zhang
8e7a315e00
remove comment
2 years ago
Hui Zhang
c4a5ae3825
eliminate mul
2 years ago
Hui Zhang
b7388ce25a
eliminate useless unsqueese
2 years ago
Hui Zhang
1a1ce92cb4
Merge pull request #2415 from Zth9730/u2++_decoder
...
[s2t] support bitransformer decoder
2 years ago
TianYuan
52af86fcc3
fix ERNIE-SAT bug when edit the end of sentenses, test=doc ( #2432 )
2 years ago
tianhao zhang
d3e5937591
support bitransformer decoder
2 years ago
Hui Zhang
7382050e21
fix bug on win
2 years ago
TianYuan
b14da765e8
frm random spk embedding in voice cloning, test=doc ( #2429 )
2 years ago
Hui Zhang
d25871a7b0
format
2 years ago
Hui Zhang
b10512eb0e
more config or u2pp
2 years ago
Hui Zhang
00b2c1c8fb
fix forward attention decoder caller
2 years ago
zhoupc2015
2ae0f66d0d
Solve "unknown format: 3" ( #2422 )
...
* Solve execute the following code with return wav:
iob = io.BytesIO(wav)
wave.open(iob, 'rb')
will throw an "unknown format: 3" exception
2 years ago
Hui Zhang
309c8d70d9
add reverse weight
2 years ago
Hui Zhang
9b66680ea4
Merge branch 'u2++_decoder' into u2pp_export
2 years ago
tianhao zhang
027535dec1
support bitransformer decoder, test=asr
2 years ago
THUzyt21
bdbacd4249
precomited
2 years ago
Zhao Yuting
d5dec46336
Update README.md
2 years ago
Zhao Yuting
18b71dc136
Update README.md
2 years ago
tianhao zhang
0a95689461
support bitransformer decoder
2 years ago