lym0302
6b2dd16845
update server cli, test=doc
3 years ago
WilliamZhang06
78c9b7342c
deleted wav file , test=doc
3 years ago
WilliamZhang06
a6ec3a26f1
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
8b4602f738
added isinstance code, test=doc
3 years ago
lym0302
bb60561c66
update util, test=doc
3 years ago
WilliamZhang06
147018a8b4
added cli changed code, test=doc
3 years ago
lym0302
332009142b
add server demo, test=doc
3 years ago
WilliamZhang06
7ebe904e20
fixed overload , test=doc
3 years ago
Hui Zhang
60c0877e7a
Merge pull request #1472 from KPatr1ck/cli_batch
...
[CLI][Logger]Add cli logger control.
3 years ago
WilliamZhang06
b8f16ac9b0
Merge branch 'develop' into server_asr
3 years ago
WilliamZhang06
da3ea7bb40
added engine type and asr inference , test=doc
3 years ago
Hui Zhang
49f80afe6a
Merge pull request #1381 from PaddlePaddle/server
...
[server] speech server init version
3 years ago
lym0302
b508c4d0cb
add readme, test=doc
3 years ago
KP
d36a4ccfc8
Add cli logger control.
3 years ago
KP
94ed5969fa
Add cli logger control.
3 years ago
lym0302
42cbe313c2
improve cli code, test=doc
3 years ago
lym0302
2bf4b4521f
add cli, test=doc
3 years ago
lym0302
8fd117e4da
add cli, test=doc
3 years ago
lym0302
80b83b7434
add cli, test=doc
3 years ago
KP
7814fba07f
Update batch input.
3 years ago
KP
05288fe1c3
Update batch input and stdin input.
3 years ago
KP
1818b058aa
Support batch input in cls task.
3 years ago
WilliamZhang06
35e3be9ac8
Merge remote-tracking branch 'remote/develop' into server
3 years ago
TianYuan
ae521d3700
Update infer.py
3 years ago
lym0302
07158b2f12
move dir, test=doc
3 years ago
lym0302
76391275fc
move dir, test=doc
3 years ago
TianYuan
67ec6242c3
fix ci for waveflow, test=tts
3 years ago
TianYuan
f51097618b
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into fix_ci_waveflow
3 years ago
TianYuan
fc8c0e3ea2
fix ci for waveflow, test=tts
3 years ago
huangyuxin
95d5274aef
fix sortagrad, test=asr
3 years ago
Hui Zhang
718c849f68
Merge pull request #1445 from yt605155624/update_train
...
[TTS]init for all works in train.py when ngpu>1
3 years ago
Hui Zhang
f3ec985aaf
Merge pull request #1439 from Jackwaterveg/tipc
...
[TIPC]Add tipc_benchmark of conformer
3 years ago
TianYuan
4ac7db185e
init for all works in train.py when ngpu>1, test=tts
3 years ago
Jackwaterveg
426bae3de1
Merge pull request #1440 from yt605155624/merge_datasets
...
[TTS]Merge datasets, change style of docstring
3 years ago
TianYuan
2cec8f6c76
update tts cli, test=doc
3 years ago
TianYuan
9699c00769
change the docstring style from numpydoc to google, test=tts
3 years ago
huangyuxin
aefe9e93a7
add tipc benchmark of conformer
3 years ago
TianYuan
683679bec7
merge data and datasets, test=tts
3 years ago
TianYuan
7dc1f2daa3
fix some librosa bugs, test=tts
3 years ago
TianYuan
30085ac229
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into rename_tacotron2
3 years ago
TianYuan
25347bb6a3
rename tacotron2, test=tts
3 years ago
huangyuxin
9a55783aa0
fix resample
3 years ago
Hui Zhang
dcfc32f1ec
Merge pull request #1379 from yt605155624/new_wavernn
...
[TTS] add wavernn
3 years ago
TianYuan
0747600c95
[TTS]add ljspeech new tacotron2 ( #1416 )
...
* add ljspeech new tacotron2, test=tts
* update ljspeech waveflow's synthesize
* add config, test=doc
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
TianYuan
348a1a33bf
update tacotron2 voice cloning in aishell3 with new tacotron2, test=tts ( #1419 )
3 years ago
huangyuxin
f428ec4431
change log of cli/asr/infer
3 years ago
TianYuan
1b0c034134
update wavernn, test=tts
3 years ago
TianYuan
89e69ee10e
[TTS]fix tacotron2 dygraph to static ( #1414 )
...
* fix tacotron2 dygraph to static , test=tts
* fix tacotron2 dygraph to static , test=tts
* simplify synthesize_e2e.py , test=tts
3 years ago
huangyuxin
2a42421a63
cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Hui Zhang
4128f4d61f
fix __version__ error in develop ( #1398 )
3 years ago
TianYuan
001afee644
fix wavernn dygraph to static , test=tts
3 years ago
TianYuan
2844f388dc
[doc ]add tacotron2 readme ( #1385 )
...
* add tacotron2 readme, test=doc
* update changelog.md, test=doc
3 years ago
TianYuan
2071774d81
add wavernn in synthesize_e2e, test=tts
3 years ago
TianYuan
1cc7905d51
rm csmsc.py, test=tts
3 years ago
TianYuan
4c3e57a23c
align preprocess of wavernn, test=tts
3 years ago
Jackwaterveg
f49cf838a8
Update u2.py ( #1378 )
3 years ago
TianYuan
fb0acd40a2
add wavernn, test=tts
3 years ago
Jackwaterveg
d7222c0453
[ASR] Support CTC decoder online ( #821 )
...
* fix the destructer problem for prefixes
* unified offline and online in ctcdecoders, test=asr
* rename swig_decoders to paddlespeech_ctcdecoders, test=asr
* add reset_stage for ctcdecoder
* fix some problems
* fix ctconline
* fix a bug
* fix the format
* fix 1xt2x
3 years ago
Jerryuhoo
f515416c4a
fix missing model choice, test=doc
3 years ago
Jerryuhoo
a22080130b
Add speedyspeech multi-speaker support for synthesize_e2e.py, test=tts
3 years ago
Hui Zhang
97db74ca60
Merge pull request #1314 from yt605155624/add_new_tacotron2
...
[TTS]Add new tacotron2
3 years ago
huangyuxin
3845804cc9
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into Setup
3 years ago
TianYuan
96323816e9
fix yamls, change labels to stop_labels, test=tts
3 years ago
TianYuan
1bf1a876ae
Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into add_new_tacotron2, test=tts
3 years ago
TianYuan
3fd7a7790b
add typehit for updater and evaluator, test=tts
3 years ago
huangyuxin
4e31247633
refacto the code
3 years ago
TianYuan
41d24337cb
fix fastspeech2 multi speaker to static, test=tts
3 years ago
TianYuan
1a9e59612a
fix fastspeech2 multi speaker to static, test=tts
3 years ago
huangyuxin
565a63c5ef
refactor the setup in paddleaudio
3 years ago
huangyuxin
eb91ce84f9
refactor the version
3 years ago
Hui Zhang
4a133619a1
Merge pull request #1356 from Jackwaterveg/CLI
...
[CLI] asr, Add Deepspeech2 online and offline model
3 years ago
Hui Zhang
d4acf4704f
Merge pull request #1350 from LittleChenCc/develop
...
[ST] beam search with optimality guarantees
3 years ago
huangyuxin
ab759b16de
Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into CLI
3 years ago
huangyuxin
38edfd1a89
Add Deepspeech2 online and offline in cli
3 years ago
TianYuan
d368d57d67
fix low ips bug of speedyspeech and fastspeech2, test=tts ( #1349 )
3 years ago
TianYuan
9c7f0762b0
update racotron2 and transformer tts, test=tts
3 years ago
huangyuxin
8028f33b7f
synchronize the version
3 years ago
Junkun
44408e5211
sync the variable name to others
3 years ago
Junkun
f866059b74
config and formalize
3 years ago
Junkun
43aad7a018
beam search with optimality guarantees
3 years ago
Jackwaterveg
26524031d2
Merge pull request #1343 from Jackwaterveg/fix
...
[ASR] Fix some bugs
3 years ago
huangyuxin
5e7e8a3e24
fix the u2 export, test=asr
3 years ago
TianYuan
a1867c20c3
fix slice bug of speedyspeech expand, test=tts ( #1337 )
3 years ago
Hui Zhang
ec1c88ae1a
[s2t] remove nltk ( #1332 )
3 years ago
TianYuan
7ae4f7221e
Update length_regulator.py
3 years ago
TianYuan
acfe2b9084
Update duration_predictor.py
3 years ago
TianYuan
caa391f461
fix speedyspeech inference, test=tts ( #1322 )
3 years ago
Jackwaterveg
0c4895cd0b
mv the ctcdecoders to third_part ( #1313 )
3 years ago
TianYuan
8f507ba4ba
Merge pull request #1302 from jerryuhoo/develop
...
[TTS] Add support for finetuning speedyspeech
3 years ago
Jerryuhoo
111a452378
Fix the code format, test=tts
3 years ago
TianYuan
89e988a69e
add csmsc tacotron2, test=tts
3 years ago
TianYuan
c088b9a304
add csmsc tacotron2
3 years ago
huangyuxin
fe1dc9d211
refactor the cli/st, test=st
3 years ago
TianYuan
27bb76bdb9
fix tone_sandhi of yi, test=tts
3 years ago
Jerryuhoo
be99807d61
Add durations to gen_gta_mel.py inference
3 years ago
KP
52a8b2f320
Add ECAPA_TDNN. ( #1301 )
3 years ago
Jerryuhoo
fcc34e3e95
[tts] add gen_gta_mel.py for finetuning speedypeech, test=tts
3 years ago
Jackwaterveg
010aa65b2b
[cli] asr - support English, decode_metod and unified config ( #1297 )
...
* fix config, test=asr
* fix config, test=doc_fix
* add en and decode_method for cli/asr, test=asr
* test=asr
* fix, test=doc_fix
3 years ago
KP
c09466ebbe
Add ECAPA_TDNN. ( #1295 )
3 years ago
TianYuan
fb238d83f4
update vctk voc1, test=tts ( #1294 )
3 years ago
TianYuan
73dc0e2535
fix_ning
3 years ago
billishyahao
ddf184be60
fix some typos
3 years ago
TianYuan
318cc9e539
Merge branch 'develop' into develop
3 years ago
Jackwaterveg
e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
...
[s2t] Fix Batch sampler set epoch
3 years ago
KP
a810cd4e5c
Add cli logging. ( #1274 )
3 years ago
Jerryuhoo
d6e9b76e76
change link_wav.py path, test=tts
3 years ago
Jerryuhoo
c94f346207
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
...
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
3 years ago
Jerryuhoo
e239ee1cd2
add multi-speaker support for finetuning hifigan vocoder
3 years ago
huangyuxin
07d457859d
use pre-commit, test=doc_fix
3 years ago
Hui Zhang
45832f6770
fix default dist_samlper to False
3 years ago
Hui Zhang
3a2db414e6
format code
3 years ago
Hui Zhang
6f651d762e
fix batch sampler set_epoch when epcoh start
3 years ago
TianYuan
680eac02b9
[tts]Update mb melgan ( #1272 )
...
* update mb melgan
* update mb melgan, test=tts
3 years ago
TianYuan
98ce69d0aa
Merge pull request #1259 from jerryuhoo/develop
...
[TTS]Add multi-speaker support for the SpeedySpeech model
3 years ago
huangyuxin
ffadbe22a7
merge the develop, test=asr
3 years ago
JiehangXie
bdc48114a4
Update text_normlization.py
3 years ago
JiehangXie
d88ceef7bc
Fix punctuation bug
...
修复顿号和英文冒号停顿和分句的问题
3 years ago
huangyuxin
8b63485ce3
fix some bug, test=asr
3 years ago
JiehangXie
6065b1b607
Fix punctuation bug
...
修复顿号和英文冒号停顿和分句的问题
3 years ago
Jerry
0719698118
Merge branch 'develop' into develop
3 years ago
AdamBear
36c9eaa437
Cache the TextFeaturizer instance for infer speed improvement. ( #1260 )
3 years ago
huangyuxin
3e2cc898cb
remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo
2dccd5315d
remove useless "other" dataset
3 years ago
Jerryuhoo
f191d0b022
change speaker embedding position
...
Change speaker embedding position into the encoder.
3 years ago
Jerryuhoo
11991b6d35
add multi-speaker support for speedyspeech
3 years ago
huangyuxin
a1d8ab0f99
merge the develop
3 years ago
huangyuxin
c907a8deda
change all recipes
3 years ago
TianYuan
b9a55262f1
Update fastspeech2.py
3 years ago
Hui Zhang
c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler ( #1242 )
...
* batchsampler or distributebatchsampler
* format
3 years ago
Junkun Chen
420709e5ce
[st] Distributed sampler and new dataloader with MIMO ( #1239 )
...
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
3 years ago
TianYuan
fbe3c05137
add style_melgan and hifigan in tts cli, test=tts ( #1241 )
3 years ago
TianYuan
a232cd8b12
Update fastspeech2.py
3 years ago
huangyuxin
41eeed0450
add librispeech asr1
3 years ago
huangyuxin
fb6d1e2c11
merge the develop
3 years ago
huangyuxin
2c5902d7c5
rename decoding to decode
3 years ago
TianYuan
42c109216d
[tts]add style melgan pretraied model ( #1228 )
...
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
3 years ago
Hui Zhang
bb2a370b23
[asr] remove useless conf of librispeech ( #1227 )
...
* remve useless conf
* format code
* update conf
* update conf
* update conf
3 years ago
huangyuxin
c40b6f4062
refactor the train and test config,test=asr
3 years ago
TianYuan
5692b0ff04
fix log for t2s ( #1219 )
3 years ago
TianYuan
b031ee43c4
Merge pull request #1215 from yt605155624/refactor_punc
...
[text]Refactor punc
3 years ago
TianYuan
e1798e1eeb
update
3 years ago
KP
d362d28d35
Remove logging file in cli api.
3 years ago
TianYuan
15b8904fa1
refactor punc
3 years ago
JiehangXie
927c9bbdb6
Fix a bug when sentence inputed contain English words
3 years ago
KP
1632af7706
Update examples/esc50. ( #1203 )
3 years ago
Jerryuhoo
3cbfd7bf35
Add speaker embedding and speaker id for style fastspeech2 inference
3 years ago
Hui Zhang
db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch ( #1191 )
3 years ago
Hui Zhang
d852aee2ff
[asr] logfbank with dither ( #1179 )
...
* fix logfbank dither
* format
3 years ago
KP
9ec2bc8e2e
Update README. test=doc_fix
3 years ago
Jackwaterveg
879857332d
[version]add paddlespeech.__version__ ( #1166 )
...
* add paddlespeech.__version__
* version 0.1.0 is ready
3 years ago