TianYuan
|
318cc9e539
|
Merge branch 'develop' into develop
|
3 years ago |
Jackwaterveg
|
e69abc9265
|
Merge pull request #1273 from zh794390558/batch_sampler
[s2t] Fix Batch sampler set epoch
|
3 years ago |
KP
|
a810cd4e5c
|
Add cli logging. (#1274)
|
3 years ago |
Jerryuhoo
|
d6e9b76e76
|
change link_wav.py path, test=tts
|
3 years ago |
Jerryuhoo
|
c94f346207
|
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/
move link_wav.py to paddlespeech/t2s/exps/gan_vocoder/ so that different vocoders can use one script.
|
3 years ago |
Jerryuhoo
|
e239ee1cd2
|
add multi-speaker support for finetuning hifigan vocoder
|
3 years ago |
huangyuxin
|
07d457859d
|
use pre-commit, test=doc_fix
|
3 years ago |
Hui Zhang
|
45832f6770
|
fix default dist_samlper to False
|
3 years ago |
Hui Zhang
|
3a2db414e6
|
format code
|
3 years ago |
Hui Zhang
|
6f651d762e
|
fix batch sampler set_epoch when epcoh start
|
3 years ago |
TianYuan
|
680eac02b9
|
[tts]Update mb melgan (#1272)
* update mb melgan
* update mb melgan, test=tts
|
3 years ago |
TianYuan
|
98ce69d0aa
|
Merge pull request #1259 from jerryuhoo/develop
[TTS]Add multi-speaker support for the SpeedySpeech model
|
3 years ago |
huangyuxin
|
ffadbe22a7
|
merge the develop, test=asr
|
3 years ago |
JiehangXie
|
bdc48114a4
|
Update text_normlization.py
|
3 years ago |
JiehangXie
|
d88ceef7bc
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
huangyuxin
|
8b63485ce3
|
fix some bug, test=asr
|
3 years ago |
JiehangXie
|
6065b1b607
|
Fix punctuation bug
修复顿号和英文冒号停顿和分句的问题
|
3 years ago |
Jerry
|
0719698118
|
Merge branch 'develop' into develop
|
3 years ago |
AdamBear
|
36c9eaa437
|
Cache the TextFeaturizer instance for infer speed improvement. (#1260)
|
3 years ago |
huangyuxin
|
3e2cc898cb
|
remove default cfg and fix some bugs,test=asr
|
3 years ago |
Jerryuhoo
|
2dccd5315d
|
remove useless "other" dataset
|
3 years ago |
Jerryuhoo
|
f191d0b022
|
change speaker embedding position
Change speaker embedding position into the encoder.
|
3 years ago |
Jerryuhoo
|
11991b6d35
|
add multi-speaker support for speedyspeech
|
3 years ago |
huangyuxin
|
a1d8ab0f99
|
merge the develop
|
3 years ago |
huangyuxin
|
c907a8deda
|
change all recipes
|
3 years ago |
TianYuan
|
b9a55262f1
|
Update fastspeech2.py
|
3 years ago |
Hui Zhang
|
c81a3f0f83
|
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
* batchsampler or distributebatchsampler
* format
|
3 years ago |
Junkun Chen
|
420709e5ce
|
[st] Distributed sampler and new dataloader with MIMO (#1239)
* update timit result, test=doc_fix
* result update
* fix bug
* add triplet loader
* empty preprocess file
* sync to u2, updating
* sync to u2 config
* fix bugs
* code refine
* update config
* customize decoding batch size
* update optimizer and lr scheduler
* minor
* minor
* minor
* fix bugs of refs
* minor
* distributed sampler
* minor
* refine the loader
|
3 years ago |
TianYuan
|
fbe3c05137
|
add style_melgan and hifigan in tts cli, test=tts (#1241)
|
3 years ago |
TianYuan
|
a232cd8b12
|
Update fastspeech2.py
|
3 years ago |
huangyuxin
|
41eeed0450
|
add librispeech asr1
|
3 years ago |
huangyuxin
|
fb6d1e2c11
|
merge the develop
|
3 years ago |
huangyuxin
|
2c5902d7c5
|
rename decoding to decode
|
3 years ago |
TianYuan
|
42c109216d
|
[tts]add style melgan pretraied model (#1228)
* add style melgan pretraied model
* add style melgan pretraied model, test=tts
Co-authored-by: Hui Zhang <zhtclz@foxmail.com>
|
3 years ago |
Hui Zhang
|
bb2a370b23
|
[asr] remove useless conf of librispeech (#1227)
* remve useless conf
* format code
* update conf
* update conf
* update conf
|
3 years ago |
huangyuxin
|
c40b6f4062
|
refactor the train and test config,test=asr
|
3 years ago |
TianYuan
|
5692b0ff04
|
fix log for t2s (#1219)
|
3 years ago |
TianYuan
|
b031ee43c4
|
Merge pull request #1215 from yt605155624/refactor_punc
[text]Refactor punc
|
3 years ago |
TianYuan
|
e1798e1eeb
|
update
|
3 years ago |
KP
|
d362d28d35
|
Remove logging file in cli api.
|
3 years ago |
TianYuan
|
15b8904fa1
|
refactor punc
|
3 years ago |
JiehangXie
|
927c9bbdb6
|
Fix a bug when sentence inputed contain English words
|
3 years ago |
KP
|
1632af7706
|
Update examples/esc50. (#1203)
|
3 years ago |
Jerryuhoo
|
3cbfd7bf35
|
Add speaker embedding and speaker id for style fastspeech2 inference
|
3 years ago |
Hui Zhang
|
db121226b8
|
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
|
3 years ago |
Hui Zhang
|
d852aee2ff
|
[asr] logfbank with dither (#1179)
* fix logfbank dither
* format
|
3 years ago |
KP
|
9ec2bc8e2e
|
Update README. test=doc_fix
|
3 years ago |
Jackwaterveg
|
879857332d
|
[version]add paddlespeech.__version__ (#1166)
* add paddlespeech.__version__
* version 0.1.0 is ready
|
3 years ago |
TianYuan
|
19ef7210a0
|
[TTS]Add hifigan (#1097)
* add hifigan
* add hifigan
* integrate synthesize synthesize_e2e, inference for tts, test=tts
* add some python files, test=tts
* update readme, test=doc_fix
|
3 years ago |
TianYuan
|
675cff258b
|
[TTS]fix praatio version, test=tts (#1158)
* fix praatio version, test=tts
* fix praatio version, test=tts
|
3 years ago |
Jackwaterveg
|
e9748faa71
|
[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154)
* optimize the cli/asr,test=asr
* test=doc_fix
|
3 years ago |
Jackwaterveg
|
2bccde3def
|
update the version of ctcdecoders and feat,test=doc_fix (#1155)
|
3 years ago |
Jackwaterveg
|
0151f2463f
|
fix bug of pad_sequence in u2,test=asr (#1153)
|
3 years ago |
Jackwaterveg
|
68164dd39f
|
[asr]rename test_hub to test_wav (#1132)
* add the readme, librispeech_asr1
* fix the test_hub
* test=asr
|
3 years ago |
KP
|
16d6ed3842
|
Add automatic_video_subtitiles demo.
|
3 years ago |
KP
|
7394a18732
|
Add default arguments in cls python api.
|
3 years ago |
TianYuan
|
f9efbf3063
|
Update generate_lexicon.py
|
3 years ago |
Jackwaterveg
|
5b446f6321
|
[Config]clear the u2 decode config for asr (#1107)
* clear the u2 decode config
* rename the vocab_filepath and cmvn_path
|
3 years ago |
KP
|
074559fe90
|
[CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
* Refactor punctuation_restoration.
* Add text cli and punc demo.
|
3 years ago |
Hui Zhang
|
51d7a07c6d
|
format and fix pre-commit (#1120)
|
3 years ago |
TianYuan
|
5f0f76f249
|
add eval() for inference model (#1114)
|
3 years ago |
TianYuan
|
59e4a34071
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli
|
3 years ago |
TianYuan
|
3de4130dfc
|
update am name
|
3 years ago |
TianYuan
|
9db1710ba7
|
add conformer demos (#1108)
|
3 years ago |
TianYuan
|
3fe75f833d
|
Merge pull request #1109 from yt605155624/tts_cli
[cli]update voc name
|
3 years ago |
TianYuan
|
ca12a83d5a
|
update voc name
|
3 years ago |
TianYuan
|
965a57ef0e
|
Update README.md
|
3 years ago |
Jackwaterveg
|
9e31a606d1
|
set default encoding utf8 for win (#1101)
Co-authored-by: KP <109694228@qq.com>
|
3 years ago |
Hui Zhang
|
764a5d4271
|
Merge branch 'develop' into ctc
|
3 years ago |
Hui Zhang
|
b1c80c45e0
|
remove ctc grad norm type in config
|
3 years ago |
huangyuxin
|
1d4002409f
|
separate the sox and soxbindings with the requirements
|
3 years ago |
TianYuan
|
df5fe035e5
|
Update README.md
|
3 years ago |
TianYuan
|
a6e0a69da8
|
Merge pull request #1095 from KPatr1ck/demo
[Demo]Add tts demo.
|
3 years ago |
TianYuan
|
963e906f56
|
Merge pull request #1068 from yt605155624/add_style_melgan
[TTS]add style_melgan
|
3 years ago |
KP
|
1909f2f620
|
Add tts demo.
|
3 years ago |
KP
|
3701fba0be
|
Update download logic and fix README typos.
|
3 years ago |
TianYuan
|
f701882b66
|
update add_style_melgan
|
3 years ago |
gongel
|
dc60aeb8c2
|
format
|
3 years ago |
gongel
|
31510d088c
|
refactor: rm kaldi_io
|
3 years ago |
TianYuan
|
2189b46004
|
add tts cli
|
3 years ago |
KP
|
70a8a75476
|
Add st demo.
|
3 years ago |
Hui Zhang
|
6dedb63e8b
|
Merge pull request #1087 from Jackwaterveg/setup
[ctcdecoders] Separate the ctcdecoders
|
3 years ago |
huangyuxin
|
9fe0beee54
|
fix the bug: miss import after install
|
3 years ago |
huangyuxin
|
cea5ffe0e4
|
refactor the code
|
3 years ago |
gongel
|
20d88ec673
|
refactor: update params/input/output/namestyle
|
3 years ago |
KP
|
6c1e6e7876
|
Update recommended model to cnn14 and argument name in __call__.
|
3 years ago |
huangyuxin
|
ed12db61a6
|
Separate the ctcdecoders
|
3 years ago |
KP
|
0b7e0d1e2e
|
Update tags of pretrained_models.
|
3 years ago |
KP
|
d08b824d72
|
Update README.
|
3 years ago |
KP
|
61e39daccc
|
Optimize model init.
|
3 years ago |
KP
|
528c70e515
|
Remove TODO.
|
3 years ago |
KP
|
b072453ca8
|
Fix decompressing problem.
|
3 years ago |
KP
|
29da318379
|
Add audio classification cli.
|
3 years ago |
gongel
|
f5c61ced28
|
feat: add st cli
|
3 years ago |
Hui Zhang
|
0818c1601d
|
add __init__.py
|
3 years ago |
TianYuan
|
7b2ecb6eed
|
add style_melgan, test=tts
|
3 years ago |
Hui Zhang
|
03678c08c5
|
Merge branch 'develop' into fix_cli
|
3 years ago |
huangyuxin
|
1b57d05d1b
|
rm the os.chdir in cli asr
|
3 years ago |
TianYuan
|
aead853b1d
|
Update zh_frontend.py
|
3 years ago |
huangyuxin
|
021311c76b
|
add transformer to cli infer
|
3 years ago |
TianYuan
|
a070524d37
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
|
3 years ago |
TianYuan
|
dd36eafe34
|
add style_melgan
|
3 years ago |
KP
|
54cf048b2a
|
Merge remote-tracking branch 'update_stream/develop' into cli
|
3 years ago |
huangyuxin
|
a258a34ec0
|
revise the convert pcm
|
3 years ago |
Jackwaterveg
|
8ec576f477
|
Update infer.py
|
3 years ago |
huangyuxin
|
b0356ae489
|
revise
|
3 years ago |
huangyuxin
|
957f2e3a1c
|
revise
|
3 years ago |
huangyuxin
|
aee530af27
|
revise the sample rate
|
3 years ago |
Junkun
|
4e31a4445d
|
eval mode
|
3 years ago |
KP
|
a19e51d7da
|
Update python api.
|
3 years ago |
KP
|
e0642ffc77
|
Update doc strings.
|
3 years ago |
huangyuxin
|
90d648a601
|
support using by __call__
|
3 years ago |
huangyuxin
|
aecb5f567c
|
Merge branch 'tmp' into 1048
|
3 years ago |
KP
|
44e9b032d5
|
Update inputs and outputs of executor.
|
3 years ago |
huangyuxin
|
3fadcde5e2
|
revise the asr infer.py
|
3 years ago |
Hui Zhang
|
4823892169
|
Merge pull request #1058 from Jackwaterveg/benchmark
[benchmark]fix the benchmark
|
3 years ago |
Junkun
|
3a14b82844
|
minor
|
3 years ago |
Junkun
|
f50a2ab4ca
|
fix bugs
|
3 years ago |
huangyuxin
|
cb383a39c3
|
fix the benchmark
|
3 years ago |
huangyuxin
|
d0bf506fee
|
fix the load checkpoint
|
3 years ago |
KP
|
1707244472
|
Update device usage.
|
3 years ago |
KP
|
000294132c
|
Rename s2t to asr.
|
3 years ago |
huangyuxin
|
43f4d47bfa
|
add the call in infer.py
|
3 years ago |
Hui Zhang
|
39228864bb
|
format code
|
3 years ago |
Hui Zhang
|
d395c2b8e3
|
jsonlines reade manifest file
|
3 years ago |
Hui Zhang
|
7554b6107a
|
using visualdl; fix read_manifest
|
3 years ago |
huangyuxin
|
cdc8520969
|
add the infer
|
3 years ago |
KP
|
c94ebdc52c
|
Add python api for executor.
|
3 years ago |
Junkun
|
d2fab3238b
|
fix bugs
|
3 years ago |
Junkun
|
cdd0845127
|
add translate function
|
3 years ago |
KP
|
e9798498d6
|
Update asr inference in paddlespeech.cli.
|
3 years ago |
huangyuxin
|
895a086fdd
|
rename the config.feat_size and the config.vocab.size to input_size and output_size
|
3 years ago |
KP
|
4d39a7746e
|
Add paddlespeech.cli.
|
3 years ago |
KP
|
98f0806353
|
Add paddlespeech.cli.
|
3 years ago |
TianYuan
|
6e3257ab8a
|
Create __init__.py
|
3 years ago |
TianYuan
|
022f1ce8e9
|
Merge pull request #1040 from yt605155624/fix_frontend
[TTS]update text frontend
|
3 years ago |
TianYuan
|
a861e56e91
|
rm space for pure Chinese
|
3 years ago |
TianYuan
|
dad1cbbcd6
|
update text frontend
|
3 years ago |
KP
|
6e1ac1cc15
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
33f0e7622c
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
2c531d78ac
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
bdb3ce23ee
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
KP
|
1189117784
|
Add paddlespeech.cls and esc50 example.
|
3 years ago |
Hui Zhang
|
2bbfdbae91
|
Merge pull request #1015 from yt605155624/fs2_conformer
[TTS]fastspeech2 conformer
|
3 years ago |
TianYuan
|
b0a1d8ab60
|
fix base
|
3 years ago |
TianYuan
|
469329221b
|
refactor encoder, rm old code
|
3 years ago |
Hui Zhang
|
fe83adfbcb
|
nproc to ngpu
|
3 years ago |
Hui Zhang
|
789471bfca
|
test wav for u2
|
3 years ago |
TianYuan
|
bc0dd51149
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
|
3 years ago |
Jackwaterveg
|
09931d2ccc
|
Merge pull request #1019 from zh794390558/feat
[bugfix] Kaldi Feature using dither in train
|
3 years ago |
huangyuxin
|
8aebfeac81
|
fix the prc-commit
|
3 years ago |
Hui Zhang
|
56480e1033
|
fix format
|
3 years ago |
Hui Zhang
|
7ec0ed4aaf
|
kaldi feat dither when train
|
3 years ago |
Hui Zhang
|
2ba3f00bbd
|
Merge branch 'develop' into datapipe
|
3 years ago |
Hui Zhang
|
b944418d6f
|
new format data support ds2/st
|
3 years ago |
Hui Zhang
|
0defc658e1
|
update aishell/librispeech transformer result; wenetspeech pretrain conformer result
|
3 years ago |
Hui Zhang
|
d2a05df02e
|
Merge pull request #1014 from Jackwaterveg/auto_log
[asr]hidden the auto_log
|
3 years ago |
huangyuxin
|
fb6974f950
|
update the auto_log
|
3 years ago |
TianYuan
|
4370c5cfa6
|
Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
|
3 years ago |
Hui Zhang
|
638b96bf07
|
check if cmvn_file in config for u2
|
3 years ago |
Hui Zhang
|
c354e9154b
|
Merge pull request #1003 from yt605155624/fs2_ge2e
[TTS]add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
133ee7db0b
|
rename num_speakers
|
3 years ago |
TianYuan
|
3d5e078c91
|
add conformer
|
3 years ago |
TianYuan
|
a97c7b5206
|
rename spembs
|
3 years ago |
huangyuxin
|
f646d4c3a1
|
renew the setup.py for paddlespeech feat and ctcdecoders
|
3 years ago |
huangyuxin
|
ca06b91fc4
|
renew the setup.py for paddlespeech feat and ctcdecoders
|
3 years ago |
Hui Zhang
|
3bd87bc379
|
add wenet lincense
|
3 years ago |
TianYuan
|
8d025451de
|
add fastspeech2 voice cloning in aishell3
|
3 years ago |
TianYuan
|
c5c9f19091
|
rename to gen_gta_mel.py, remove stats compute when gen fintune data
|
3 years ago |
TianYuan
|
a6ac497f8e
|
add multi-band melgan finetune scripts
|
3 years ago |
Hui Zhang
|
fe29f74a1c
|
Merge pull request #992 from yt605155624/fix_docs
[TTS] add tts tutorial
|
3 years ago |
TianYuan
|
30d09b411d
|
fix style_syn, replace DeepSpeech with PaddleSpeech in readme
|
3 years ago |
TianYuan
|
0bc9450c51
|
Merge branch 'develop' of github.com:PaddlePaddle/DeepSpeech into fix_docs
|
3 years ago |
Hui Zhang
|
f9b66d0d97
|
Remove useless folders (#990)
|
3 years ago |
Hui Zhang
|
2d76638d62
|
more speech domain
|
3 years ago |
TianYuan
|
0fcc5005a2
|
add tts tutorial
|
3 years ago |
Hui Zhang
|
1ae1ead80f
|
more install scripts
|
3 years ago |
Hui Zhang
|
51a6845564
|
Merge pull request #985 from Jackwaterveg/benchmark
revise the benchmark
|
3 years ago |
huangyuxin
|
843ea1c12e
|
revise the benchmark
|
3 years ago |
Hui Zhang
|
080b0431f4
|
format code
|
3 years ago |
Junkun
|
7c8843448c
|
add word reward into beam search.
|
3 years ago |
Hui Zhang
|
9a71c091c5
|
remove debug info and format code
|
3 years ago |
Hui Zhang
|
8b0e344c69
|
fix logfbank using PCM16
|
3 years ago |
Hui Zhang
|
7ceef6c3f5
|
format code
|
3 years ago |
Hui Zhang
|
f9221b4b74
|
fix ctc align
|
3 years ago |
Hui Zhang
|
fb853167d3
|
format code
|
3 years ago |
Hui Zhang
|
18d9abc7a0
|
add sox speed pertrub
|
3 years ago |
Hui Zhang
|
56d06f2aaf
|
Merge pull request #968 from yt605155624/merge_paddlespeech
[TTS] change nprocs to ngpu
|
3 years ago |
Hui Zhang
|
000fac53fe
|
Merge pull request #966 from Jackwaterveg/dev
change the lm dataset dir, add the 'LM_BIN_DIR' in s2 path.sh
|
3 years ago |
Hui Zhang
|
6a7e0265cd
|
add josn global cmvn
|
3 years ago |
Hui Zhang
|
9cdd2643b1
|
fix bug for batch dataloader using
|
3 years ago |
Hui Zhang
|
69bccb4f02
|
fix ctc align
|
3 years ago |
TianYuan
|
bacdf5756b
|
Merge remote-tracking branch 'origin/develop' into merge_paddlespeech
|
3 years ago |
Hui Zhang
|
69055698a2
|
transformer using batch data loader
|
3 years ago |
TianYuan
|
35c37ace17
|
change nprocs to ngpu, add aishell3/voc1
|
3 years ago |
huangyuxin
|
d647cde870
|
change the lm dataset dir
|
3 years ago |
TianYuan
|
6655728b08
|
add reference
|
3 years ago |
Hui Zhang
|
38cf56295a
|
fix reference format
|
3 years ago |
Hui Zhang
|
c463a00f81
|
add reference code license
|
3 years ago |
Hui Zhang
|
2a66c2c13b
|
format code
|
3 years ago |
Hui Zhang
|
e2bcaee4f1
|
merge deepspeech, parakeet and text_processing into paddlespeech
|
3 years ago |