283 Commits (d36a4ccfc89fe1edd74f77baf355b16154247fce)

Author SHA1 Message Date
AdamBear 36c9eaa437
Cache the TextFeaturizer instance for infer speed improvement. (#1260)
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
Jerryuhoo 2dccd5315d remove useless "other" dataset
3 years ago
Jerryuhoo f191d0b022 change speaker embedding position
3 years ago
Jerryuhoo 11991b6d35 add multi-speaker support for speedyspeech
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
TianYuan b9a55262f1
Update fastspeech2.py
3 years ago
Hui Zhang c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
3 years ago
Junkun Chen 420709e5ce
[st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
TianYuan fbe3c05137
add style_melgan and hifigan in tts cli, test=tts (#1241)
3 years ago
TianYuan a232cd8b12
Update fastspeech2.py
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin fb6d1e2c11 merge the develop
3 years ago
huangyuxin 2c5902d7c5 rename decoding to decode
3 years ago
TianYuan 42c109216d
[tts]add style melgan pretraied model (#1228)
3 years ago
Hui Zhang bb2a370b23
[asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago
TianYuan 5692b0ff04
fix log for t2s (#1219)
3 years ago
TianYuan b031ee43c4
Merge pull request #1215 from yt605155624/refactor_punc
3 years ago
TianYuan e1798e1eeb update
3 years ago
KP d362d28d35 Remove logging file in cli api.
3 years ago
TianYuan 15b8904fa1 refactor punc
3 years ago
JiehangXie 927c9bbdb6 Fix a bug when sentence inputed contain English words
3 years ago
KP 1632af7706
Update examples/esc50. (#1203)
3 years ago
Jerryuhoo 3cbfd7bf35 Add speaker embedding and speaker id for style fastspeech2 inference
3 years ago
Hui Zhang db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
3 years ago
Hui Zhang d852aee2ff
[asr] logfbank with dither (#1179)
3 years ago
KP 9ec2bc8e2e Update README. test=doc_fix
3 years ago
Jackwaterveg 879857332d
[version]add paddlespeech.__version__ (#1166)
3 years ago
TianYuan 19ef7210a0
[TTS]Add hifigan (#1097)
3 years ago
TianYuan 675cff258b
[TTS]fix praatio version, test=tts (#1158)
3 years ago
Jackwaterveg e9748faa71
[Cli]optimize the cli, add --yes, and delete transformer_aishell (#1154)
3 years ago
Jackwaterveg 2bccde3def
update the version of ctcdecoders and feat,test=doc_fix (#1155)
3 years ago
Jackwaterveg 0151f2463f
fix bug of pad_sequence in u2,test=asr (#1153)
3 years ago
Jackwaterveg 68164dd39f
[asr]rename test_hub to test_wav (#1132)
3 years ago
KP 16d6ed3842 Add automatic_video_subtitiles demo.
3 years ago
KP 7394a18732 Add default arguments in cls python api.
3 years ago
TianYuan f9efbf3063
Update generate_lexicon.py
3 years ago
Jackwaterveg 5b446f6321
[Config]clear the u2 decode config for asr (#1107)
3 years ago
KP 074559fe90
[CLI][Demo][Text]Refactor punctuation_restoration. (#1013)
3 years ago
Hui Zhang 51d7a07c6d
format and fix pre-commit (#1120)
3 years ago
TianYuan 5f0f76f249
add eval() for inference model (#1114)
3 years ago
TianYuan 59e4a34071 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into tts_cli
3 years ago
TianYuan 3de4130dfc update am name
3 years ago
TianYuan 9db1710ba7
add conformer demos (#1108)
3 years ago
TianYuan 3fe75f833d
Merge pull request #1109 from yt605155624/tts_cli
3 years ago
TianYuan ca12a83d5a update voc name
3 years ago
TianYuan 965a57ef0e
Update README.md
3 years ago
Jackwaterveg 9e31a606d1
set default encoding utf8 for win (#1101)
3 years ago
Hui Zhang 764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang b1c80c45e0 remove ctc grad norm type in config
3 years ago
huangyuxin 1d4002409f separate the sox and soxbindings with the requirements
3 years ago
TianYuan df5fe035e5
Update README.md
3 years ago
TianYuan a6e0a69da8
Merge pull request #1095 from KPatr1ck/demo
3 years ago
TianYuan 963e906f56
Merge pull request #1068 from yt605155624/add_style_melgan
3 years ago
KP 1909f2f620 Add tts demo.
3 years ago
KP 3701fba0be Update download logic and fix README typos.
3 years ago
TianYuan f701882b66 update add_style_melgan
3 years ago
gongel dc60aeb8c2 format
3 years ago
gongel 31510d088c refactor: rm kaldi_io
3 years ago
TianYuan 2189b46004 add tts cli
3 years ago
KP 70a8a75476 Add st demo.
3 years ago
Hui Zhang 6dedb63e8b
Merge pull request #1087 from Jackwaterveg/setup
3 years ago
huangyuxin 9fe0beee54 fix the bug: miss import after install
3 years ago
huangyuxin cea5ffe0e4 refactor the code
3 years ago
gongel 20d88ec673 refactor: update params/input/output/namestyle
3 years ago
KP 6c1e6e7876 Update recommended model to cnn14 and argument name in __call__.
3 years ago
huangyuxin ed12db61a6 Separate the ctcdecoders
3 years ago
KP 0b7e0d1e2e Update tags of pretrained_models.
3 years ago
KP d08b824d72 Update README.
3 years ago
KP 61e39daccc Optimize model init.
3 years ago
KP 528c70e515 Remove TODO.
3 years ago
KP b072453ca8 Fix decompressing problem.
3 years ago
KP 29da318379 Add audio classification cli.
3 years ago
gongel f5c61ced28 feat: add st cli
3 years ago
Hui Zhang 0818c1601d add __init__.py
3 years ago
TianYuan 7b2ecb6eed add style_melgan, test=tts
3 years ago
Hui Zhang 03678c08c5
Merge branch 'develop' into fix_cli
3 years ago
huangyuxin 1b57d05d1b rm the os.chdir in cli asr
3 years ago
TianYuan aead853b1d
Update zh_frontend.py
3 years ago
huangyuxin 021311c76b add transformer to cli infer
3 years ago
TianYuan a070524d37 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into add_style_melgan, test=tts
3 years ago
TianYuan dd36eafe34 add style_melgan
3 years ago
KP 54cf048b2a Merge remote-tracking branch 'update_stream/develop' into cli
3 years ago
huangyuxin a258a34ec0 revise the convert pcm
3 years ago
Jackwaterveg 8ec576f477
Update infer.py
3 years ago
huangyuxin b0356ae489 revise
3 years ago
huangyuxin 957f2e3a1c revise
3 years ago
huangyuxin aee530af27 revise the sample rate
3 years ago
Junkun 4e31a4445d eval mode
3 years ago
KP a19e51d7da Update python api.
3 years ago
KP e0642ffc77 Update doc strings.
3 years ago
huangyuxin 90d648a601 support using by __call__
3 years ago
huangyuxin aecb5f567c Merge branch 'tmp' into 1048
3 years ago
KP 44e9b032d5 Update inputs and outputs of executor.
3 years ago
huangyuxin 3fadcde5e2 revise the asr infer.py
3 years ago
Hui Zhang 4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
3 years ago
Junkun 3a14b82844 minor
3 years ago
Junkun f50a2ab4ca fix bugs
3 years ago
huangyuxin cb383a39c3 fix the benchmark
3 years ago
huangyuxin d0bf506fee fix the load checkpoint
3 years ago
KP 1707244472 Update device usage.
3 years ago
KP 000294132c Rename s2t to asr.
3 years ago
huangyuxin 43f4d47bfa add the call in infer.py
3 years ago
Hui Zhang 39228864bb format code
3 years ago
Hui Zhang d395c2b8e3 jsonlines reade manifest file
3 years ago
Hui Zhang 7554b6107a using visualdl; fix read_manifest
3 years ago
huangyuxin cdc8520969 add the infer
3 years ago
KP c94ebdc52c Add python api for executor.
3 years ago
Junkun d2fab3238b fix bugs
3 years ago
Junkun cdd0845127 add translate function
3 years ago
KP e9798498d6 Update asr inference in paddlespeech.cli.
3 years ago
huangyuxin 895a086fdd rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
KP 4d39a7746e Add paddlespeech.cli.
3 years ago
KP 98f0806353 Add paddlespeech.cli.
3 years ago
TianYuan 6e3257ab8a
Create __init__.py
3 years ago
TianYuan 022f1ce8e9
Merge pull request #1040 from yt605155624/fix_frontend
3 years ago
TianYuan a861e56e91 rm space for pure Chinese
3 years ago
TianYuan dad1cbbcd6 update text frontend
3 years ago
KP 6e1ac1cc15 Add paddlespeech.cls and esc50 example.
3 years ago
KP 33f0e7622c Add paddlespeech.cls and esc50 example.
3 years ago
KP 2c531d78ac Add paddlespeech.cls and esc50 example.
3 years ago
KP bdb3ce23ee Add paddlespeech.cls and esc50 example.
3 years ago
KP 1189117784 Add paddlespeech.cls and esc50 example.
3 years ago
Hui Zhang 2bbfdbae91
Merge pull request #1015 from yt605155624/fs2_conformer
3 years ago
TianYuan b0a1d8ab60 fix base
3 years ago
TianYuan 469329221b refactor encoder, rm old code
3 years ago
Hui Zhang fe83adfbcb nproc to ngpu
3 years ago
Hui Zhang 789471bfca test wav for u2
3 years ago
TianYuan bc0dd51149 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into HEAD
3 years ago
Jackwaterveg 09931d2ccc
Merge pull request #1019 from zh794390558/feat
3 years ago
huangyuxin 8aebfeac81 fix the prc-commit
3 years ago
Hui Zhang 56480e1033 fix format
3 years ago
Hui Zhang 7ec0ed4aaf kaldi feat dither when train
3 years ago
Hui Zhang 2ba3f00bbd Merge branch 'develop' into datapipe
3 years ago
Hui Zhang b944418d6f new format data support ds2/st
3 years ago
Hui Zhang 0defc658e1 update aishell/librispeech transformer result; wenetspeech pretrain conformer result
3 years ago
Hui Zhang d2a05df02e
Merge pull request #1014 from Jackwaterveg/auto_log
3 years ago
huangyuxin fb6974f950 update the auto_log
3 years ago
TianYuan 4370c5cfa6 Merge branch 'develop' of github.com:PaddlePaddle/PaddleSpeech into fs2_conformer
3 years ago
Hui Zhang 638b96bf07 check if cmvn_file in config for u2
3 years ago
Hui Zhang c354e9154b
Merge pull request #1003 from yt605155624/fs2_ge2e
3 years ago
TianYuan 133ee7db0b rename num_speakers
3 years ago
TianYuan 3d5e078c91 add conformer
3 years ago
TianYuan a97c7b5206 rename spembs
3 years ago
huangyuxin f646d4c3a1 renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
huangyuxin ca06b91fc4 renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
Hui Zhang 3bd87bc379 add wenet lincense
3 years ago
TianYuan 8d025451de add fastspeech2 voice cloning in aishell3
3 years ago