Commit Graph

307 Commits (188444f77841725fd720cb1115fd700bc6363615)

Author SHA1 Message Date
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
2 years ago
Hui Zhang dfdf450b22 fix #2013; and format
2 years ago
huangyuxin e48e1d5e81 fix tiny and local script, test=asr
3 years ago
huangyuxin 47dd61e5b2 refactor ds2, cli, server
3 years ago
TianYuan aa3d151d1d
Merge pull request #1994 from Jackwaterveg/develop
3 years ago
huangyuxin 10819e0fa2 not install ctc on win, test=asr
3 years ago
Hui Zhang 42fba661c9 more detail of copyright
3 years ago
Hui Zhang 3d88ac4e68
Merge pull request #1950 from Jackwaterveg/develop
3 years ago
Hui Zhang f07f57a3a8
Merge pull request #1945 from PaddlePaddle/asr_line
3 years ago
huangyuxin b23bde8ec5 tensor.shape => paddle.shape(tensor)
3 years ago
huangyuxin 4c09927f61 fix
3 years ago
huangyuxin e1888f9ae6 remove size,test=asr
3 years ago
Zhangjingyu06 acb19cf465 deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06 b0eaeccd67 deepspeech2 modify for kunlun
3 years ago
Zhangjingyu06 1e91f7da35 deepspeech2 modify for kunlun
3 years ago
huangyuxin 1cdd41bd03 fix pad_sequence, test=asr
3 years ago
Hui Zhang c15278ed80 format
3 years ago
xiongxinlei b1ef434983 update the max len compute method, test=doc
3 years ago
xiongxinlei 0ea39f837b add asr time limt configuration, test=doc
3 years ago
root 9f389a7a33 support cpu, test=asr
3 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
3 years ago
Hui Zhang fc96130fdc fix speechx core dump when stop immediately after start
3 years ago
Hui Zhang ebde26030b patch func to var
3 years ago
huangyuxin 0df8d80833 remove logfbank from python_speech_features, test=asr
3 years ago
huangyuxin fcdaef6cb4 replace fbank, test=asr
3 years ago
huangyuxin 5912ba53e4 fix log_interval and lr when resume training, test=asr
3 years ago
Hui Zhang 91e24b0480 format code
3 years ago
Jackwaterveg 85b50c4700
Merge pull request #1741 from Jackwaterveg/debug
3 years ago
Hui Zhang c7d9b11529 format
3 years ago
huangyuxin 8e37a7c7f0 remove redundant log, test=doc
3 years ago
Jackwaterveg 8d1ee8262e
Merge branch 'develop' into CER
3 years ago
xiongxinlei ff4ddd229e fix the unuseful code, test=doc
3 years ago
xiongxinlei 9c03280ca6 remove debug info, test=doc
3 years ago
xiongxinlei 48fa84bee9 fix the asr online client bug, return None, test=doc
3 years ago
huangyuxin 6e80618e3d add ds2
3 years ago
Honei 9d20a10b5a
Merge branch 'develop' into server
3 years ago
Hui Zhang 0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
3 years ago
xiongxinlei efc269b75f remove unuseful code, test=doc
3 years ago
xiongxinlei 89b102a7dd fix the ws send bug, cache buffer, text=doc
3 years ago
xiongxinlei d21ccd0287 add conformer online server, test=doc
3 years ago
buchongyu 48358055d0 修改hack 单词拼写错误
3 years ago
huangyuxin ca860e3d2f supplement note
3 years ago
Hui Zhang cb39777a60 format code
3 years ago
Hui Zhang 61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
3 years ago
WilliamZhang06 d847fe29cf added online asr engine , test=doc
3 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
3 years ago
huangyuxin f47146af49 add docstring, test=asr
3 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
3 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
huangyuxin a4f5a68074 fix some format, test=asr
3 years ago
huangyuxin d53e1163a6 update the code, test=asr
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
Hui Zhang 75098698d8 format,test=doc
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
WilliamZhang06 da3ea7bb40 added engine type and asr inference , test=doc
3 years ago
huangyuxin 95d5274aef fix sortagrad, test=asr
3 years ago
huangyuxin aefe9e93a7 add tipc benchmark of conformer
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
huangyuxin 9a55783aa0 fix resample
3 years ago
huangyuxin 2a42421a63 cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Jackwaterveg f49cf838a8
Update u2.py (#1378)
3 years ago
Jackwaterveg d7222c0453
[ASR] Support CTC decoder online (#821)
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
Junkun 43aad7a018 beam search with optimality guarantees
3 years ago
Jackwaterveg 26524031d2
Merge pull request #1343 from Jackwaterveg/fix
3 years ago
huangyuxin 5e7e8a3e24 fix the u2 export, test=asr
3 years ago
Hui Zhang ec1c88ae1a
[s2t] remove nltk (#1332)
3 years ago
Jackwaterveg 0c4895cd0b
mv the ctcdecoders to third_part (#1313)
3 years ago
Jackwaterveg 010aa65b2b
[cli] asr - support English, decode_metod and unified config (#1297)
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago
Jackwaterveg e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
Hui Zhang 45832f6770 fix default dist_samlper to False
3 years ago
Hui Zhang 3a2db414e6 format code
3 years ago
Hui Zhang 6f651d762e fix batch sampler set_epoch when epcoh start
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
Hui Zhang c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
3 years ago
Junkun Chen 420709e5ce
[st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin 2c5902d7c5 rename decoding to decode
3 years ago
Hui Zhang bb2a370b23
[asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago
TianYuan 5692b0ff04
fix log for t2s (#1219)
3 years ago
KP d362d28d35 Remove logging file in cli api.
3 years ago
Hui Zhang db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
3 years ago
Hui Zhang d852aee2ff
[asr] logfbank with dither (#1179)
3 years ago
Jackwaterveg 2bccde3def
update the version of ctcdecoders and feat,test=doc_fix (#1155)
3 years ago
Jackwaterveg 0151f2463f
fix bug of pad_sequence in u2,test=asr (#1153)
3 years ago
Jackwaterveg 68164dd39f
[asr]rename test_hub to test_wav (#1132)
3 years ago
Jackwaterveg 5b446f6321
[Config]clear the u2 decode config for asr (#1107)
3 years ago
Hui Zhang 51d7a07c6d
format and fix pre-commit (#1120)
3 years ago
Hui Zhang 764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang b1c80c45e0 remove ctc grad norm type in config
3 years ago
huangyuxin 1d4002409f separate the sox and soxbindings with the requirements
3 years ago
TianYuan 2189b46004 add tts cli
3 years ago
huangyuxin 9fe0beee54 fix the bug: miss import after install
3 years ago
huangyuxin cea5ffe0e4 refactor the code
3 years ago
huangyuxin ed12db61a6 Separate the ctcdecoders
3 years ago
Hui Zhang 0818c1601d add __init__.py
3 years ago
Junkun 4e31a4445d eval mode
3 years ago
Hui Zhang 4823892169
Merge pull request #1058 from Jackwaterveg/benchmark
3 years ago
Junkun 3a14b82844 minor
3 years ago
Junkun f50a2ab4ca fix bugs
3 years ago
huangyuxin cb383a39c3 fix the benchmark
3 years ago
huangyuxin d0bf506fee fix the load checkpoint
3 years ago
Hui Zhang 39228864bb format code
3 years ago
Hui Zhang d395c2b8e3 jsonlines reade manifest file
3 years ago
Hui Zhang 7554b6107a using visualdl; fix read_manifest
3 years ago
Junkun d2fab3238b fix bugs
3 years ago
Junkun cdd0845127 add translate function
3 years ago
huangyuxin 895a086fdd rename the config.feat_size and the config.vocab.size to input_size and output_size
3 years ago
Hui Zhang fe83adfbcb nproc to ngpu
3 years ago
Hui Zhang 789471bfca test wav for u2
3 years ago
Jackwaterveg 09931d2ccc
Merge pull request #1019 from zh794390558/feat
3 years ago
huangyuxin 8aebfeac81 fix the prc-commit
3 years ago
Hui Zhang 56480e1033 fix format
3 years ago
Hui Zhang 7ec0ed4aaf kaldi feat dither when train
3 years ago
Hui Zhang 2ba3f00bbd Merge branch 'develop' into datapipe
3 years ago
Hui Zhang b944418d6f new format data support ds2/st
3 years ago
Hui Zhang 0defc658e1 update aishell/librispeech transformer result; wenetspeech pretrain conformer result
3 years ago
Hui Zhang d2a05df02e
Merge pull request #1014 from Jackwaterveg/auto_log
3 years ago
huangyuxin fb6974f950 update the auto_log
3 years ago
Hui Zhang 638b96bf07 check if cmvn_file in config for u2
3 years ago
huangyuxin f646d4c3a1 renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
huangyuxin ca06b91fc4 renew the setup.py for paddlespeech feat and ctcdecoders
3 years ago
Hui Zhang 3bd87bc379 add wenet lincense
3 years ago
Hui Zhang 1ae1ead80f more install scripts
3 years ago
Hui Zhang 51a6845564
Merge pull request #985 from Jackwaterveg/benchmark
3 years ago
huangyuxin 843ea1c12e revise the benchmark
3 years ago
Hui Zhang 080b0431f4 format code
3 years ago
Junkun 7c8843448c add word reward into beam search.
3 years ago
Hui Zhang 9a71c091c5 remove debug info and format code
3 years ago
Hui Zhang 8b0e344c69 fix logfbank using PCM16
3 years ago
Hui Zhang 7ceef6c3f5 format code
3 years ago
Hui Zhang f9221b4b74 fix ctc align
3 years ago
Hui Zhang fb853167d3 format code
3 years ago
Hui Zhang 18d9abc7a0 add sox speed pertrub
3 years ago
Hui Zhang 000fac53fe
Merge pull request #966 from Jackwaterveg/dev
3 years ago
Hui Zhang 6a7e0265cd add josn global cmvn
3 years ago
Hui Zhang 9cdd2643b1 fix bug for batch dataloader using
3 years ago
Hui Zhang 69bccb4f02 fix ctc align
3 years ago
Hui Zhang 69055698a2 transformer using batch data loader
3 years ago
huangyuxin d647cde870 change the lm dataset dir
3 years ago
Hui Zhang 38cf56295a fix reference format
3 years ago
Hui Zhang c463a00f81 add reference code license
3 years ago
Hui Zhang 2a66c2c13b format code
3 years ago
Hui Zhang e2bcaee4f1 merge deepspeech, parakeet and text_processing into paddlespeech
3 years ago