Commit Graph

304 Commits (1a693448faaa07aa54d1a2a1b89c7c7c4d7427a2)

Author SHA1 Message Date
Hui Zhang 53d6baff0b format
2 years ago
Hui Zhang 549d477592 fix code style
2 years ago
Hui Zhang 4d5cfd4003 export param from cnofig
2 years ago
Hui Zhang e3298c79ce Merge branch 'develop' into u2_export
2 years ago
Hui Zhang 260752aa2a using forward_attention_decoder
2 years ago
Hui Zhang 8690a00bd8 add feature pipeline layer(cmvn, fbank), but to_static and jit.layer output is not equal
2 years ago
Hui Zhang 3a8869fba4 rm to_static decarator; configure jit save for ctc_activation
2 years ago
Hui Zhang 1c9f238ba0 configurable export
2 years ago
Hui Zhang 63aeb747b0 more comment
2 years ago
Hui Zhang d638325c46 do not jit save forward; using slice for zeros([0,0,0,0]) tensor
2 years ago
tianhao zhang 663e3ab58e fix dp init
2 years ago
tianhao zhang 6745e9dd6b fix dp init
2 years ago
tianhao zhang 598eb1a5ef Merge branch 'develop' into fix_dp_init
2 years ago
tianhao zhang 9560d650db fix dp init
2 years ago
tianhao zhang 82e04d7815 fix trianer
2 years ago
Hui Zhang 2bb40c41ba
Merge pull request #2351 from Zth9730/fix_deepspeech
2 years ago
tianhao zhang ab92e2c98c fix deepspeech2 decode_wav
2 years ago
TianYuan 795eb7bd10
format paddlespeech with pre-commit (#2331)
2 years ago
tianhao zhang cdcb1a5316 s2t: fix encoder.py
2 years ago
tianhao zhang ed2819d7af fix format test=asr
2 years ago
tianhao zhang ed80b0e2c3 fix multigpu training test=asr
2 years ago
tianhao zhang 733ec7f2bc fix conformer multi-gpu training test=asr
2 years ago
Hui Zhang c1fbfe928e add test
2 years ago
Hui Zhang 05bc258833 update docstring
2 years ago
Hui Zhang 6149daa221 export ctc_activation
2 years ago
huangyuxin 060e337623 fix dataloader factory, test=asr
2 years ago
Hui Zhang 812d80ab1c Merge branch 'develop' of https://github.com/PaddlePaddle/DeepSpeech into u2_export
2 years ago
Hui Zhang e5a6c243f1 fix jit save for conformer
2 years ago
0x45f 4e7106d9e2 Support dy2st
2 years ago
Hui Zhang ef37f73a01 fix cnn cache dy2st shape
2 years ago
0x45f e21cceea51 Remove blank line
2 years ago
0x45f e6ac8881f1 Fix comments
2 years ago
0x45f ac680aa783 Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleSpeech into new_api
2 years ago
0x45f 294b7b00bd Supprot dy2st for conformer
2 years ago
huangyuxin 75997d8277 merge
2 years ago
Hui Zhang e81849277e att cache for streaming asr
2 years ago
Hui Zhang fb40602d94 refactor attention cache
2 years ago
huangyuxin 05d41523ad Merge branch 'develop' into webdataset
2 years ago
huangyuxin 92d1d08b9a fix scripts
2 years ago
TianYuan e4a8e15334
Merge pull request #2111 from yt605155624/rm_more_log
2 years ago
TianYuan 496e2dd14b fix Pillow's version
2 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
huangyuxin 98cfdc4c05 fix nxpu
2 years ago
huangyuxin 7463df89c5 fix nxpu
2 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg 6598216b2f
Merge branch 'develop' into webdataset
2 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
2 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
2 years ago
Hui Zhang dfdf450b22 fix #2013; and format
2 years ago
huangyuxin e48e1d5e81 fix tiny and local script, test=asr
2 years ago
huangyuxin 47dd61e5b2 refactor ds2, cli, server
2 years ago
TianYuan aa3d151d1d
Merge pull request #1994 from Jackwaterveg/develop
2 years ago
huangyuxin 10819e0fa2 not install ctc on win, test=asr
2 years ago
Hui Zhang 42fba661c9 more detail of copyright
2 years ago
Hui Zhang 3d88ac4e68
Merge pull request #1950 from Jackwaterveg/develop
2 years ago
Hui Zhang f07f57a3a8
Merge pull request #1945 from PaddlePaddle/asr_line
2 years ago
huangyuxin b23bde8ec5 tensor.shape => paddle.shape(tensor)
2 years ago
huangyuxin 4c09927f61 fix
2 years ago
huangyuxin e1888f9ae6 remove size,test=asr
2 years ago
Zhangjingyu06 acb19cf465 deepspeech2 modify for kunlun
2 years ago
Zhangjingyu06 b0eaeccd67 deepspeech2 modify for kunlun
2 years ago
Zhangjingyu06 1e91f7da35 deepspeech2 modify for kunlun
2 years ago
huangyuxin 1cdd41bd03 fix pad_sequence, test=asr
2 years ago
Hui Zhang c15278ed80 format
2 years ago
xiongxinlei b1ef434983 update the max len compute method, test=doc
2 years ago
xiongxinlei 0ea39f837b add asr time limt configuration, test=doc
2 years ago
root 9f389a7a33 support cpu, test=asr
2 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
2 years ago
Hui Zhang fc96130fdc fix speechx core dump when stop immediately after start
2 years ago
Hui Zhang ebde26030b patch func to var
2 years ago
huangyuxin 0df8d80833 remove logfbank from python_speech_features, test=asr
2 years ago
huangyuxin fcdaef6cb4 replace fbank, test=asr
2 years ago
huangyuxin 5912ba53e4 fix log_interval and lr when resume training, test=asr
2 years ago
Hui Zhang 91e24b0480 format code
2 years ago
Jackwaterveg 85b50c4700
Merge pull request #1741 from Jackwaterveg/debug
2 years ago
Hui Zhang c7d9b11529 format
2 years ago
huangyuxin 8e37a7c7f0 remove redundant log, test=doc
2 years ago
Jackwaterveg 8d1ee8262e
Merge branch 'develop' into CER
2 years ago
xiongxinlei ff4ddd229e fix the unuseful code, test=doc
2 years ago
xiongxinlei 9c03280ca6 remove debug info, test=doc
2 years ago
xiongxinlei 48fa84bee9 fix the asr online client bug, return None, test=doc
2 years ago
huangyuxin 6e80618e3d add ds2
2 years ago
Honei 9d20a10b5a
Merge branch 'develop' into server
2 years ago
Hui Zhang 0cde9f87ab
Merge pull request #1710 from Honei/deepspeech_server
2 years ago
xiongxinlei efc269b75f remove unuseful code, test=doc
2 years ago
xiongxinlei 89b102a7dd fix the ws send bug, cache buffer, text=doc
2 years ago
xiongxinlei d21ccd0287 add conformer online server, test=doc
2 years ago
buchongyu 48358055d0 修改hack 单词拼写错误
2 years ago
huangyuxin ca860e3d2f supplement note
2 years ago
Hui Zhang cb39777a60 format code
2 years ago
Hui Zhang 61941d14b0
Merge pull request #1627 from WilliamZhang06/ws-develop
2 years ago
WilliamZhang06 d847fe29cf added online asr engine , test=doc
2 years ago
Hui Zhang 943d4ac1ee
Merge pull request #1612 from Jackwaterveg/update
2 years ago
huangyuxin f47146af49 add docstring, test=asr
2 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
2 years ago
Hui Zhang 84d712d493 format code, test=doc
2 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
2 years ago
huangyuxin e1b581b622 fix some bug, test=asr
2 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
2 years ago
huangyuxin a4f5a68074 fix some format, test=asr
2 years ago
huangyuxin d53e1163a6 update the code, test=asr
2 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
2 years ago
Hui Zhang 75098698d8 format,test=doc
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
WilliamZhang06 da3ea7bb40 added engine type and asr inference , test=doc
3 years ago
huangyuxin 95d5274aef fix sortagrad, test=asr
3 years ago
huangyuxin aefe9e93a7 add tipc benchmark of conformer
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
huangyuxin 9a55783aa0 fix resample
3 years ago
huangyuxin 2a42421a63 cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Jackwaterveg f49cf838a8
Update u2.py (#1378)
3 years ago
Jackwaterveg d7222c0453
[ASR] Support CTC decoder online (#821)
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
Junkun 43aad7a018 beam search with optimality guarantees
3 years ago
Jackwaterveg 26524031d2
Merge pull request #1343 from Jackwaterveg/fix
3 years ago
huangyuxin 5e7e8a3e24 fix the u2 export, test=asr
3 years ago
Hui Zhang ec1c88ae1a
[s2t] remove nltk (#1332)
3 years ago
Jackwaterveg 0c4895cd0b
mv the ctcdecoders to third_part (#1313)
3 years ago
Jackwaterveg 010aa65b2b
[cli] asr - support English, decode_metod and unified config (#1297)
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago
Jackwaterveg e69abc9265
Merge pull request #1273 from zh794390558/batch_sampler
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
Hui Zhang 45832f6770 fix default dist_samlper to False
3 years ago
Hui Zhang 3a2db414e6 format code
3 years ago
Hui Zhang 6f651d762e fix batch sampler set_epoch when epcoh start
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
Hui Zhang c81a3f0f83
[s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
3 years ago
Junkun Chen 420709e5ce
[st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin 2c5902d7c5 rename decoding to decode
3 years ago
Hui Zhang bb2a370b23
[asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago
TianYuan 5692b0ff04
fix log for t2s (#1219)
3 years ago
KP d362d28d35 Remove logging file in cli api.
3 years ago
Hui Zhang db121226b8
clean aishell asr1 conf & compare ctc loss with torch and warpctc_pytorch (#1191)
3 years ago
Hui Zhang d852aee2ff
[asr] logfbank with dither (#1179)
3 years ago
Jackwaterveg 2bccde3def
update the version of ctcdecoders and feat,test=doc_fix (#1155)
3 years ago
Jackwaterveg 0151f2463f
fix bug of pad_sequence in u2,test=asr (#1153)
3 years ago
Jackwaterveg 68164dd39f
[asr]rename test_hub to test_wav (#1132)
3 years ago
Jackwaterveg 5b446f6321
[Config]clear the u2 decode config for asr (#1107)
3 years ago
Hui Zhang 51d7a07c6d
format and fix pre-commit (#1120)
3 years ago
Hui Zhang 764a5d4271
Merge branch 'develop' into ctc
3 years ago
Hui Zhang b1c80c45e0 remove ctc grad norm type in config
3 years ago