Commit Graph

215 Commits (c4a5ae382524cc1461f172e8659ef39b8a310081)

Author SHA1 Message Date
TianYuan e4a8e15334 Merge pull request #2111 from yt605155624/rm_more_log
2 years ago
TianYuan 496e2dd14b fix Pillow's version
2 years ago
TianYuan bc93bffbb4 replace logger.info with logger.debug in cli, change default log level to INFO
2 years ago
huangyuxin 98cfdc4c05 fix nxpu
2 years ago
huangyuxin 7463df89c5 fix nxpu
2 years ago
huangyuxin 6ec6921255 Merge branch 'webdataset' of https://github.com/Jackwaterveg/DeepSpeech into webdataset
2 years ago
Jackwaterveg 6598216b2f Merge branch 'develop' into webdataset
2 years ago
huangyuxin 9b5655f6ad fix 'print log' in cli
2 years ago
huangyuxin aa12b9ab52 replace s2t.transform with audio.transform
2 years ago
huangyuxin 0c7abc1f17 add training scripts
2 years ago
huangyuxin c7a7b113c8 support multi-gpu training with webdataset
2 years ago
KP bf056c013d Refactor paddleaudio to paddlespeech.audio
2 years ago
Hui Zhang dfdf450b22 fix #2013; and format
2 years ago
huangyuxin e48e1d5e81 fix tiny and local script, test=asr
2 years ago
huangyuxin 47dd61e5b2 refactor ds2, cli, server
2 years ago
TianYuan aa3d151d1d Merge pull request #1994 from Jackwaterveg/develop
2 years ago
huangyuxin 10819e0fa2 not install ctc on win, test=asr
2 years ago
Hui Zhang 42fba661c9 more detail of copyright
2 years ago
Hui Zhang 3d88ac4e68 Merge pull request #1950 from Jackwaterveg/develop
2 years ago
Hui Zhang f07f57a3a8 Merge pull request #1945 from PaddlePaddle/asr_line
2 years ago
huangyuxin b23bde8ec5 tensor.shape => paddle.shape(tensor)
2 years ago
huangyuxin 4c09927f61 fix
2 years ago
huangyuxin e1888f9ae6 remove size,test=asr
2 years ago
Zhangjingyu06 acb19cf465 deepspeech2 modify for kunlun
2 years ago
Zhangjingyu06 b0eaeccd67 deepspeech2 modify for kunlun
2 years ago
Zhangjingyu06 1e91f7da35 deepspeech2 modify for kunlun
2 years ago
huangyuxin 1cdd41bd03 fix pad_sequence, test=asr
2 years ago
Hui Zhang c15278ed80 format
2 years ago
xiongxinlei b1ef434983 update the max len compute method, test=doc
2 years ago
xiongxinlei 0ea39f837b add asr time limt configuration, test=doc
2 years ago
root 9f389a7a33 support cpu, test=asr
2 years ago
root 864041085f replace dist.spawn with dist.launch, test=asr
2 years ago
Hui Zhang fc96130fdc fix speechx core dump when stop immediately after start
2 years ago
Hui Zhang ebde26030b patch func to var
2 years ago
huangyuxin 0df8d80833 remove logfbank from python_speech_features, test=asr
2 years ago
huangyuxin fcdaef6cb4 replace fbank, test=asr
2 years ago
huangyuxin 5912ba53e4 fix log_interval and lr when resume training, test=asr
2 years ago
Hui Zhang 91e24b0480 format code
2 years ago
Jackwaterveg 85b50c4700 Merge pull request #1741 from Jackwaterveg/debug
2 years ago
Hui Zhang c7d9b11529 format
2 years ago
huangyuxin 8e37a7c7f0 remove redundant log, test=doc
2 years ago
Jackwaterveg 8d1ee8262e Merge branch 'develop' into CER
2 years ago
xiongxinlei ff4ddd229e fix the unuseful code, test=doc
2 years ago
xiongxinlei 9c03280ca6 remove debug info, test=doc
2 years ago
xiongxinlei 48fa84bee9 fix the asr online client bug, return None, test=doc
2 years ago
huangyuxin 6e80618e3d add ds2
2 years ago
Honei 9d20a10b5a Merge branch 'develop' into server
2 years ago
Hui Zhang 0cde9f87ab Merge pull request #1710 from Honei/deepspeech_server
2 years ago
xiongxinlei efc269b75f remove unuseful code, test=doc
2 years ago
xiongxinlei 89b102a7dd fix the ws send bug, cache buffer, text=doc
2 years ago
xiongxinlei d21ccd0287 add conformer online server, test=doc
2 years ago
buchongyu 48358055d0 fix misspelling of the word "hack"
2 years ago
huangyuxin ca860e3d2f supplement note
2 years ago
Hui Zhang cb39777a60 format code
3 years ago
Hui Zhang 61941d14b0 Merge pull request #1627 from WilliamZhang06/ws-develop
3 years ago
WilliamZhang06 d847fe29cf added online asr engine , test=doc
3 years ago
Hui Zhang 943d4ac1ee Merge pull request #1612 from Jackwaterveg/update
3 years ago
huangyuxin f47146af49 add docstring, test=asr
3 years ago
huangyuxin ed490b66cb update spectrogram, test=asr
3 years ago
Hui Zhang 84d712d493 format code, test=doc
3 years ago
huangyuxin 0ffe1f9114 replace kaidi_fbank with paddleaudio
3 years ago
huangyuxin e1b581b622 fix some bug, test=asr
3 years ago
huangyuxin 6da8465f14 add dist_sampler args, test=asr
3 years ago
huangyuxin a4f5a68074 fix some format, test=asr
3 years ago
huangyuxin d53e1163a6 update the code, test=asr
3 years ago
huangyuxin ab16d8ce3c change default initializer to kaiming_uniform, test=asr
3 years ago
Hui Zhang 75098698d8 format,test=doc
3 years ago
Hui Zhang 6b1fe70100 format code,test=doc
3 years ago
WilliamZhang06 da3ea7bb40 added engine type and asr inference , test=doc
3 years ago
huangyuxin 95d5274aef fix sortagrad, test=asr
3 years ago
huangyuxin aefe9e93a7 add tipc benchmark of conformer
3 years ago
TianYuan 7dc1f2daa3 fix some librosa bugs, test=tts
3 years ago
huangyuxin 9a55783aa0 fix resample
3 years ago
huangyuxin 2a42421a63 cli add ds2-librispeech offline, fix versionm, test=asr
3 years ago
Jackwaterveg f49cf838a8 Update u2.py (#1378)
3 years ago
Jackwaterveg d7222c0453 [ASR] Support CTC decoder online (#821)
3 years ago
Junkun 44408e5211 sync the variable name to others
3 years ago
Junkun f866059b74 config and formalize
3 years ago
Junkun 43aad7a018 beam search with optimality guarantees
3 years ago
Jackwaterveg 26524031d2 Merge pull request #1343 from Jackwaterveg/fix
3 years ago
huangyuxin 5e7e8a3e24 fix the u2 export, test=asr
3 years ago
Hui Zhang ec1c88ae1a [s2t] remove nltk (#1332)
3 years ago
Jackwaterveg 0c4895cd0b mv the ctcdecoders to third_part (#1313)
3 years ago
Jackwaterveg 010aa65b2b [cli] asr - support English, decode_metod and unified config (#1297)
3 years ago
billishyahao ddf184be60 fix some typos
3 years ago
Jackwaterveg e69abc9265 Merge pull request #1273 from zh794390558/batch_sampler
3 years ago
huangyuxin 07d457859d use pre-commit, test=doc_fix
3 years ago
Hui Zhang 45832f6770 fix default dist_samlper to False
3 years ago
Hui Zhang 3a2db414e6 format code
3 years ago
Hui Zhang 6f651d762e fix batch sampler set_epoch when epcoh start
3 years ago
huangyuxin 8b63485ce3 fix some bug, test=asr
3 years ago
huangyuxin 3e2cc898cb remove default cfg and fix some bugs,test=asr
3 years ago
huangyuxin a1d8ab0f99 merge the develop
3 years ago
huangyuxin c907a8deda change all recipes
3 years ago
Hui Zhang c81a3f0f83 [s2t] DataLoader with BatchSampler or DistributeBatchSampler (#1242)
3 years ago
Junkun Chen 420709e5ce [st] Distributed sampler and new dataloader with MIMO (#1239)
3 years ago
huangyuxin 41eeed0450 add librispeech asr1
3 years ago
huangyuxin 2c5902d7c5 rename decoding to decode
3 years ago
Hui Zhang bb2a370b23 [asr] remove useless conf of librispeech (#1227)
3 years ago
huangyuxin c40b6f4062 refactor the train and test config,test=asr
3 years ago