PaddleSpeech

Commit Graph

Author	SHA1	Message	Date
zxcd	538f260061	【ASR】whisper large v3 (#4101 ) * whisper large v3 * add convert.py * mv nlp tokenizer to tiktoken. * fix bug * remove convert.py * add new model file. * fix * fix version number * fix version number * fix some bug * fix bug	3 weeks ago
co63oc	c2dc4dae2d	Fix typos (#4021 ) * Fix * Fix * Fix	6 months ago
megemini	9752f0a03b	[Hackathon 7th] 修复 `asr5` 的 `test.sh` 脚本路径错误 (#3941 ) * [Fix] test parsearg and test.sh * [Update] use short path	9 months ago
Wang Xin	c11b19df90	[Hackathon 7th] updating the paths to utility scripts and modifying method parameters (#3942 ) * fix path error * Update examples/aishell/asr0/local/test.sh * remove some broken symbolic link	9 months ago
megemini	b4c2f3bae3	[Hackathon 7th] 修复 `s2t` 示例错误 (#3950 ) * [Fix] s2t * [Fix] s2t test	9 months ago
megemini	2d7cf7f0e6	[Hackathon 7th] 修复 `asr4` 的 `test_wav` 多余的 argument (#3940 ) * [Fix] test_wav parse arg * [Fix] remove line	10 months ago
张春乔	5069111e6d	[Hackathon 7th] 修复 deepspeech2online 的导出问题 (#3935 ) * Update model.py * Update model.py	10 months ago
megemini	d17361cf8c	[Fix] duplicated arg (#3934 )	10 months ago
megemini	890c87ea93	[Fix] import TimeDomainSpecAugment (#3919 )	10 months ago
megemini	5e8c727fd6	[Hackathon 7th] 修复 `tal_cs` 测试中 0D tensor to 1D (#3913 ) * [Fix] 0D tensor to 1D * [Update] feat dim	10 months ago
megemini	a397ebe207	[Fix] import print_arguments (#3918 )	10 months ago
张春乔	7dc806dc1d	run with aishell/asr3 (#3904 )	10 months ago
张春乔	419af4503c	【Hackathon 7th】Remove parser.add_argument (#3878 ) * Update test_wav.py * Update export.py * Update test_export.py * Update model.py * Update README.md * Apply suggestions from code review * Apply suggestions from code review * Update README.md * Update README.md * Update test.py * Update README.md	10 months ago
张春乔	290d161d8a	【Hackathon 7th】add implemention of strtobool (#3877 ) * add implemention of strtobool * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review	10 months ago
Hui Zhang	a2ae6396ef	old grad clip has 0d tensor problem, fix it (#3334 )	2 years ago
jiamingkong	8432e8626f	Final cleaning; Modified SSL/infer.py and README for wavlm inclusion in model options	2 years ago
jiamingkong	60bd7f202e	Code clean up according to comments in https://github.com/PaddlePaddle/PaddleSpeech/pull/3242	2 years ago
jiamingkong	3b6651ba7c	Adding WavLM implementation	2 years ago
TianHao Zhang	12e3e76092	[ASR] Support Hubert, fintuned on the librispeech dataset (#3088 ) * librispeech hubert, test=asr * librispeech hubert, test=asr * hubert decode * review * copyright, notes, example related * hubert cli * pre-commit format * fix conflicts * fix conflicts * doc related * doc and train config * librispeech.py * support hubert cli	2 years ago
Hui Zhang	8371d14f5d	Merge pull request #3167 from zxcd/amp [ASR] add amp for U2 conformer	2 years ago
Hui Zhang	225737d4e3	[s2t] fix cli args to config (#3194 ) * fix cli args to config * fix train cli	2 years ago
zxcd	bc365cbb52	Merge branch 'develop' into amp	2 years ago
Hui Zhang	df3be4acae	[s2t] move s2t data preprocess into paddlespeech.dataset (#3189 ) * move s2t data preprocess into paddlespeech.dataset * avg model, compute wer, format rsl into paddlespeech.dataset * fix format rsl * fix avg ckpts	2 years ago
zxcd	a1e5f27003	mv scaler.unscale_ blow grad_clip.	2 years ago
zxcd	7399d560e7	fix scaler save and load.	2 years ago
zxcd	fbd27aab41	add amp for U2 conformer.	2 years ago
zxcd	9bf5471613	optional tokenizer and fix some doc. (#3042 )	3 years ago
zxcd	4e9bca177a	[ASR] change optimizer and fix import error, test=asr (#3023 ) * mv dataio.py to s2t.io.speechbrain.dataio mv dataio.py to paddlespeech.s2t.io.speechbrain.dataio * remove transformers import. * change optimizer same with released model * add paddlenlp version in RESULT.md. * fix run.sh * fix data.sh step_num. * add adadelta optimizer config. * fix wav2vec2 test_wav.sh run error. * add tokenizer config.	3 years ago
zxcd	5186319f48	fix load model schedule error, config optional. (#3008 )	3 years ago
zxcd	004a4d6096	[ASR] rm transformers import and modify variable name consistent with infer.py, test=asr (#2929 ) * rm transformers import and modify variable name consistent with infer.py * add condition ctc_prefix_beam_search decode.	3 years ago
zxcd	047092de8e	add wav2vev2_zh aishell recipe, and speechbrain dataloader. (#2916 )	3 years ago
zxcd	4542684694	[ASR] fix Whisper cli model download path error. test=asr (#2679 ) * add all whisper model size support * add choices in parser. * fix Whisper cli model download path error. * fix resource download path. * fix code style	3 years ago
Zth9730	94a487bd81	[ASR] support wav2vec2 command line and demo (#2658 ) * wav2vec2_cli * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * Update RESULTS.md * Update RESULTS.md * Update base_commands.py * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr * wav2vec2 demo update: support different optimizer and lr_schedular, align mdoel, update input type, test=asr	3 years ago
zxcd	b1d3f59bcb	[s2t] add whisper asr large model (#2640 ) * add whisper asr large model decoding, test=asr * fix code style. * fix json code style. * remove resource and fix code style. * fix yapf * add cli and demos, fix some code style. * fix some problem by comment. * fix yapf	3 years ago
Zth9730	8d3494320d	[ASR] wav2vec2_en, test=asr (#2637 ) * wav2vec2_en, test=asr * wav2vec2_en, test=asr * wav2vec2_en, test=asr	3 years ago
Hui Zhang	2c34481ea0	[s2t] quant with wav scp (#2568 ) * add quant hint * add paddleslim * using paddleslim 2.3.4 and paddle 2.4	3 years ago
Zth9730	8d3464c050	[s2t] Update wav2vec2 license (#2600 )	3 years ago
YangZhou	bbf2401e3e	Merge pull request #2524 from zh794390558/u2 [speechx] add u2/u2pp asr inference	3 years ago
Hui Zhang	964c22c677	Merge pull request #2532 from Zth9730/wav2vec2.0 [s2t] fix wav2vec2 report loss bug	3 years ago
tianhao zhang	86f65f0b8e	fix wav2vec2 report loss bug	3 years ago
Hui Zhang	f1ca564731	Merge pull request #2518 from Zth9730/wav2vec2.0 [ASR] wav2vec2 ASR, pre-trained wav2vec2 based CTC for librispeech	3 years ago
tianhao zhang	2ae94bd277	freeze wav2vec2=True, change loss report and update README.md	3 years ago
tianhao zhang	3d994f5c23	format wav2vec2 demo	3 years ago
tianhao zhang	19180d359d	format wav2vec2 demo	3 years ago
tianhao zhang	6e429f0513	support wav2vec2ASR on librispeech	3 years ago
Hui Zhang	290c23b9d7	add u2 nnet, u2 nnet main, codelab, and can compile	3 years ago
tianhao zhang	cda440e6f0	use reverse_weight in decode.yaml	3 years ago
Hui Zhang	1f4f98b171	fix bug	3 years ago
Hui Zhang	e86337a423	fix bug	3 years ago
Hui Zhang	2a75405e9a	Merge branch 'develop' into u2pp_export	3 years ago

1 2 3

145 Commits (develop)