diff --git a/speechx/examples/u2pp_ol/wenetspeech/RESULTS.md b/speechx/examples/u2pp_ol/wenetspeech/RESULTS.md
index 8d4c2b567..6a8e8c46d 100644
--- a/speechx/examples/u2pp_ol/wenetspeech/RESULTS.md
+++ b/speechx/examples/u2pp_ol/wenetspeech/RESULTS.md
@@ -1,8 +1,12 @@
 # aishell test
 
+7176 utts, duration 36108.9 sec.
+
 ## Attention Rescore
 
-### u2++ CER
+### u2++ FP32
+
+#### CER
 
 ```
 Overall -> 5.75 % N=104765 C=99035 S=5587 D=143 I=294
@@ -11,11 +15,19 @@ English -> 0.00 % N=0 C=0 S=0 D=0 I=0
 Other -> 100.00 % N=3 C=0 S=3 D=0 I=0
 ```
 
-### RTF
+#### RTF
 
 > RTF with feature and decoder which is more end to end.
 
-* Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz, not support `avx512_vnni`, FP32 model
+* Intel(R) Xeon(R) Gold 6271C CPU @ 2.60GHz, support `avx512_vnni`
+
+```
+I1027 10:52:38.662868 51665 u2_recognizer_main.cc:122] total wav duration is: 36108.9 sec
+I1027 10:52:38.662858 51665 u2_recognizer_main.cc:121] total cost:11169.1 sec
+I1027 10:52:38.662876 51665 u2_recognizer_main.cc:123] RTF is: 0.309318
+```
+
+* Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz, not support `avx512_vnni`
 
 ```
 I1026 16:13:26.247121 48038 u2_recognizer_main.cc:123] total wav duration is: 36108.9 sec