
x64: utils: jit_io_helper: fix xf16 store from Xmm (fixes MFDNN-12635) #2509

Open · wants to merge 2 commits into base: main
Conversation

tczeszun
Contributor

This is a short fix for MFDNN-12635.

The function jit_io_helper_t<Vmm>::store_bf16 doesn't support Xmm, which is why it crashed when using bf16 with a blocked format where channels are blocked by 4.
I changed it to use jit_io_helper_t<Vmm>::store_byte_by_byte instead.
That said, I have another idea: store_bf16 might still work if we introduced an additional Opmask to filter out the lower half of the Xmm. That would require some changes to the jit_io_helper_t class, so I prefer not to submit it now as a bugfix but later (it needs more analysis); that's why I left a TODO comment.

I'm also adding a reorder regression input to benchdnn, as this case was likely not covered on the CPU side.

@tczeszun tczeszun requested review from a team as code owners January 24, 2025 18:33
@github-actions github-actions bot added platform:cpu-x64 Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64 component:tests Codeowner: @oneapi-src/onednn-arch labels Jan 24, 2025
@tczeszun tczeszun added the bug A confirmed library bug label Jan 24, 2025
@tczeszun
Contributor Author

make test
enable benchdnn_nightly
disable test_device_gpu

-    host_->vcvtneps2bf16(cvt_lower_vmm, vmm, Xbyak::VexEncoding);
+    host_->vcvtneps2bf16(cvt_lower_vmm, vmm,
+            mayiuse(avx512_core) ? Xbyak::EvexEncoding
+                                 : Xbyak::VexEncoding);
Contributor

How come an instruction that supports only Evex encoding has an option for Vex encoding?

Contributor Author


Good question, but according to xbyak the mnemonic takes an encoding parameter:
https://github.com/oneapi-src/oneDNN/blob/main/third_party/xbyak/xbyak_mnemonic.h#L1260
And this is not the only place where Vex encoding is used; for example, pooling:
https://github.com/oneapi-src/oneDNN/blob/main/src/cpu/x64/jit_uni_pool_kernel.cpp#L931

Contributor


I think we should limit that instruction to Evex only, regardless of whether xbyak provides an encoding argument.

Contributor Author


I'll open a separate PR for this, as it occurs in many files.


# Test bf16 with aBcde4b format
--reset
--skip-impl=ref,simple # ! test jit version only
Contributor


Suggested change:
-    --skip-impl=ref,simple # ! test jit version only
+    --skip-impl=simple # skip non-jit version

Contributor Author


done

@tczeszun tczeszun force-pushed the tczeszun/fix_io_helper_xf16_xmm branch from c1c72ef to 3d10aca Compare January 24, 2025 19:45
@tczeszun
Contributor Author

make test
enable benchdnn_nightly
disable test_device_gpu
