Prevent error from being fused with scalar in simd_op_check #8867

stevesuzuki-arm · 2025-11-17T16:27:37Z

Fix the output mismatch in fmls with float16 type where error() was optimized in a way that it is fused with scalar computation. compute_root() makes sure scalar result is computed independently.

stevesuzuki-arm · 2025-11-17T20:43:43Z

In simd_op_check_wasm, i8x16.splat generates

	v128.load8_splat	0

, which was previously

	i32.load8_u	0
	local.tee	19
	i8x16.splat

for data reuse .

zvookin · 2025-11-18T07:32:30Z

test/correctness/simd_op_check.h

        // Include a scalar version
        Halide::Func f_scalar("scalar_" + name);
        f_scalar(x, y) = e;
+        f_scalar.compute_root();


Might be worth a comment as to why this is necessary for correctness.

alexreinking · 2025-11-18T16:42:53Z

In simd_op_check_wasm, i8x16.splat generates
	v128.load8_splat	0

That's coming from these tests:

// Load vector with identical lanes generates *.splat.
check("i8x16.splat", 16 * w, in_u8(0));
check("i16x8.splat", 8 * w, in_u16(0));
check("i32x4.splat", 4 * w, in_u32(0));
check("i64x2.splat", 2 * w, in_u64(0));

I think it's actually an improvement to use v128.load8_splat in these cases and these tests can be updated (along with the comment to read _splat instead of .splat).

alexreinking

I just fixed simd_op_check_wasm myself. Hope this works!

Prevent error from being fused with scalar in simd_op_check

b8ed5c1

Fix the output mismatch in fmls with float16 type where error() was optimized in a way that it is fused with scalar computation. compute_root() makes sure scalar result is computed independently.

alexreinking requested a review from halidebuildbots November 17, 2025 19:11

zvookin reviewed Nov 18, 2025

View reviewed changes

zvookin approved these changes Nov 18, 2025

View reviewed changes

Add comments

d525073

Update vector load*_splat checks in simd_op_check_wasm.cpp

0246ae3

alexreinking approved these changes Nov 18, 2025

View reviewed changes

zvookin merged commit a1de3e3 into halide:main Nov 21, 2025
7 of 18 checks passed

stevesuzuki-arm deleted the pr-no-fusion branch December 1, 2025 20:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prevent error from being fused with scalar in simd_op_check #8867

Prevent error from being fused with scalar in simd_op_check #8867

stevesuzuki-arm commented Nov 17, 2025

Uh oh!

stevesuzuki-arm commented Nov 17, 2025

Uh oh!

zvookin Nov 18, 2025

Uh oh!

stevesuzuki-arm Nov 18, 2025

Uh oh!

alexreinking commented Nov 18, 2025 •

edited

Loading

Uh oh!

alexreinking left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Prevent error from being fused with scalar in simd_op_check #8867

Prevent error from being fused with scalar in simd_op_check #8867

Conversation

stevesuzuki-arm commented Nov 17, 2025

Uh oh!

stevesuzuki-arm commented Nov 17, 2025

Uh oh!

zvookin Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

stevesuzuki-arm Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

alexreinking commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexreinking left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

alexreinking commented Nov 18, 2025 •

edited

Loading