colordifference_stdc: use float as in SSE #58

Artoria2e5 · 2021-07-14T09:06:39Z

The SSE implementation of the function uses single-precision float, whereas this one goes for.... double all over the place.

Extremely unscientific comparisons on godbolt (https://gcc.godbolt.org/z/oa3hP5ffs) shows that both Clang and GCC do much better generating x64 code when float is used. Other SIMD systems should act similarly, but I can't remember the target names. (For more human-like code in clang, try -Ofast. I could add an attribute or some assumes there, but ehhhh... sounds unnecessary.)

PS: It might be a good idea to review other internal uses of double too. The two cases left appear to be generally sums and other statistics, which I guess is better with double, and gamma, which has an external double API.

The SSE implementation of the function uses single-precision float, whereas this one goes for.... double all over the place. Extremely unscientific comparisons on godbolt (https://gcc.godbolt.org/z/oa3hP5ffs) shows that both Clang and GCC do much better generating code when float is used.

kornelski · 2021-07-14T10:27:38Z

Thank you

kornelski merged commit c2ce900 into ImageOptim:master Jul 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

colordifference_stdc: use float as in SSE #58

colordifference_stdc: use float as in SSE #58

Uh oh!

Artoria2e5 commented Jul 14, 2021 •

edited

Loading

kornelski commented Jul 14, 2021

Labels

2 participants

colordifference_stdc: use float as in SSE #58

colordifference_stdc: use float as in SSE #58

Uh oh!

Conversation

Artoria2e5 commented Jul 14, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

kornelski commented Jul 14, 2021

Labels

2 participants

Artoria2e5 commented Jul 14, 2021 •

edited

Loading