Verified this compiles with clang-cl + MSVC, and uses the SSE version on x86_64.
Fixes #2953 (closed).