Simd vs plain Swift (simd is slower?)

possibly related: Swift SIMD just seems to fallback to scalar operations