Use native scalar `fma` instruction #1267
Merged
Cranelift 0.87 now supports lowering `fma` as a libcall on x86, with 0.88 enabling the native x86 instruction under the `has_fma` flag. aarch64 and s390x already support this as a native instruction, so it's nice that we emit it for those.
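For context, here is a minimal sketch of what a scalar fused multiply-add computes, whether it ends up as a libcall or a native instruction. It is plain Rust using the standard library's `f64::mul_add`, not code from this PR or from Cranelift. The helper names `unfused` and `fused` and the input values are made up for illustration.

```rust
/// Unfused: the product `a * b` is rounded to f64 first, then the sum is rounded again.
fn unfused(a: f64, b: f64, c: f64) -> f64 {
    a * b + c
}

/// Fused: `a * b + c` is computed as if exactly, then rounded once.
fn fused(a: f64, b: f64, c: f64) -> f64 {
    a.mul_add(b, c)
}

fn main() {
    // Illustrative inputs: a = 1 + 2^-27, so a * a = 1 + 2^-26 + 2^-54 exactly.
    // The 2^-54 term does not fit in an f64 near 1.0 and is lost when the
    // product is rounded on its own.
    let a = 1.0 + 1.0 / (1u64 << 27) as f64;
    let c = -(1.0 + 1.0 / (1u64 << 26) as f64);

    println!("unfused = {:e}", unfused(a, a, c));
    println!("fused   = {:e}", fused(a, a, c));
}
```

With these inputs the unfused form cancels to zero, while the fused form keeps the low-order term of the product. A libcall lowering and a native `fma` instruction are expected to agree on that single-rounding result, so the choice between them is about code generation rather than numerics.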
We can't lower the SIMD version using the `fma` instruction, since the lowering can fail if the x86 `has_fma` flag is not enabled. Cranelift doesn't yet know how to fall back in these cases.

We need to wait for the 0.87 release before merging this.