unimplemented intrinsics for matrixmultiply
#1405
Comments
FWIW, aarch64 also fails:
|
avx::_mm256_permute2f128_ps and _pd (matrixmultiply)
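For anyone unfamiliar with the intrinsic named above: `_mm256_permute2f128_ps` builds a 256-bit vector out of 128-bit halves picked from its two inputs. Below is a minimal sketch of one use of it (swapping the two halves of a single vector), with a scalar fallback selected by runtime feature detection. The function names `swap_halves`, `swap_halves_avx` and `swap_halves_scalar` are made up for illustration; this is not code from matrixmultiply, and it only assumes that kernels of this kind are reached through a runtime AVX check.

```rust
// AVX path: only compiled on x86_64 and only called after a runtime check.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx")]
unsafe fn swap_halves_avx(v: [f32; 8]) -> [f32; 8] {
    use std::arch::x86_64::*;
    let mut out = [0.0f32; 8];
    unsafe {
        // Unaligned load of all eight lanes.
        let a = _mm256_loadu_ps(v.as_ptr());
        // Control byte 0x01: low half of the result comes from the high
        // half of `a`, high half of the result from the low half of `a`.
        let r = _mm256_permute2f128_ps::<0x01>(a, a);
        _mm256_storeu_ps(out.as_mut_ptr(), r);
    }
    out
}

// Portable reference with the same observable behaviour.
fn swap_halves_scalar(v: [f32; 8]) -> [f32; 8] {
    [v[4], v[5], v[6], v[7], v[0], v[1], v[2], v[3]]
}

// Hypothetical dispatcher: use AVX when the CPU and OS report support for it,
// otherwise fall back to the scalar version.
pub fn swap_halves(v: [f32; 8]) -> [f32; 8] {
    #[cfg(target_arch = "x86_64")]
    {
        if is_x86_feature_detected!("avx") {
            // SAFETY: AVX availability was verified at runtime just above.
            return unsafe { swap_halves_avx(v) };
        }
    }
    swap_halves_scalar(v)
}
```

Calling `swap_halves` on lanes 0 through 7 should give the same result on both paths: lanes 4 to 7 followed by lanes 0 to 3. A codegen backend that lacks the intrinsic would be expected to fail only when the AVX branch is taken, which is one possible reason the same source can behave differently across machines or builds.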
Directly compiling Aarch64:
x86_64:
|
Implemented Edit: Seems those are actually tests that use |
I wrote an entire comment about how I couldn't reproduce any crash on x86 and then I tried using the rustup version instead of the version built from this repo, which did indeed crash with this error message. I'm currently investigating what the difference between the two is that could have caused this. |
Ah, yes I'm using the rustup component, as of:
|
It seems like |
I think I know the issue. |
Just a quick update. I have |
Got matrixmultiply working correctly in the implement_xgetbv branch. You can download a precompiled version from https://github.com/rust-lang/rustc_codegen_cranelift/actions/runs/6763047493 once it is done. I will probably work on implementing the rest of the reported missing intrinsics from other issues before opening a PR. |
Should be fixed in the latest nightly. |
Confirmed, thanks! |
I have some code using ndarray dot products, which in turn calls matrixmultiply::sgemm or dgemm, and these trap when built with cranelift. Here's a reproducer: