From 937b31a91cf5973b60e6ef0b00f7ccde4aa9fa8a Mon Sep 17 00:00:00 2001 From: Christian Legnitto Date: Wed, 20 Nov 2024 17:48:46 -0400 Subject: [PATCH] Add Zach's twitter link and thank you --- blog/2024-11-21-optimizing-matrix-mul/index.md | 9 ++++++--- 1 file changed, 6 insertions(+), 3 deletions(-) diff --git a/blog/2024-11-21-optimizing-matrix-mul/index.md b/blog/2024-11-21-optimizing-matrix-mul/index.md index 4096329..c1fb5a6 100644 --- a/blog/2024-11-21-optimizing-matrix-mul/index.md +++ b/blog/2024-11-21-optimizing-matrix-mul/index.md @@ -7,9 +7,9 @@ draft: true --- I read the excellent post [Optimizing a WebGPU Matmul Kernel for 1TFLOP+ -Performance](https://www.nuss-and-bolts.com/p/optimizing-a-webgpu-matmul-kernel) by Zach -Nussbaum and thought it might be fun to reimplement it with [Rust -GPU](https://Rust-gpu.github.io/). +Performance](https://www.nuss-and-bolts.com/p/optimizing-a-webgpu-matmul-kernel) by +[Zach Nussbaum](https://x.com/zach_nussbaum) and thought it might be fun to reimplement +it with [Rust GPU](https://Rust-gpu.github.io/). We'll follow Zach's original post closely, comparing and contrasting using Rust vs the WGSL and Typescript from his post. @@ -19,6 +19,9 @@ GPU. +A big thank you to [Zach](https://x.com/zach_nussbaum) for allowing me to reimplement +his blog post! + :::tip The complete runnable code can be [found on