Faster convolve1d in numba backend #1378
Conversation
Force-pushed from 66fa69a to 02823cc
Codecov Report
❌ Your patch status has failed because the patch coverage (50.53%) is below the target coverage (100.00%). You can increase the patch coverage or adjust the target coverage.
Additional details and impacted files:
@@ Coverage Diff @@
## main #1378 +/- ##
==========================================
- Coverage 82.07% 82.01% -0.06%
==========================================
Files 206 207 +1
Lines 49174 49250 +76
Branches 8720 8734 +14
==========================================
+ Hits 40359 40394 +35
- Misses 6656 6692 +36
- Partials 2159 2164 +5
Pull Request Overview
This PR reimplements the core logic of convolve1d in the numba backend, yielding a 6× speedup in benchmarks with small inputs, and also optimizes the gradient computation for valid convolutions when the smaller input's shape is known statically (a usage sketch of that gradient case follows the list below). In addition, the PR renames Conv1d to Convolve1d for consistency with the user-facing function name and updates various test and dispatch files to reflect these changes.
- Renames Conv1d to Convolve1d across modules.
- Adds new tests for gradient optimization and benchmarks for numba convolve1d.
- Updates rewriting and dispatch code to support the new implementation.
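As referenced above, here is a hypothetical usage sketch of the gradient case the new rewrite targets. The import path and variable names are assumptions inferred from the files touched by this PR, not confirmed code:

```python
# Hypothetical sketch: a "valid" convolution whose smaller input has a
# static shape, so the rewrite can avoid a full convolve in its gradient.
import pytensor
import pytensor.tensor as pt
from pytensor.tensor.signal import convolve1d  # assumed import path

x = pt.vector("x", shape=(100,))  # larger input
y = pt.vector("y", shape=(5,))    # smaller input, shape known statically
out = convolve1d(x, y, mode="valid")

# Gradient wrt the smaller input; y's static shape is available at rewrite
# time even though it may not be known at the time of grad.
grad_y = pytensor.grad(out.sum(), y)
fn = pytensor.function([x, y], grad_y)
```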
Reviewed Changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated no comments.
File | Description
---|---
tests/tensor/signal/test_conv.py | Updated to import Convolve1d and added a test for the gradient rewrite optimization.
tests/link/numba/signal/test_conv.py | Adjusted tests to optionally swap inputs, and added a benchmark test.
pytensor/tensor/signal/conv.py | Renamed Conv1d to Convolve1d and updated internal variable naming for clarity.
pytensor/tensor/rewriting/conv.py | Added a rewrite rule to optimize valid convolution gradients for static shapes.
pytensor/tensor/rewriting/__init__.py | Imported the new conv rewriting module.
pytensor/link/numba/dispatch/signal/conv.py | Updated to register Convolve1d and implemented specialized numba functions.
pytensor/link/jax/dispatch/signal/conv.py | Updated to register Convolve1d.
lgtm, left ignorable suggestions
if (
    start == len_y - 1
    # equivalent to stop = conv.shape[-1] - len_y - 1
Why not use that form then? I don't understand this comment.
Because I already extracted len_x, and I can use that directly.
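For context, a quick numpy check of the slice relationship under discussion (my own illustration, not code from the PR): for 1D inputs with len_x >= len_y, the "valid" convolution is the "full" convolution sliced at [len_y - 1 : len_x], and since the full output has length len_x + len_y - 1, the stop bound len_x can equivalently be written in terms of conv.shape[-1].

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=10)  # len_x = 10
y = rng.normal(size=4)   # len_y = 4
len_x, len_y = len(x), len(y)

full = np.convolve(x, y, mode="full")    # length len_x + len_y - 1 = 13
valid = np.convolve(x, y, mode="valid")  # length len_x - len_y + 1 = 7

# The valid result is the full result sliced at [len_y - 1 : len_x];
# note len_x == full.shape[-1] - len_y + 1, so the stop bound can be
# expressed either way.
np.testing.assert_allclose(full[len_y - 1 : len_x], valid)
```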
Force-pushed from 02823cc to f1102ba
Force-pushed from f1102ba to e2c8464
These show up in the gradient of Convolve1d.
Force-pushed from e2c8464 to f0ef8fb
Reimplementing the core logic in the numba overload of convolve/correlate gives a 6× speedup in the benchmarked test with relatively small inputs. I guess the overloads don't optimize/propagate constant checks as well? It's a bit surprising, but the results are crystal clear.

Also added a rewrite to optimize the gradient of valid convolutions with respect to the smaller input, in which case we don't need a full convolve. This is done at the rewrite level because the static shape may not be known at the time of grad.
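A minimal sketch of the idea, assuming an explicit loop kernel; the function name and loop structure are illustrative, not the PR's actual implementation:

```python
import numpy as np
from numba import njit

@njit(cache=True)
def valid_convolve1d(x, y):  # hypothetical helper name
    len_x, len_y = x.shape[0], y.shape[0]
    out = np.empty(len_x - len_y + 1, dtype=x.dtype)
    for i in range(out.shape[0]):
        acc = 0.0
        for j in range(len_y):
            # Convolution flips the kernel; a correlation would use y[j].
            acc += x[i + j] * y[len_y - 1 - j]
        out[i] = acc
    return out
```

For float inputs this matches np.convolve(x, y, mode="valid"); writing the loop directly sidesteps the generic overload machinery, which may be where the constant checks the author mentions fail to get optimized away.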
Finally, renamed Conv1d to Convolve1d, which is more in line with the user-facing function.