
Conversation

Member

@ricardoV94 ricardoV94 commented Apr 24, 2025

Reimplementing the core logic in the numba overload of convolve/correlate gives a speedup of 6x in the benchmarked test with relatively small inputs. I guess the overloads don't optimize/propagate constant checks as well? It's a bit surprising but the results are crystal clear.
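
As an illustration of the kind of hand-written loop that numba compiles well (a sketch only, not the code added in this PR), a direct "valid" 1D convolution can be written as:

    import numpy as np
    from numba import njit

    @njit
    def valid_convolve1d(x, y):
        # "valid" mode: the kernel y must fit entirely inside x,
        # so the output has len(x) - len(y) + 1 entries.
        nx, ny = x.shape[0], y.shape[0]
        out = np.zeros(nx - ny + 1, dtype=x.dtype)
        for i in range(out.shape[0]):
            for j in range(ny):
                # convolution flips the kernel relative to correlation
                out[i] += x[i + j] * y[ny - 1 - j]
        return out

    # matches np.convolve(x, y, mode="valid") whenever len(x) >= len(y)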

Also added a rewrite to optimize the gradient of valid convolutions wrt the smallest input, in which case we don't need a full convolve. This is done at the rewrite level because the static shape may not be known at the time the gradient is taken.
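
For context on why the full convolution can be avoided: the "valid" result is exactly the central slice of the "full" result, which is the pattern the rewrite can look for in the gradient graph. A quick NumPy check of that identity (illustrative only, not PR code):

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=10)   # larger input
    y = rng.normal(size=4)    # smaller input

    full = np.convolve(x, y, mode="full")    # length len(x) + len(y) - 1
    valid = np.convolve(x, y, mode="valid")  # length len(x) - len(y) + 1

    # The valid output is full[start:stop] with
    #   start = len(y) - 1
    #   stop  = full.shape[-1] - (len(y) - 1)   (equivalently, stop = len(x))
    start = len(y) - 1
    stop = full.shape[-1] - (len(y) - 1)
    np.testing.assert_allclose(full[start:stop], valid)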

Finally, renamed Conv1d to Convolve1d, which is more in line with the user-facing function.
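
For reference, a minimal usage sketch of the user-facing function the new Op name mirrors, assuming convolve1d is exposed from pytensor.tensor.signal with a NumPy-style mode argument and that the numba backend is selected via mode="NUMBA":

    import numpy as np
    import pytensor
    import pytensor.tensor as pt
    from pytensor.tensor.signal import convolve1d  # user-facing wrapper around the Convolve1d Op

    x = pt.vector("x")
    y = pt.vector("y")
    z = convolve1d(x, y, mode="valid")

    # compile with the numba backend so the specialized implementation is used
    fn = pytensor.function([x, y], z, mode="NUMBA")
    print(fn(np.arange(10.0), np.ones(3)))  # moving sums of length 3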


codecov bot commented Apr 25, 2025

Codecov Report

❌ Patch coverage is 50.53763% with 46 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.01%. Comparing base (e98cbbc) to head (f0ef8fb).
⚠️ Report is 187 commits behind head on main.

Files with missing lines | Patch % | Lines
pytensor/link/numba/dispatch/signal/conv.py | 26.08% | 33 Missing and 1 partial ⚠️
pytensor/tensor/rewriting/conv.py | 70.00% | 6 Missing and 6 partials ⚠️
Additional details and impacted files


@@            Coverage Diff             @@
##             main    #1378      +/-   ##
==========================================
- Coverage   82.07%   82.01%   -0.06%
==========================================
  Files         206      207       +1
  Lines       49174    49250      +76
  Branches     8720     8734      +14
==========================================
+ Hits        40359    40394      +35
- Misses       6656     6692      +36
- Partials     2159     2164       +5
Files with missing lines | Coverage Δ
pytensor/link/jax/dispatch/signal/conv.py | 100.00% <100.00%> (ø)
pytensor/tensor/signal/conv.py | 97.05% <100.00%> (ø)
pytensor/tensor/rewriting/conv.py | 70.00% <70.00%> (ø)
pytensor/link/numba/dispatch/signal/conv.py | 32.00% <26.08%> (-58.91%) ⬇️

... and 1 file with indirect coverage changes


Copilot AI left a comment


Pull Request Overview

This PR reimplements the core logic of convolve1d in the numba backend for a 6× speedup in benchmarks with small inputs, while also optimizing the gradient computation for valid convolutions when the smaller input’s shape is known statically. In addition, the PR renames Conv1d to Convolve1d for improved consistency in function naming and updates various test and dispatch files to reflect these changes.

  • Renames Conv1d to Convolve1d across modules.
  • Adds new tests for gradient optimization and benchmarks for numba convolve1d.
  • Updates rewriting and dispatch code to support the new implementation.

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated no comments.

Show a summary per file
File | Description
tests/tensor/signal/test_conv.py | Updated to import Convolve1d and added a test for gradient rewrite optimization.
tests/link/numba/signal/test_conv.py | Adjusted tests to optionally swap inputs, and added a benchmark test.
pytensor/tensor/signal/conv.py | Renamed Conv1d to Convolve1d and updated internal variable naming for clarity.
pytensor/tensor/rewriting/conv.py | Added a rewrite rule to optimize valid convolution gradients for static shapes.
pytensor/tensor/rewriting/__init__.py | Imported the new conv rewriting module.
pytensor/link/numba/dispatch/signal/conv.py | Updated to register Convolve1d and implemented specialized numba functions.
pytensor/link/jax/dispatch/signal/conv.py | Updated to register Convolve1d.
Member

@jessegrabowski jessegrabowski left a comment


lgtm, left ignorable suggestions


if (
start == len_y - 1
# equivalent to stop = conv.shape[-1] - len_y - 1
Member


Why not use that form then? I don't understand this comment

Member Author


Because I already extracted len_x, and I can use that directly

@ricardoV94 ricardoV94 force-pushed the faster_conv1d_numba branch from 02823cc to f1102ba Compare April 27, 2025 08:44
@ricardoV94 ricardoV94 force-pushed the faster_conv1d_numba branch from f1102ba to e2c8464 Compare April 27, 2025 08:46
@ricardoV94 ricardoV94 force-pushed the faster_conv1d_numba branch from e2c8464 to f0ef8fb Compare April 27, 2025 08:54
@ricardoV94 ricardoV94 merged commit 4378d48 into pymc-devs:main Apr 27, 2025
72 of 73 checks passed