CI: Continuous benchmarking #36860
Labels: Benchmark (Performance (ASV) benchmarks), CI (Continuous Integration), Performance (Memory or execution speed performance)
I think it would be helpful if pandas had a set of performance benchmarks that run automatically on every CI run (it looks like there is some infrastructure for the asv suite, but it seems it almost never actually runs). It would reduce some of the friction of manually running the asv suite and pasting results as comments, and it would also catch regressions that slip through simply because nobody thought to run the benchmarks.
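For reference, the existing asv suite under `asv_bench/benchmarks/` uses classes with `setup` and `time_*` methods. A minimal illustrative sketch (not an actual benchmark from the suite; the class name and data sizes are made up):

```python
import numpy as np
import pandas as pd


class MergeOnKey:
    # asv runs setup() before timing each time_* method
    def setup(self):
        n = 100_000
        self.left = pd.DataFrame({"key": np.arange(n), "lvalue": np.random.randn(n)})
        self.right = pd.DataFrame({"key": np.arange(n), "rvalue": np.random.randn(n)})

    def time_merge_on_key(self):
        pd.merge(self.left, self.right, on="key")
```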
The pytest plugin pytest-benchmark seems even more lightweight than asv, and appears to be what RAPIDS uses for their own benchmark tool (https://github.com/rapidsai/benchmark). Could it be an option to have a GitHub Action that runs pytest-benchmark on every PR? I don't know a lot about the plugin, but I assume it could be configured to report a delta between master and the PR branch, and possibly between that branch and some baseline commit on master such as a major release. It may also be possible to cache the result from master somewhere to prevent it from being rerun every time.
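For comparison, a pytest-benchmark test uses a `benchmark` fixture that wraps the callable being timed. A minimal sketch (the test name and data sizes are made up):

```python
import numpy as np
import pandas as pd


def test_groupby_sum(benchmark):
    df = pd.DataFrame(
        {
            "key": np.random.randint(0, 100, size=100_000),
            "value": np.random.randn(100_000),
        }
    )
    # benchmark() runs the callable repeatedly, records timing stats,
    # and returns the callable's result
    result = benchmark(lambda: df.groupby("key")["value"].sum())
    assert len(result) <= 100
```

The plugin can also save a run to disk and compare a later run against it, which is what the master-vs-branch delta would rely on.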
What constitutes "failure" is another question; it may be best to configure things to only warn on regressions instead of making the whole run red (which would also help with flaky benchmarks). We would also presumably need fine-grained control over the hardware GitHub assigns to the runners so that timings don't drift, and I'm not sure that's possible.
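A rough sketch of how the compare-and-warn idea could be wired up with pytest-benchmark's save/compare options. The `benchmarks/` directory, baseline name, and 10% threshold are all made up for illustration, and the warn-only behavior here is just "don't propagate pytest's failure exit code":

```python
import subprocess
import sys

# Hypothetical sketch. Assumes a baseline was saved earlier on master with:
#   pytest benchmarks/ --benchmark-save=master_baseline
# and that the saved results in .benchmarks/ are restored by a CI cache step.
result = subprocess.run(
    [
        sys.executable, "-m", "pytest", "benchmarks/",
        "--benchmark-compare",                 # compare against the most recently saved run
        "--benchmark-compare-fail=mean:10%",   # flag regressions worse than 10% on the mean
    ]
)

# Warn-only mode: swallow pytest's "tests failed" exit code (1) so regressions
# are surfaced in the log/comment rather than turning the whole CI run red.
sys.exit(0 if result.returncode in (0, 1) else result.returncode)
```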