add a page on optimizations and profiling #45

the8472 · 2022-10-03T12:08:29Z

@jyn514 requested a guide on how to benchmark std changes.

jyn514

This is fantastic, thank you so much!! I wouldn't have thought of half of these ideas :)

jyn514 · 2022-10-03T13:48:15Z

src/development/perf-benchmarking.md

+e.g. to run it under `perf stat` or cachegrind.
+
+Build and link the [stage1](https://rustc-dev-guide.rust-lang.org/building/how-to-build-and-run.html#creating-a-rustup-toolchain)
+compiler as rustup toolchain and then use that to build the standalone benchmark with a modified standard library.


Thanks for linking this! I want to reland that PR but haven't had time.
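
For the standalone-benchmark approach in the hunk above, a minimal sketch of such a program might look like the following. The workload (sorting a pseudo-random vector), the names `make_input` and `workload`, and the sizes are all hypothetical placeholders, not anything taken from the PR; the only point is that the binary has a deterministic input and uses `std::hint::black_box` so the std routine under test is not optimized away.

```rust
use std::hint::black_box;
use std::time::Instant;

/// Hypothetical input generator: a small LCG, so the data is deterministic
/// and no external crates are needed.
fn make_input(len: usize) -> Vec<u64> {
    let mut state: u64 = 0x2545_F491_4F6C_DD1D;
    (0..len)
        .map(|_| {
            state = state
                .wrapping_mul(6364136223846793005)
                .wrapping_add(1442695040888963407);
            state >> 33
        })
        .collect()
}

/// Hypothetical workload: the std routine being measured (here a sort),
/// followed by a checksum so the result is observably used.
fn workload(data: &mut Vec<u64>) -> u64 {
    black_box(&mut *data).sort_unstable();
    data.iter().fold(0u64, |acc, &x| acc.wrapping_add(x))
}

fn main() {
    let iterations: u32 = 20;
    let input = make_input(100_000);
    let mut checksum = 0u64;

    // The clone happens inside the timed region; good enough for a sketch,
    // but a real benchmark may want to exclude it.
    let start = Instant::now();
    for _ in 0..iterations {
        let mut data = black_box(input.clone());
        checksum = checksum.wrapping_add(black_box(workload(&mut data)));
    }
    let elapsed = start.elapsed();

    println!("checksum {checksum}, {:?} per iteration", elapsed / iterations);
}
```

Assuming the locally built compiler was linked under the toolchain name `stage1`, something like `cargo +stage1 build --release` would build this against the modified standard library, and the resulting binary can then be run under `perf stat` or cachegrind.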

Kobzol · 2022-10-04T20:57:46Z

src/development/perf-benchmarking.md

+* [disable ASLR](https://man7.org/linux/man-pages/man8/setarch.8.html)
+* [pinning](https://man7.org/linux/man-pages/man1/taskset.1.html) the benchmark process to a specific core
+* [disable clock boosts](https://wiki.archlinux.org/title/CPU_frequency_scaling#Configuring_frequency_boosting),
+  especially on thermal-limited systems such as laptops


We can also add a link to something like https://github.com/JuliaCI/BenchmarkTools.jl/blob/master/docs/src/linuxtips.md.

the8472 · 2022-10-04 (edited)

Some of those things may not be relevant to std benchmarks, which are mostly CPU- or memory-bandwidth-bound and single-threaded. They shouldn't suffer much from swap, IRQs or SMT siblings as long as the system is mostly idle, since those effects depend on system activity (well, that depends on how many cores one has... maybe core isolation is still worth it).

Scheduling and throttling have the biggest impact in my experience. If we had a benchmark that tried to do a parallel sort on a huge dataset, that would be a different story.

Adjusting the scaling governor is a good point.
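
The settings discussed in this thread (ASLR, core pinning, scaling governor) can be checked from inside a benchmark binary before it starts measuring. The sketch below is one possible pre-flight check for Linux, not something from the PR: the function names are made up, it assumes the `performance` governor is the one wanted, and it only inspects `cpu0`. It reads `/proc/self/personality`, `/proc/self/status` and the cpufreq sysfs file, and reports `false` when a file is missing or unreadable.

```rust
use std::fs;

/// Personality bit that is set when address randomization is turned off
/// for the process (ADDR_NO_RANDOMIZE in <linux/personality.h>).
const ADDR_NO_RANDOMIZE: u64 = 0x0040000;

/// Parses the hexadecimal contents of /proc/self/personality.
fn aslr_disabled(personality_hex: &str) -> bool {
    u64::from_str_radix(personality_hex.trim(), 16)
        .map(|bits| (bits & ADDR_NO_RANDOMIZE) != 0)
        .unwrap_or(false)
}

/// Looks up the Cpus_allowed_list field of /proc/self/status and reports
/// whether it names exactly one CPU (e.g. "2" rather than "0-7" or "0,4").
fn pinned_to_one_core(status: &str) -> bool {
    status
        .lines()
        .find_map(|line| line.strip_prefix("Cpus_allowed_list:"))
        .map(|list| {
            let list = list.trim();
            !list.is_empty() && list.chars().all(|c| c.is_ascii_digit())
        })
        .unwrap_or(false)
}

/// Assumption: "performance" is the governor wanted for benchmarking.
fn governor_is_performance(governor: &str) -> bool {
    governor.trim() == "performance"
}

fn main() {
    // Missing or unreadable files become empty strings, which every
    // check above treats as "not configured".
    let read = |path: &str| fs::read_to_string(path).unwrap_or_default();

    println!(
        "ASLR disabled:        {}",
        aslr_disabled(&read("/proc/self/personality"))
    );
    println!(
        "pinned to one core:   {}",
        pinned_to_one_core(&read("/proc/self/status"))
    );
    println!(
        "performance governor: {}",
        governor_is_performance(&read(
            "/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor"
        ))
    );
}
```

Because personality flags and CPU affinity are inherited by child processes, launching the binary through a wrapper such as `setarch "$(uname -m)" -R taskset -c 2 ./bench` (binary name and core number arbitrary) should make the first two checks print `true`. Clock boost is left out because the sysfs file that controls it depends on the cpufreq driver.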

jyn514 · 2023-02-18T16:21:01Z

Thank you!

jyn514 approved these changes Oct 3, 2022

Kobzol reviewed Oct 4, 2022

the8472 added 2 commits February 18, 2023 14:53

fbb1d07 add a page on optimizations and profiling

722cb2f
- reword vectorization section
- mention scaling governors
- linking stage0 as rustup toolchain is now supported

the8472 force-pushed the perf-docs branch from a0adbc4 to 722cb2f Compare February 18, 2023 14:38

jyn514 merged commit b61d0a2 into rust-lang:master Feb 18, 2023

github-actions bot pushed a commit that referenced this pull request Feb 18, 2023

publish: Merge pull request #45 from the8472/perf-docs

11e5a6d

generated from commit b61d0a2
