PERF: Performance improvement in to_csv with unused levels in multiindex #44943

Merged: 1 commit, Dec 17, 2021

phofl (Member) commented Dec 17, 2021

before 1.97 s ± 102 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
after 39.6 ms ± 3.1 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
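
The benchmark behind these timings is not shown in the pull request. As a rough sketch of the scenario named in the title, the snippet below builds a frame whose MultiIndex still carries level values that no row uses, writes it with `to_csv`, and compares that against the same frame after `MultiIndex.remove_unused_levels()`. The frame sizes, the column name `value`, and the helper `time_to_csv` are assumptions made for illustration, and this is not necessarily how the merged commit achieves the speedup.

```python
import io
import time

import numpy as np
import pandas as pd

# Hypothetical sizes, chosen only for illustration.
n_outer, n_inner = 1000, 100
index = pd.MultiIndex.from_product(
    [range(n_outer), range(n_inner)], names=["outer", "inner"]
)
df = pd.DataFrame({"value": np.arange(len(index))}, index=index)

# Slicing keeps the original level arrays, so most level values are now unused.
small = df.iloc[:100]


def time_to_csv(frame):
    """Hypothetical helper: write frame to an in-memory buffer and time it."""
    buf = io.StringIO()
    start = time.perf_counter()
    frame.to_csv(buf)
    return time.perf_counter() - start, buf.getvalue()


t_unused, csv_unused = time_to_csv(small)

# Same rows, but with the index levels trimmed to the values actually used.
trimmed = small.copy()
trimmed.index = trimmed.index.remove_unused_levels()
t_trimmed, csv_trimmed = time_to_csv(trimmed)

print(f"with unused levels:    {t_unused:.4f} s")
print(f"after trimming levels: {t_trimmed:.4f} s")
```

Both calls produce the same CSV text, so any difference in timing comes from the index levels rather than from the data written. On pandas versions that include this change, the two timings should be close.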

phofl added the IO CSV (read_csv, to_csv) and Performance (memory or execution speed performance) labels on Dec 17, 2021
jreback added this to the 1.4 milestone on Dec 17, 2021
jreback merged commit 94c08c4 into pandas-dev:master on Dec 17, 2021

jreback (Contributor) commented Dec 17, 2021

wow!

phofl deleted the 37484 branch on December 17, 2021 14:25
Successfully merging this pull request may close these issues.

BUG (Performance): Performance of to_csv varies significantly depending on when/how index is set