PERF: large perf regression in DataFrame repr #36636

Closed
jorisvandenbossche opened this issue Sep 25, 2020 · 1 comment · Fixed by #36638
Labels: Output-Formatting (__repr__ of pandas objects, to_string), Performance (memory or execution speed performance), Regression (functionality that used to work in a prior pandas version)
Milestone: 1.2

Comments

@jorisvandenbossche
Member

On master:

In [1]: import numpy as np
   ...: import pandas as pd

In [2]: df = pd.DataFrame(np.random.randn(1_000_000, 10))

In [3]: %timeit repr(df)
2.5 s ± 340 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

On the released version, this takes around 20 ms.
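
To reproduce the measurement outside IPython, a minimal sketch using the standard-library `timeit` module could look like the following. The helper name `time_repr` and the chosen row counts are illustrative assumptions, not part of the original report.

```python
import timeit

import numpy as np
import pandas as pd


def time_repr(n_rows: int, n_cols: int = 10, repeats: int = 3) -> float:
    """Return the best wall-clock time, in seconds, of one repr() call.

    Hypothetical helper for illustration; it mirrors the DataFrame
    construction used in the report.
    """
    df = pd.DataFrame(np.random.randn(n_rows, n_cols))
    # number=1: each measurement is a single repr() call, because one call
    # can already take seconds on an affected build.
    timings = timeit.repeat(lambda: repr(df), number=1, repeat=repeats)
    return min(timings)


if __name__ == "__main__":
    # Row counts are arbitrary; the report used 1_000_000 rows.
    for n_rows in (10_000, 1_000_000):
        elapsed_ms = time_repr(n_rows) * 1e3
        print(f"pandas {pd.__version__}: {n_rows:>9,} rows -> {elapsed_ms:.1f} ms")
```

Running the same script against the released version and against master should make the difference visible. A result for 1,000,000 rows that is far above the roughly 20 ms reported for the released version would point to the regression.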

cc @ivanovmg: I suppose this comes from #36434.

@jorisvandenbossche added the Performance, Output-Formatting, and Regression labels Sep 25, 2020
@jorisvandenbossche added this to the 1.2 milestone Sep 25, 2020
@ivanovmg
Member

@jorisvandenbossche, good catch. I'm working on a fix.
