-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
PERF: Melt 2x slower when future.infer_string option enabled #59657
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@maver1ck Thanks for the report! On main (and on my laptop), I see:
So for me it is actually two times faster (didn't check memory usage though) |
And testing with release pandas 2.2.2, I indeed see that it is slower with |
The same problem exists in Pandas 2.2.3. |
Pandas version checks
I have checked that this issue has not already been reported.
I have confirmed this issue exists on the latest version of pandas.
I have confirmed this issue exists on the main branch of pandas.
Reproducible Example
Installed Versions
Prior Performance
This code with
pd.options.future.infer_string = False
runs in:5.23 s ± 1.35 s per loop (mean ± std. dev. of 7 runs, 1 loop each)
Memory consumption is around 14 GB.
Enabling
pd.options.future.infer_string = True
makes it 2 times slower:10.6 s ± 40.9 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
Also memory consumption is bigger with peak around 25GB.
The text was updated successfully, but these errors were encountered: