You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When concatenating columns of the same dtype, even with copy=False option, the columns are consolidated together which involves a copy and a costly vstack. The performance is actually worse for copy=False than the default copy=True which is misleading.
There are use cases where consolidated data is not required for an application, so this unneeded performance penalty is undesired.
These kinds of changes can have non-obvious behavior changes to downstream operations so we need to be careful. But given that this is with copy=False I think changing this is relatively safe.
Problem description
When concatenating columns of the same dtype, even with
copy=False
option, the columns are consolidated together which involves a copy and a costlyvstack
. The performance is actually worse forcopy=False
than the defaultcopy=True
which is misleading.There are use cases where consolidated data is not required for an application, so this unneeded performance penalty is undesired.
Sample Program
Execution Time
Execution time is in seconds.
Problem Trace
The consolidation occurs as a result of:
located near "pandas/core/reshape/concat.py:499".
The text was updated successfully, but these errors were encountered: