These are the changes in pandas 2.0.3. See :ref:`release` for a full changelog including other versions of pandas.
{{ header }}
- Fixed performance regression in merging on datetime-like columns (:issue:`53231`)
- For external ExtensionArray implementations, restored the default use of
_values_for_factorize
for hashing arrays (:issue:`53475`)
- Bug in :func:`RangeIndex.union` when using
sort=True
with another :class:`RangeIndex` (:issue:`53490`) - Bug in :func:`read_csv` when defining
dtype
withbool[pyarrow]
for the"c"
and"python"
engines (:issue:`53390`) - Bug in :meth:`Series.str.split` and :meth:`Series.str.rsplit` with
expand=True
for :class:`ArrowDtype` withpyarrow.string
(:issue:`53532`) - Bug in indexing methods (e.g. :meth:`DataFrame.__getitem__`) where taking the entire :class:`DataFrame`/:class:`Series` would raise an
OverflowError
when Copy on Write was enabled and the length of the array was over the maximum size a 32-bit integer can hold (:issue:`53616`)
.. contributors:: v2.0.2..v2.0.3|HEAD