What's new in 2.3.0 (Month XX, 2024)

These are the changes in pandas 2.3.0. See :ref:`release` for a full changelog including other versions of pandas.

{{ header }}

Upcoming changes in pandas 3.0

Enhancements

enhancement1

Other enhancements

The semantics for the copy keyword in __array__ methods (i.e. called when using np.array() or np.asarray() on pandas objects) has been updated to work correctly with NumPy >= 2 (:issue:`57739`)
:meth:`Series.str.decode` result now has StringDtype when future.infer_string is True (:issue:`60709`)
:meth:`~Series.to_hdf` and :meth:`~DataFrame.to_hdf` now round-trip with StringDtype (:issue:`60663`)
Improved repr of :class:`.NumpyExtensionArray` to account for NEP51 (:issue:`61085`)
Reverted the minimum version for the sqlalchemy optional dependency back to 1.4.36 (:issue:`57049`)
The :meth:`Series.str.decode` has gained the argument dtype to control the dtype of the result (:issue:`60940`)
The :meth:`~Series.cumsum`, :meth:`~Series.cummin`, and :meth:`~Series.cummax` reductions are now implemented for StringDtype columns (:issue:`60633`)
The :meth:`~Series.sum` reduction is now implemented for StringDtype columns (:issue:`59853`)

Notable bug fixes

These are bug fixes that might have notable behavior changes.

notable_bug_fix1

API changes

When enabling the future.infer_string option: Index set operations (like union or intersection) will now ignore the dtype of an empty RangeIndex or empty Index with object dtype when determining the dtype of the resulting Index (:issue:`60797`)

Deprecations

Deprecated allowing non-bool values for na in :meth:`.str.contains`, :meth:`.str.startswith`, and :meth:`.str.endswith` for dtypes that do not already disallow these (:issue:`59615`)
Deprecated the "pyarrow_numpy" storage option for :class:`StringDtype` (:issue:`60152`)

Performance improvements

Bug fixes

Categorical

Datetimelike

Timedelta

Timezones

Numeric

Enabled :class:`Series.mode` and :class:`DataFrame.mode` with dropna=False to sort the result for all dtypes in the presence of NA values; previously only certain dtypes would sort (:issue:`60702`)

Conversion

Strings

Bug in :meth:`DataFrame.sum` with axis=1, :meth:`.DataFrameGroupBy.sum` or :meth:`.SeriesGroupBy.sum` with skipna=True, and :meth:`.Resampler.sum` on :class:`StringDtype` with all NA values resulted in 0 and is now the empty string "" (:issue:`60229`)
Bug in :meth:`Series.__pos__` and :meth:`DataFrame.__pos__` did not raise for :class:`StringDtype` with storage="pyarrow" (:issue:`60710`)
Bug in :meth:`Series.rank` for :class:`StringDtype` with storage="pyarrow" incorrectly returning integer results in case of method="average" and raising an error if it would truncate results (:issue:`59768`)
Bug in :meth:`Series.replace` with :class:`StringDtype` when replacing with a non-string value was not upcasting to object dtype (:issue:`60282`)
Bug in :meth:`Series.str.replace` when n < 0 for :class:`StringDtype` with storage="pyarrow" (:issue:`59628`)
Bug in ser.str.slice with negative step with :class:`ArrowDtype` and :class:`StringDtype` with storage="pyarrow" giving incorrect results (:issue:`59710`)
Bug in the center method on :class:`Series` and :class:`Index` object str accessors with pyarrow-backed dtype not matching the python behavior in corner cases with an odd number of fill characters (:issue:`54792`)

Interval

Indexing

Fixed bug in :meth:`Index.get_indexer` round-tripping through string dtype when infer_string is enabled (:issue:`55834`)

Missing

MultiIndex

I/O

:meth:`DataFrame.to_excel` was storing decimals as strings instead of numbers (:issue:`49598`)

Period

Plotting

Groupby/resample/rolling

Reshaping

Sparse

ExtensionArray

Styler

Other

Fixed usage of inspect when the optional dependencies pyarrow or jinja2 are not installed (:issue:`60196`)

Contributors