What's new in 1.1.0 (??)

These are the changes in pandas 1.1.0. See :ref:`release` for a full changelog including other versions of pandas.

Enhancements

Nonmonotonic PeriodIndex Partial String Slicing

:class:`PeriodIndex` now supports partial string slicing for non-monotonic indexes, mirroring :class:`DatetimeIndex` behavior (:issue:`31096`)

For example:

.. ipython:: python

   dti = pd.date_range("2014-01-01", periods=30, freq="30D")
   pi = dti.to_period("D")
   ser_monotonic = pd.Series(np.arange(30), index=pi)
   shuffler = list(range(0, 30, 2)) + list(range(1, 31, 2))
   ser = ser_monotonic[shuffler]
   ser

.. ipython:: python

   ser["2014"]
   ser.loc["May 2015"]

Fold argument support in Timestamp constructor

:class:`Timestamp:` now supports the keyword-only fold argument according to PEP 495 similar to parent datetime.datetime class. It supports both accepting fold as an initialization argument and inferring fold from other constructor arguments (:issue:`25057`, :issue:`31338`). Support is limited to dateutil timezones as pytz doesn't support fold.

For example:

.. ipython:: python

    ts = pd.Timestamp("2019-10-27 01:30:00+00:00")
    ts.fold

.. ipython:: python

    ts = pd.Timestamp(year=2019, month=10, day=27, hour=1, minute=30,
                      tz="dateutil/Europe/London", fold=1)
    ts

For more on working with fold, see :ref:`Fold subsection <timeseries.fold>` in the user guide.

Other enhancements

:class:`Styler` may now render CSS more efficiently where multiple cells have the same styling (:issue:`30876`)
:meth:`Styler.highlight_null` now accepts subset argument (:issue:`31345`)
When writing directly to a sqlite connection :func:`to_sql` now supports the multi method (:issue:`29921`)
OptionError is now exposed in pandas.errors (:issue:`27553`)

Other API changes

:meth:`Series.describe` will now show distribution percentiles for datetime dtypes, statistics first and last will now be min and max to match with numeric dtypes in :meth:`DataFrame.describe` (:issue:`30164`)
Added :meth:`DataFrame.value_counts` (:issue:`5377`)
:meth:`Groupby.groups` now returns an abbreviated representation when called on large dataframes (:issue:`1135`)
loc lookups with an object-dtype :class:`Index` and an integer key will now raise KeyError instead of TypeError when key is missing (:issue:`31905`)

Backwards incompatible API changes

:meth:`DataFrame.swaplevels` now raises a TypeError if the axis is not a :class:`MultiIndex`. Previously a AttributeError was raised (:issue:`31126`)
:meth:`DataFrameGroupby.mean` and :meth:`SeriesGroupby.mean` (and similarly for :meth:`~DataFrameGroupby.median`, :meth:`~DataFrameGroupby.std`` and :meth:`~DataFrameGroupby.var``) now raise a TypeError if a not-accepted keyword argument is passed into it. Previously a UnsupportedFunctionCall was raised (AssertionError if min_count passed into :meth:`~DataFrameGroupby.median``) (:issue:`31485`)
:meth:`DataFrame.at` and :meth:`Series.at` will raise a TypeError instead of a ValueError if an incompatible key is passed, and KeyError if a missing key is passed, matching the behavior of .loc[] (:issue:`31722`)

Failed Label-Based Lookups Always Raise KeyError

Label lookups series[key], series.loc[key] and frame.loc[key] used to raises either KeyError or TypeError depending on the type of key and type of :class:`Index`. These now consistently raise KeyError (:issue:`31867`)

.. ipython:: python

    ser1 = pd.Series(range(3), index=[0, 1, 2])
    ser2 = pd.Series(range(3), index=pd.date_range("2020-02-01", periods=3))

Previous behavior:

In [3]: ser1[1.5]
...
TypeError: cannot do label indexing on Int64Index with these indexers [1.5] of type float

In [4] ser1["foo"]
...
KeyError: 'foo'

In [5]: ser1.loc[1.5]
...
TypeError: cannot do label indexing on Int64Index with these indexers [1.5] of type float

In [6]: ser1.loc["foo"]
...
KeyError: 'foo'

In [7]: ser2.loc[1]
...
TypeError: cannot do label indexing on DatetimeIndex with these indexers [1] of type int

In [8]: ser2.loc[pd.Timestamp(0)]
...
KeyError: Timestamp('1970-01-01 00:00:00')

New behavior:

In [3]: ser1[1.5]
...
KeyError: 1.5

In [4] ser1["foo"]
...
KeyError: 'foo'

In [5]: ser1.loc[1.5]
...
KeyError: 1.5

In [6]: ser1.loc["foo"]
...
KeyError: 'foo'

In [7]: ser2.loc[1]
...
KeyError: 1

In [8]: ser2.loc[pd.Timestamp(0)]
...
KeyError: Timestamp('1970-01-01 00:00:00')

Deprecations

Lookups on a :class:`Series` with a single-item list containing a slice (e.g. ser[[slice(0, 4)]]) are deprecated, will raise in a future version. Either convert the list to tuple, or pass the slice directly instead (:issue:`31333`)
:meth:`DataFrame.mean` and :meth:`DataFrame.median` with numeric_only=None will include datetime64 and datetime64tz columns in a future version (:issue:`29941`)

Performance improvements

Performance improvement in :class:`Timedelta` constructor (:issue:`30543`)
Performance improvement in :class:`Timestamp` constructor (:issue:`30543`)

Bug fixes

Categorical

Bug where :func:`merge` was unable to join on non-unique categorical indices (:issue:`28189`)
Bug when passing categorical data to :class:`Index` constructor along with dtype=object incorrectly returning a :class:`CategoricalIndex` instead of object-dtype :class:`Index` (:issue:`32167`)

Datetimelike

Bug in :class:`Timestamp` where constructing :class:`Timestamp` from ambiguous epoch time and calling constructor again changed :meth:`Timestamp.value` property (:issue:`24329`)
:meth:`DatetimeArray.searchsorted`, :meth:`TimedeltaArray.searchsorted`, :meth:`PeriodArray.searchsorted` not recognizing non-pandas scalars and incorrectly raising ValueError instead of TypeError (:issue:`30950`)
Bug in :class:`Timestamp` where constructing :class:`Timestamp` with dateutil timezone less than 128 nanoseconds before daylight saving time switch from winter to summer would result in nonexistent time (:issue:`31043`)
Bug in :meth:`Period.to_timestamp`, :meth:`Period.start_time` with microsecond frequency returning a timestamp one nanosecond earlier than the correct time (:issue:`31475`)
:class:`Timestamp` raising confusing error message when year, month or day is missing (:issue:`31200`)

Timedelta

Bug in constructing a :class:`Timedelta` with a high precision integer that would round the :class:`Timedelta` components (:issue:`31354`)

Timezones

Numeric

Bug in :meth:`DataFrame.floordiv` with axis=0 not treating division-by-zero like :meth:`Series.floordiv` (:issue:`31271`)

Conversion

Bug in :class:`Series` construction from NumPy array with big-endian datetime64 dtype (:issue:`29684`)

Strings

Interval

Indexing

Bug in slicing on a :class:`DatetimeIndex` with a partial-timestamp dropping high-resolution indices near the end of a year, quarter, or month (:issue:`31064`)
Bug in :meth:`PeriodIndex.get_loc` treating higher-resolution strings differently from :meth:`PeriodIndex.get_value` (:issue:`31172`)
Bug in :meth:`Series.at` and :meth:`DataFrame.at` not matching .loc behavior when looking up an integer in a :class:`Float64Index` (:issue:`31329`)
Bug in :meth:`PeriodIndex.is_monotonic` incorrectly returning True when containing leading NaT entries (:issue:`31437`)
Bug in :meth:`DatetimeIndex.get_loc` raising KeyError with converted-integer key instead of the user-passed key (:issue:`31425`)
Bug in :meth:`Series.xs` incorrectly returning Timestamp instead of datetime64 in some object-dtype cases (:issue:`31630`)
Bug in :meth:`DataFrame.iat` incorrectly returning Timestamp instead of datetime in some object-dtype cases (:issue:`32809`)
Bug in :meth:`Series.loc` and :meth:`DataFrame.loc` when indexing with an integer key on a object-dtype :class:`Index` that is not all-integers (:issue:`31905`)

Missing

MultiIndex

Bug in :meth:`Dataframe.loc` when used with a :class:`MultiIndex`. The returned values were not in the same order as the given inputs (:issue:`22797`)

.. ipython:: python

        df = pd.DataFrame(np.arange(4),
                          index=[["a", "a", "b", "b"], [1, 2, 1, 2]])
        # Rows are now ordered as the requested keys
        df.loc[(['b', 'a'], [2, 1]), :]

Bug in :meth:`MultiIndex.intersection` was not guaranteed to preserve order when sort=False. (:issue:`31325`)

.. ipython:: python

        left = pd.MultiIndex.from_arrays([["b", "a"], [2, 1]])
        right = pd.MultiIndex.from_arrays([["a", "b", "c"], [1, 2, 3]])
        # Common elements are now guaranteed to be ordered by the left side
        left.intersection(right, sort=False)

I/O

Bug in :meth:`read_json` where integer overflow was occuring when json contains big number strings. (:issue:`30320`)
read_csv will now raise a ValueError when the arguments header and prefix both are not None. (:issue:`27394`)
Bug in :meth:`DataFrame.to_json` was raising NotFoundError when path_or_buf was an S3 URI (:issue:`28375`)
Bug in :meth:`DataFrame.to_parquet` overwriting pyarrow's default for coerce_timestamps; following pyarrow's default allows writing nanosecond timestamps with version="2.0" (:issue:`31652`).
Bug in :class:`HDFStore` that caused it to set to int64 the dtype of a datetime64 column when reading a DataFrame in Python 3 from fixed format written in Python 2 (:issue:`31750`)

Plotting

:func:`.plot` for line/bar now accepts color by dictonary (:issue:`8193`).
Bug in :meth:`DataFrame.boxplot` and :meth:`DataFrame.plot.boxplot` lost color attributes of medianprops, whiskerprops, capprops and medianprops (:issue:`30346`)

Groupby/resample/rolling

Bug in :meth:`GroupBy.apply` raises ValueError when the by axis is not sorted and has duplicates and the applied func does not mutate passed in objects (:issue:`30667`)
Bug in :meth:`DataFrameGroupby.transform` produces incorrect result with transformation functions (:issue:`30918`)

Reshaping

Bug effecting all numeric and boolean reduction methods not returning subclassed data type. (:issue:`25596`)
Bug in :meth:`DataFrame.pivot_table` when only MultiIndexed columns is set (:issue:`17038`)
Bug in :meth:`DataFrame.unstack` and :meth:`Series.unstack` can take tuple names in MultiIndexed data (:issue:`19966`)
Bug in :meth:`DataFrame.pivot_table` when margin is True and only column is defined (:issue:`31016`)
Fix incorrect error message in :meth:`DataFrame.pivot` when columns is set to None. (:issue:`30924`)
Bug in :func:`crosstab` when inputs are two Series and have tuple names, the output will keep dummy MultiIndex as columns. (:issue:`18321`)
:meth:`DataFrame.pivot` can now take lists for index and columns arguments (:issue:`21425`)
Bug in :func:`concat` where the resulting indices are not copied when copy=True (:issue:`29879`)

Sparse

ExtensionArray

Other

Appending a dictionary to a :class:`DataFrame` without passing ignore_index=True will raise TypeError: Can only append a dict if ignore_index=True instead of TypeError: Can only append a Series if ignore_index=True or if the Series has a name (:issue:`30871`)
Set operations on an object-dtype :class:`Index` now always return object-dtype results (:issue:`31401`)
Bug in :meth:`AbstractHolidayCalendar.holidays` when no rules were defined (:issue:`31415`)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.1.0.rst

v1.1.0.rst

What's new in 1.1.0 (??)

Enhancements

Nonmonotonic PeriodIndex Partial String Slicing

Fold argument support in Timestamp constructor

Other enhancements

Other API changes

Backwards incompatible API changes

Failed Label-Based Lookups Always Raise KeyError

Deprecations

Performance improvements

Bug fixes

Categorical

Datetimelike

Timedelta

Timezones

Numeric

Conversion

Strings

Interval

Indexing

Missing

MultiIndex

I/O

Plotting

Groupby/resample/rolling

Reshaping

Sparse

ExtensionArray

Other

Contributors

Files

v1.1.0.rst

Latest commit

History

v1.1.0.rst

File metadata and controls

What's new in 1.1.0 (??)

Enhancements

Nonmonotonic PeriodIndex Partial String Slicing

Fold argument support in Timestamp constructor

Other enhancements

Other API changes

Backwards incompatible API changes

Failed Label-Based Lookups Always Raise KeyError

Deprecations

Performance improvements

Bug fixes

Categorical

Datetimelike

Timedelta

Timezones

Numeric

Conversion

Strings

Interval

Indexing

Missing

MultiIndex

I/O

Plotting

Groupby/resample/rolling

Reshaping

Sparse

ExtensionArray

Other

Contributors