You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/source/install.rst
+3-1
Original file line number
Diff line number
Diff line change
@@ -286,7 +286,9 @@ Optional Dependencies
286
286
`xsel <http://www.vergenet.net/~conrad/software/xsel/>`__, or
287
287
`xclip <https://github.com/astrand/xclip/>`__: necessary to use
288
288
:func:`~pandas.read_clipboard`. Most package managers on Linux distributions will have ``xclip`` and/or ``xsel`` immediately available for installation.
289
-
* `pandas-gbq <https://pandas-gbq.readthedocs.io/en/latest/install.html#dependencies>`__: for Google BigQuery I/O.
* `Backports.lzma <https://pypi.org/project/backports.lzma/>`__: Only for Python 2, for writing to and/or reading from an xz compressed DataFrame in CSV; Python 3 support is built into the standard library.
Copy file name to clipboardExpand all lines: doc/source/whatsnew/v0.24.0.rst
+31-10
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,8 @@ New features
24
24
the user to override the engine's default behavior to include or omit the
25
25
dataframe's indexes from the resulting Parquet file. (:issue:`20768`)
26
26
- :meth:`DataFrame.corr` and :meth:`Series.corr` now accept a callable for generic calculation methods of correlation, e.g. histogram intersection (:issue:`22684`)
27
-
27
+
- :func:`DataFrame.to_string` now accepts ``decimal`` as an argument, allowing
28
+
the user to specify which decimal separator should be used in the output. (:issue:`23614`)
- New method :meth:`HDFStore.walk` will recursively walk the group hierarchy of an HDF5 file (:issue:`10932`)
266
270
- :func:`read_html` copies cell data across ``colspan`` and ``rowspan``, and it treats all-``th`` table rows as headers if ``header`` kwarg is not given and there is no ``thead`` (:issue:`17054`)
267
271
- :meth:`Series.nlargest`, :meth:`Series.nsmallest`, :meth:`DataFrame.nlargest`, and :meth:`DataFrame.nsmallest` now accept the value ``"all"`` for the ``keep`` argument. This keeps all ties for the nth largest/smallest value (:issue:`16818`)
@@ -287,6 +291,7 @@ Other Enhancements
287
291
- :meth:`read_excel()` now accepts ``usecols`` as a list of column names or callable (:issue:`18273`)
288
292
- :meth:`MultiIndex.to_flat_index` has been added to flatten multiple levels into a single-level :class:`Index` object.
289
293
- :meth:`DataFrame.to_stata` and :class:` pandas.io.stata.StataWriter117` can write mixed sting columns to Stata strl format (:issue:`23633`)
294
+
- :meth:`DataFrame.between_time` and :meth:`DataFrame.at_time` have gained the an ``axis`` parameter (:issue: `8839`)
290
295
291
296
.. _whatsnew_0240.api_breaking:
292
297
@@ -312,17 +317,19 @@ If installed, we now require:
312
317
+-----------------+-----------------+----------+
313
318
| bottleneck | 1.2.0 ||
314
319
+-----------------+-----------------+----------+
320
+
| fastparquet | 0.1.2 ||
321
+
+-----------------+-----------------+----------+
315
322
| matplotlib | 2.0.0 ||
316
323
+-----------------+-----------------+----------+
317
324
| numexpr | 2.6.1 ||
318
325
+-----------------+-----------------+----------+
319
-
| pytables | 3.4.2 ||
320
-
+-----------------+-----------------+----------+
321
-
| scipy | 0.18.1 ||
326
+
| pandas-gbq | 0.8.0 ||
322
327
+-----------------+-----------------+----------+
323
328
| pyarrow | 0.7.0 ||
324
329
+-----------------+-----------------+----------+
325
-
| fastparquet | 0.1.2 ||
330
+
| pytables | 3.4.2 ||
331
+
+-----------------+-----------------+----------+
332
+
| scipy | 0.18.1 ||
326
333
+-----------------+-----------------+----------+
327
334
328
335
Additionally we no longer depend on `feather-format` for feather based storage
@@ -1002,7 +1009,10 @@ Other API Changes
1002
1009
- Slicing a single row of a DataFrame with multiple ExtensionArrays of the same type now preserves the dtype, rather than coercing to object (:issue:`22784`)
1003
1010
- :class:`DateOffset` attribute `_cacheable` and method `_should_cache` have been removed (:issue:`23118`)
1004
1011
- Comparing :class:`Timedelta` to be less or greater than unknown types now raises a ``TypeError`` instead of returning ``False`` (:issue:`20829`)
1012
+
- :meth:`Categorical.searchsorted`, when supplied a scalar value to search for, now returns a scalar instead of an array (:issue:`23466`).
1013
+
- :meth:`Categorical.searchsorted` now raises a ``KeyError`` rather that a ``ValueError``, if a searched for key is not found in its categories (:issue:`23466`).
1005
1014
- :meth:`Index.hasnans` and :meth:`Series.hasnans` now always return a python boolean. Previously, a python or a numpy boolean could be returned, depending on circumstances (:issue:`23294`).
1015
+
- The order of the arguments of :func:`DataFrame.to_html` and :func:`DataFrame.to_string` is rearranged to be consistent with each other. (:issue:`23614`)
1006
1016
1007
1017
.. _whatsnew_0240.deprecations:
1008
1018
@@ -1029,6 +1039,9 @@ Deprecations
1029
1039
- :meth:`ExtensionArray._formatting_values` is deprecated. Use `ExtensionArray._formatter` instead. (:issue:`23601`)
1030
1040
- :func:`pandas.read_excel` has deprecated accepting ``usecols`` as an integer. Please pass in a list of ints from 0 to ``usecols`` inclusive instead (:issue:`23527`)
1031
1041
- Constructing a :class:`TimedeltaIndex` from data with ``datetime64``-dtyped data is deprecated, will raise ``TypeError`` in a future version (:issue:`23539`)
1042
+
- The ``keep_tz=False`` option (the default) of the ``keep_tz`` keyword of
1043
+
:meth:`DatetimeIndex.to_series` is deprecated (:issue:`17832`).
1044
+
- Timezone converting a tz-aware ``datetime.datetime`` or :class:`Timestamp` with :class:`Timestamp` and the ``tz`` argument is now deprecated. Instead, use :meth:`Timestamp.tz_convert` (:issue:`23579`)
- Improved performance of :func:`IndexEngine.get_indexer_non_unique` for sorted, non-unique indexes (:issue:`9466`)
1133
1146
- Improved performance of :func:`PeriodIndex.unique` (:issue:`23083`)
1134
1147
- Improved performance of :func:`pd.concat` for `Series` objects (:issue:`23404`)
1148
+
- Improved performance of :meth:`DatetimeIndex.normalize` and :meth:`Timestamp.normalize` for timezone naive or UTC datetimes (:issue:`23634`)
1149
+
- Improved performance of :meth:`DatetimeIndex.tz_localize` and various ``DatetimeIndex`` attributes with dateutil UTC timezone (:issue:`23772`)
1135
1150
1136
1151
1137
1152
.. _whatsnew_0240.docs:
@@ -1262,8 +1277,8 @@ Numeric
1262
1277
Strings
1263
1278
^^^^^^^
1264
1279
1265
-
-
1266
-
-
1280
+
- Bug in :meth:`Index.str.partition` was not nan-safe (:issue:`23558`).
1281
+
- Bug in :meth:`Index.str.split` was not nan-safe (:issue:`23677`).
1267
1282
-
1268
1283
1269
1284
Interval
@@ -1360,6 +1375,7 @@ Notice how we now instead output ``np.nan`` itself instead of a stringified form
1360
1375
- :func:`read_csv()` and func:`read_table()` will throw ``UnicodeError`` and not coredump on badly encoded strings (:issue:`22748`)
1361
1376
- :func:`read_csv()` will correctly parse timezone-aware datetimes (:issue:`22256`)
1362
1377
- Bug in :func:`read_csv()` in which memory management was prematurely optimized for the C engine when the data was being read in chunks (:issue:`23509`)
1378
+
- Bug in :func:`read_csv()` in unnamed columns were being improperly identified when extracting a multi-index (:issue:`23687`)
1363
1379
- :func:`read_sas()` will parse numbers in sas7bdat-files that have width less than 8 bytes correctly. (:issue:`21616`)
1364
1380
- :func:`read_sas()` will correctly parse sas7bdat files with many columns (:issue:`22628`)
1365
1381
- :func:`read_sas()` will correctly parse sas7bdat files with data page types having also bit 7 set (so page type is 128 + 256 = 384) (:issue:`16615`)
@@ -1369,8 +1385,12 @@ Notice how we now instead output ``np.nan`` itself instead of a stringified form
1369
1385
- Bug in :func:`DataFrame.to_string()` that caused representations of :class:`DataFrame` to not take up the whole window (:issue:`22984`)
1370
1386
- Bug in :func:`DataFrame.to_csv` where a single level MultiIndex incorrectly wrote a tuple. Now just the value of the index is written (:issue:`19589`).
1371
1387
- Bug in :meth:`HDFStore.append` when appending a :class:`DataFrame` with an empty string column and ``min_itemsize`` < 8 (:issue:`12242`)
1388
+
- Bug in :func:`read_csv()` in which memory leaks occurred in the C engine when parsing ``NaN`` values due to insufficient cleanup on completion or error (:issue:`21353`)
1389
+
- Bug in :func:`read_csv()` in which incorrect error messages were being raised when ``skipfooter`` was passed in along with ``nrows``, ``iterator``, or ``chunksize`` (:issue:`23711`)
1372
1390
- Bug in :meth:`read_csv()` in which :class:`MultiIndex` index names were being improperly handled in the cases when they were not provided (:issue:`23484`)
1391
+
- Bug in :meth:`read_csv()` in which unnecessary warnings were being raised when the dialect's values conflicted with the default arguments (:issue:`23761`)
1373
1392
- Bug in :meth:`read_html()` in which the error message was not displaying the valid flavors when an invalid one was provided (:issue:`23549`)
1393
+
- Bug in :meth:`read_excel()` in which extraneous header names were extracted, even though none were specified (:issue:`11733`)
1374
1394
- Bug in :meth:`read_excel()` in which ``index_col=None`` was not being respected and parsing index columns anyway (:issue:`20480`)
1375
1395
- Bug in :meth:`read_excel()` in which ``usecols`` was not being validated for proper column names when passed in as a string (:issue:`20480`)
1376
1396
@@ -1434,6 +1454,7 @@ Sparse
1434
1454
- Bug in ``DataFrame.groupby`` not including ``fill_value`` in the groups for non-NA ``fill_value`` when grouping by a sparse column (:issue:`5078`)
1435
1455
- Bug in unary inversion operator (``~``) on a ``SparseSeries`` with boolean values. The performance of this has also been improved (:issue:`22835`)
1436
1456
- Bug in :meth:`SparseArary.unique` not returning the unique values (:issue:`19595`)
1457
+
- Bug in :meth:`SparseArray.nonzero` and :meth:`SparseDataFrame.dropna` returning shifted/incorrect results (:issue:`21172`)
0 commit comments