doc/source/whatsnew/v0.17.0.txt
+80-69
@@ -14,6 +14,10 @@ users upgrade to this version.
 Highlights include:

 - Release the Global Interpreter Lock (GIL) on some cython operations, see :ref:`here <whatsnew_0170.gil>`
+- The default for ``to_datetime`` will now be to ``raise`` when presented with unparseable formats,
+  previously this would return the original input, see :ref:`here <whatsnew_0170.api_breaking.to_datetime>`
+- The default for ``dropna`` in ``HDFStore`` has changed to ``False``, to store by default all rows even
+  if they are all ``NaN``, see :ref:`here <whatsnew_0170.api_breaking.hdf_dropna>`
 - Development installed versions of pandas will now have ``PEP440`` compliant version strings (:issue:`9518`)

 Check the :ref:`API Changes <whatsnew_0170.api>` and :ref:`deprecations <whatsnew_0170.deprecations>` before updating.
@@ -51,9 +55,10 @@ as well as the ``.sum()`` operation.
 'data' : np.random.randn(N) })
 df.groupby('key')['data'].sum()

-Releasing of the GIL could benefit an application that uses threads for user interactions (e.g. ``QT``), or performing multi-threaded computations. A nice example of a library that can handle these types of computation-in-parallel is the dask_ library.
+Releasing of the GIL could benefit an application that uses threads for user interactions (e.g. QT_), or performing multi-threaded computations. A nice example of a library that can handle these types of computation-in-parallel is the dask_ library.

 .. _dask: https://dask.readthedocs.org/en/latest/
+.. _QT: https://wiki.python.org/moin/PyQt

 .. _whatsnew_0170.enhancements.other:

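The paragraph in this hunk says that releasing the GIL can help multi-threaded computations. A minimal sketch of what that might look like is given here, reusing the ``df.groupby('key')['data'].sum()`` call from the hunk. The frame sizes, the worker count and the ``group_sum`` helper are illustrative assumptions, not part of the change, and any actual speed-up depends on the pandas version and the machine.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np
import pandas as pd


def group_sum(df):
    # Hypothetical helper: the grouped sum shown in the release note example.
    return df.groupby("key")["data"].sum()


# Illustrative workload: four independent frames of arbitrary size.
rng = np.random.RandomState(0)
frames = [
    pd.DataFrame({"key": rng.randint(0, 10, size=100_000),
                  "data": rng.randn(100_000)})
    for _ in range(4)
]

# Run the same aggregation on each frame from a small thread pool.
with ThreadPoolExecutor(max_workers=4) as pool:
    threaded = list(pool.map(group_sum, frames))

# Serial reference results, computed one frame at a time.
serial = [group_sum(df) for df in frames]
```

The threaded and serial results should be identical. Only the wall-clock time is expected to differ, and only where the underlying operation really does run without holding the GIL.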
@@ -133,32 +138,35 @@ input as in previous versions. (:issue:`10636`)
 Furthermore, ``pd.to_timedelta`` has gained a similar API, of ``errors='raise'|'ignore'|'coerce'``. The ``coerce`` keyword
 has been deprecated in favor of ``errors='coerce'``.

 .. _whatsnew_0170.api_breaking.convert_objects:
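To make the ``errors`` keyword described in this hunk concrete, here is a short sketch covering ``errors='raise'`` and ``errors='coerce'`` for both ``pd.to_timedelta`` and ``pd.to_datetime``. The input strings are made up for illustration, and the ``'ignore'`` option is left out of the sketch.

```python
import pandas as pd

durations = ["1 days", "not a duration"]  # illustrative input
dates = ["2015-01-01", "not a date"]      # illustrative input

# errors='raise': an unparseable entry raises ValueError.
try:
    pd.to_timedelta(durations, errors="raise")
    timedelta_raised = False
except ValueError:
    timedelta_raised = True

# to_datetime raises by default as well, per the highlights above.
try:
    pd.to_datetime(dates)
    datetime_raised = False
except ValueError:
    datetime_raised = True

# errors='coerce': unparseable entries become NaT instead of raising.
coerced_durations = pd.to_timedelta(durations, errors="coerce")
coerced_dates = pd.to_datetime(dates, errors="coerce")
```

With ``errors='coerce'`` the valid entries are converted as usual and the invalid ones can be located afterwards with ``pd.isna``.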
@@ -337,71 +345,37 @@ Usually you simply want to know which values are null.
 None == None
 np.nan == np.nan

+.. _whatsnew_0170.api_breaking.hdf_dropna:

-.. _whatsnew_0170.api_breaking.other:
-
-Other API Changes
-^^^^^^^^^^^^^^^^^
-
-- Enable writing Excel files in :ref:`memory <_io.excel_writing_buffer>` using StringIO/BytesIO (:issue:`7074`)
-- Enable serialization of lists and dicts to strings in ExcelWriter (:issue:`8188`)
-- Allow passing `kwargs` to the interpolation methods (:issue:`10378`).
-- Serialize metadata properties of subclasses of pandas objects (:issue:`10553`).
-- ``Categorical.name`` was removed to make `Categorical` more ``numpy.ndarray`` like. Use ``Series(cat, name="whatever")`` instead (:issue:`10482`).
-- ``Categorical.unique`` now returns new ``Categorical`` which ``categories`` and ``codes`` are unique, rather than returnning ``np.array`` (:issue:`10508`)
-
-  - unordered category: values and categories are sorted by appearance order.
-  - ordered category: values are sorted by appearance order, categories keeps existing order.
default behavior for HDFStore write functions with ``format='table'`` is now to keep rows that are all missing except for index. Previously, the behavior was to drop rows that were all missing save the index. The previous behavior can be replicated using the ``dropna=True`` option. (:issue:`9382`)
378
354
379
-
- default behavior for HDFStore write functions with ``format='table'`` is now to keep rows that are all missing except for index. Previously, the behavior was to drop rows that were all missing save the index. The previous behavior can be replicated using the ``dropna=True`` option. (:issue:`9382`)
df_with_missing.to_hdf('file.h5', 'df_with_missing', format = 'table', mode='w')
414
-
388
+
415
389
pd.read_hdf('file.h5', 'df_with_missing')

 .. ipython:: python
 :suppress:

 os.remove('file.h5')

-See :ref:`documentation <io.hdf5>` for more details.
+See :ref:`documentation <io.hdf5>` for more details.
+
+.. _whatsnew_0170.api_breaking.other:
+
+Other API Changes
+^^^^^^^^^^^^^^^^^
+
+- Enable writing Excel files in :ref:`memory <_io.excel_writing_buffer>` using StringIO/BytesIO (:issue:`7074`)
+- Enable serialization of lists and dicts to strings in ExcelWriter (:issue:`8188`)
+- Allow passing `kwargs` to the interpolation methods (:issue:`10378`).
+- Serialize metadata properties of subclasses of pandas objects (:issue:`10553`).
+- ``Categorical.name`` was removed to make `Categorical` more ``numpy.ndarray`` like. Use ``Series(cat, name="whatever")`` instead (:issue:`10482`).
+- ``Categorical.unique`` now returns new ``Categorical`` which ``categories`` and ``codes`` are unique, rather than returning ``np.array`` (:issue:`10508`)
+
+  - unordered category: values and categories are sorted by appearance order.
+  - ordered category: values are sorted by appearance order, categories keeps existing order.
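The two ``Categorical`` entries above can be illustrated with a small sketch. The sample values are made up. How ``unique`` treats the ``categories`` themselves may differ between pandas versions, so the sketch only inspects the returned values.

```python
import pandas as pd

cat = pd.Categorical(["b", "a", "b", "c", "a"])  # illustrative data

# unique() returns a Categorical rather than a plain NumPy array.
uniques = cat.unique()

# The unique values come back in order of first appearance.
unique_values = list(uniques)

# A Categorical carries no name of its own; attach one via the Series.
s = pd.Series(cat, name="whatever")
```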
 - Bug that caused segfault when resampling an empty Series (:issue:`10228`)
 - Bug in ``DatetimeIndex`` and ``PeriodIndex.value_counts`` resets name from its result, but retains in result's ``Index``. (:issue:`10150`)
-- Bug in `pd.eval` using ``numexpr`` engine coerces 1 element numpy array to scalar (:issue:`10546`)
-- Bug in `pandas.concat` with ``axis=0`` when column is of dtype ``category`` (:issue:`10177`)
+- Bug in ``pd.eval`` using ``numexpr`` engine coerces 1 element numpy array to scalar (:issue:`10546`)
+- Bug in ``pd.concat`` with ``axis=0`` when column is of dtype ``category`` (:issue:`10177`)
 - Bug in ``read_msgpack`` where input type is not always checked (:issue:`10369`, :issue:`10630`)
-- Bug in `pandas.read_csv` with kwargs ``index_col=False``, ``index_col=['a', 'b']`` or ``dtype``
+- Bug in ``pd.read_csv`` with kwargs ``index_col=False``, ``index_col=['a', 'b']`` or ``dtype``
   (:issue:`10413`, :issue:`10467`, :issue:`10577`)
-- Bug in `Series.from_csv` with ``header`` kwarg not setting the ``Series.name`` or the ``Series.index.name`` (:issue:`10483`)
-- Bug in `groupby.var` which caused variance to be inaccurate for small float values (:issue:`10448`)
+- Bug in ``Series.from_csv`` with ``header`` kwarg not setting the ``Series.name`` or the ``Series.index.name`` (:issue:`10483`)
+- Bug in ``groupby.var`` which caused variance to be inaccurate for small float values (:issue:`10448`)
 - Bug in ``Series.plot(kind='hist')`` Y Label not informative (:issue:`10485`)
 - Bug in ``read_csv`` when using a converter which generates a ``uint8`` type (:issue:`9266`)
@@ -510,7 +521,7 @@ Bug Fixes
 - Reading "famafrench" data via ``DataReader`` results in HTTP 404 error because of the website url is changed (:issue:`10591`).
-- Bug in `read_msgpack` where DataFrame to decode has duplicate column names (:issue:`9618`)
+- Bug in ``read_msgpack`` where DataFrame to decode has duplicate column names (:issue:`9618`)
 - Bug in ``io.common.get_filepath_or_buffer`` which caused reading of valid S3 files to fail if the bucket also contained keys for which the user does not have read permission (:issue:`10604`)
 - Bug in vectorised setting of timestamp columns with python ``datetime.date`` and numpy ``datetime64`` (:issue:`10408`, :issue:`10412`)