Skip to content

Commit 97b7e7f

Browse files
committed
Merge pull request #4770 from cpcloud/read-html-refactor
REF/BUG/ENH/API: refactor read_html to use TextParser
2 parents ac0ce3c + bea34eb commit 97b7e7f

File tree

7 files changed

+5420
-427
lines changed

7 files changed

+5420
-427
lines changed

doc/source/release.rst

+13
Original file line numberDiff line numberDiff line change
@@ -167,6 +167,8 @@ Improvements to existing features
167167
- Improve support for converting R datasets to pandas objects (more
168168
informative index for timeseries and numeric, support for factors, dist, and
169169
high-dimensional arrays).
170+
- :func:`~pandas.read_html` now supports the ``parse_dates``,
171+
``tupleize_cols`` and ``thousands`` parameters (:issue:`4770`).
170172

171173
API Changes
172174
~~~~~~~~~~~
@@ -373,6 +375,8 @@ See :ref:`Internal Refactoring<whatsnew_0130.refactoring>`
373375
``core/generic.py`` (:issue:`4435`).
374376
- Refactor cum objects to core/generic.py (:issue:`4435`), note that these have a more numpy-like
375377
function signature.
378+
- :func:`~pandas.read_html` now uses ``TextParser`` to parse HTML data from
379+
bs4/lxml (:issue:`4770`).
376380

377381
.. _release.bug_fixes-0.13.0:
378382

@@ -538,6 +542,15 @@ Bug Fixes
538542
- Make sure series-series boolean comparions are label based (:issue:`4947`)
539543
- Bug in multi-level indexing with a Timestamp partial indexer (:issue:`4294`)
540544
- Tests/fix for multi-index construction of an all-nan frame (:isue:`4078`)
545+
- Fixed a bug where :func:`~pandas.read_html` wasn't correctly inferring
546+
values of tables with commas (:issue:`5029`)
547+
- Fixed a bug where :func:`~pandas.read_html` wasn't providing a stable
548+
ordering of returned tables (:issue:`4770`, :issue:`5029`).
549+
- Fixed a bug where :func:`~pandas.read_html` was incorrectly parsing when
550+
passed ``index_col=0`` (:issue:`5066`).
551+
- Fixed a bug where :func:`~pandas.read_html` was incorrectly infering the
552+
type of headers (:issue:`5048`).
553+
541554

542555
pandas 0.12.0
543556
-------------

0 commit comments

Comments
 (0)