Skip to content

Commit 58731c4

Browse files
mroeschkejorisvandenbossche
authored andcommitted
DOC: Remove outdated caveats for Anaconda and HTML parsing (#9032) (#14739)
1 parent d8e427b commit 58731c4

File tree

1 file changed

+0
-37
lines changed

1 file changed

+0
-37
lines changed

doc/source/gotchas.rst

-37
Original file line numberDiff line numberDiff line change
@@ -514,40 +514,6 @@ parse HTML tables in the top-level pandas io function ``read_html``.
514514
text from the URL over the web, i.e., IO (input-output). For very large
515515
tables, this might not be true.
516516

517-
**Issues with using** |Anaconda|_
518-
519-
* `Anaconda`_ ships with `lxml`_ version 3.2.0; the following workaround for
520-
`Anaconda`_ was successfully used to deal with the versioning issues
521-
surrounding `lxml`_ and `BeautifulSoup4`_.
522-
523-
.. note::
524-
525-
Unless you have *both*:
526-
527-
* A strong restriction on the upper bound of the runtime of some code
528-
that incorporates :func:`~pandas.io.html.read_html`
529-
* Complete knowledge that the HTML you will be parsing will be 100%
530-
valid at all times
531-
532-
then you should install `html5lib`_ and things will work swimmingly
533-
without you having to muck around with `conda`. If you want the best of
534-
both worlds then install both `html5lib`_ and `lxml`_. If you do install
535-
`lxml`_ then you need to perform the following commands to ensure that
536-
lxml will work correctly:
537-
538-
.. code-block:: sh
539-
540-
# remove the included version
541-
conda remove lxml
542-
543-
# install the latest version of lxml
544-
pip install 'git+git://github.com/lxml/lxml.git'
545-
546-
# install the latest version of beautifulsoup4
547-
pip install 'bzr+lp:beautifulsoup'
548-
549-
Note that you need `bzr <http://bazaar.canonical.com/en>`__ and `git
550-
<http://git-scm.com>`__ installed to perform the last two operations.
551517

552518
.. |svm| replace:: **strictly valid markup**
553519
.. _svm: http://validator.w3.org/docs/help.html#validation_basics
@@ -561,9 +527,6 @@ parse HTML tables in the top-level pandas io function ``read_html``.
561527
.. |lxml| replace:: **lxml**
562528
.. _lxml: http://lxml.de
563529

564-
.. |Anaconda| replace:: **Anaconda**
565-
.. _Anaconda: https://store.continuum.io/cshop/anaconda
566-
567530

568531
Byte-Ordering Issues
569532
--------------------

0 commit comments

Comments
 (0)