Specify columns [indices?] to return raw HTML on read_html()? #11921

Closed
jeremyabramson opened this issue Dec 29, 2015 · 1 comment
Labels
Dtype Conversions: Unexpected or buggy dtype conversions; Duplicate Report: Duplicate issue or pull request; IO HTML: read_html, to_html, Styler.apply, Styler.applymap

Comments

@jeremyabramson

As per this Stack Overflow question: http://stackoverflow.com/questions/34236870/pandas-read-html-to-return-raw-html-contents-for-certain-rows-cells-etc?noredirect=1#comment56359521_34236870 pandas returns only the HTML-stripped text of the tables it scrapes with read_html(). It would be nice to be able to specify columns for which the raw HTML is returned instead, for post-processing. This would let you keep references to images, grab links, etc.
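To make the request concrete, here is a minimal sketch of the desired behaviour written as a workaround outside pandas. It uses only the standard library's html.parser; the names RawCellTableParser, read_table_rows, and raw_columns are hypothetical and are not part of the pandas API. It assumes a single, non-nested table with explicitly closed cells.

```python
from html.parser import HTMLParser


class RawCellTableParser(HTMLParser):
    """Collect table rows; keep inner markup for selected column indices.

    Hypothetical helper, not part of pandas. Nested tables are not handled.
    """

    def __init__(self, raw_columns):
        super().__init__(convert_charrefs=True)
        self.raw_columns = set(raw_columns)
        self.rows = []
        self._row = None        # cells of the row currently being parsed
        self._cell = None       # text/markup fragments of the current cell
        self._keep_raw = False  # True while inside a "raw" column cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row, self._cell = [], None
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []
            # The column index is the number of cells already seen in this row.
            self._keep_raw = len(self._row) in self.raw_columns
        elif self._cell is not None and self._keep_raw:
            # Preserve the tag exactly as written, attributes included.
            self._cell.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <img ... />: emit once, no closing tag.
        if self._cell is not None and self._keep_raw:
            self._cell.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row, self._cell = None, None
        elif self._cell is not None and self._keep_raw:
            self._cell.append(f"</{tag}>")

    def handle_data(self, data):
        # Character references are decoded by the parser, so "raw" here
        # means tags and attributes are preserved, not byte-exact source.
        if self._cell is not None:
            self._cell.append(data)


def read_table_rows(html, raw_columns=()):
    """Return rows as lists of strings; raw_columns keep their inner markup."""
    parser = RawCellTableParser(raw_columns)
    parser.feed(html)
    parser.close()
    return parser.rows


html = """
<table>
  <tr><th>Name</th><th>Link</th></tr>
  <tr><td><b>pandas</b></td><td><a href="http://pandas.pydata.org">home</a></td></tr>
</table>
"""
rows = read_table_rows(html, raw_columns=[1])
```

Here column 0 comes back as stripped text and column 1 keeps its anchor tag, so the link target survives. The resulting list of rows could then be handed to pandas.DataFrame for the usual processing.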

jreback added the Dtype Conversions, Duplicate Report, and IO HTML labels on Dec 29, 2015

jreback (Contributor) commented Dec 29, 2015

This is a duplicate of #10534.

We need support for passing through and using the dtype and/or converters options in read_html().

Pull requests are welcome.

jreback closed this as completed on Dec 29, 2015