ENH: add dtype to read_xml #45341

MehdiALLALA · 2022-01-13T09:30:44Z

Is your feature request related to a problem?

pandas' column type detection is not always accurate. When reading XML files with read_xml, some columns are assigned a dtype that does not match the actual data type.

Describe the solution you'd like

The solution is to add a dtype option to read_xml so the caller can tell pandas which column types to use. This option already exists in read_json, read_csv, and many other readers.
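A minimal sketch of what the requested option could look like in use. It assumes a pandas version in which read_xml accepts a dtype argument (1.5.0 or later); the sample XML and the column names `code` and `qty` are hypothetical and only illustrate the leading-zero case.

```python
from io import StringIO

import pandas as pd

# Hypothetical sample data: "code" holds identifiers with leading zeros.
xml = """<data>
  <row><code>00123</code><qty>4</qty></row>
  <row><code>00456</code><qty>7</qty></row>
</data>"""

# Default type inference parses the identifiers as integers,
# which drops the leading zeros.
inferred = pd.read_xml(StringIO(xml), parser="etree")

# Assumption: pandas >= 1.5.0, where read_xml takes a dtype mapping.
# Forcing "code" to str keeps the text exactly as written in the file.
typed = pd.read_xml(StringIO(xml), parser="etree", dtype={"code": str})

print(inferred["code"].tolist())
print(typed["code"].tolist())
```

The `parser="etree"` argument is used here only so the sketch does not depend on lxml being installed.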

API breaking implications

This would let read_xml use the right type for each column and keep the data in its original format.

Describe alternatives you've considered

Writing a script that parses the XML file itself and builds the frame with the pandas.DataFrame(...) constructor, which has a dtype option.
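One way such a script might look, as a rough sketch rather than a recommended approach. It assumes a flat XML layout with one element per row; the sample data and the column names `code` and `qty` are hypothetical.

```python
import xml.etree.ElementTree as ET

import pandas as pd

# Hypothetical sample data with the same shape as a simple read_xml input.
xml = """<data>
  <row><code>00123</code><qty>4</qty></row>
  <row><code>00456</code><qty>7</qty></row>
</data>"""

root = ET.fromstring(xml)

# One dict per row element, keyed by child tag; every value is still text.
rows = [{child.tag: child.text for child in row} for row in root]

# Keep everything as strings first, then convert only the columns
# that are known to be numeric.
df = pd.DataFrame(rows, dtype=str).astype({"qty": "int64"})

print(df["code"].tolist())
print(df["qty"].tolist())
```

The DataFrame constructor accepts only a single dtype for the whole frame, so the per-column conversion is done with astype afterwards.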

The result loaded by read_xml does not match the XML file. The highlighted column is a string, but pandas reads it as an integer, which corrupts the data.

ParfaitG · 2022-01-16T21:21:59Z

Thanks, @MehdiALLALA. The dtype feature for read_xml was raised in a similar issue, #43567, and added to the issue tracker, #40131.

mroeschke · 2022-01-16T22:51:36Z

Closing as a duplicate of #43567

MehdiALLALA added the Enhancement and Needs Triage labels on Jan 13, 2022

mroeschke added the Dtype Conversions and IO XML labels and removed the Needs Triage label on Jan 14, 2022

mroeschke closed this as completed on Jan 16, 2022
