BUG: Load ORC-format data failed when pandas version>1.2.0.dev0 #40918
Labels
Bug
Dependencies
Required and optional dependencies
IO Data
IO issues that don't fit into a more specific label
Milestone
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
(optional) I have confirmed this bug exists on the master branch of pandas.
https://github.com/pandas-dev/pandas/blob/master/pandas/io/orc.py#L15-L54
Code Sample, a copy-pastable example
Problem description
Pandas uses PyArrow package to load ORC/Parquet data.
For the orc data format, it will use
pyarrow.orc.ORCFile
to read data (orc.py), but the PyArrow does not declareorc
in __init__.py file, so pandas will raise an AttributeError: module 'pyarrow' has no attribute 'orc'This bug will occur if the Pandas version is greater than v1.2.0.dev0(after commit-6d1541e). Before that,
pandas/io/orc.py
will declareimport pyarrow.orc
before uses pyarrow to load orc data(v1.1.5/pandas/io.orc.py/).Testing environment:
The text was updated successfully, but these errors were encountered: