Skip to content

ENH: Improve explanation when erroring on dta files #25968

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Apr 5, 2019

Conversation

bashtage
Copy link
Contributor

@bashtage bashtage commented Apr 2, 2019

Improve the explanation when value labels are repeated in Stata dta files.
Add suggested methods to work around the issue using the low level
interface.

closes #25772

@pep8speaks
Copy link

pep8speaks commented Apr 2, 2019

Hello @bashtage! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2019-04-04 21:18:34 UTC

@bashtage bashtage force-pushed the improve-cat-conversion-error branch from 3c35496 to 1e7243f Compare April 2, 2019 23:06
@codecov
Copy link

codecov bot commented Apr 2, 2019

Codecov Report

Merging #25968 into master will decrease coverage by <.01%.
The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master   #25968      +/-   ##
==========================================
- Coverage   91.84%   91.83%   -0.01%     
==========================================
  Files         175      175              
  Lines       52550    52550              
==========================================
- Hits        48266    48261       -5     
- Misses       4284     4289       +5
Flag Coverage Δ
#multiple 90.39% <ø> (ø) ⬆️
#single 41.9% <ø> (-0.07%) ⬇️
Impacted Files Coverage Δ
pandas/io/gbq.py 75% <0%> (-12.5%) ⬇️
pandas/core/frame.py 96.79% <0%> (-0.12%) ⬇️
pandas/util/testing.py 90.61% <0%> (-0.11%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4814a28...1e7243f. Read the comment docs.

@codecov
Copy link

codecov bot commented Apr 2, 2019

Codecov Report

Merging #25968 into master will decrease coverage by 0.01%.
The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master   #25968      +/-   ##
==========================================
- Coverage   91.86%   91.85%   -0.02%     
==========================================
  Files         175      175              
  Lines       52547    52543       -4     
==========================================
- Hits        48271    48262       -9     
- Misses       4276     4281       +5
Flag Coverage Δ
#multiple 90.4% <ø> (-0.01%) ⬇️
#single 41.9% <ø> (-0.07%) ⬇️
Impacted Files Coverage Δ
pandas/io/gbq.py 75% <0%> (-12.5%) ⬇️
pandas/core/frame.py 96.79% <0%> (-0.12%) ⬇️
pandas/util/testing.py 90.62% <0%> (-0.11%) ⬇️
pandas/io/formats/html.py 99.35% <0%> (-0.01%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 013f4b4...479a82a. Read the comment docs.

Copy link
Member

@WillAyd WillAyd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gfyoung thoughts here?

@WillAyd WillAyd added the IO Stata read_stata, to_stata label Apr 3, 2019
@bashtage bashtage force-pushed the improve-cat-conversion-error branch from 1e7243f to 53ad235 Compare April 3, 2019 07:15
@jreback jreback added this to the 0.25.0 milestone Apr 4, 2019
@jreback
Copy link
Contributor

jreback commented Apr 4, 2019

lgtm. @WillAyd

@WillAyd
Copy link
Member

WillAyd commented Apr 4, 2019

@bashtage can you fix merge conflict and ping on green?

@bashtage bashtage force-pushed the improve-cat-conversion-error branch from 53ad235 to 1c9c196 Compare April 4, 2019 19:38
Improve the explanation when value labels are repeated in Stata dta files.
Add suggested methods to workaround the issue using the low level
interface.

closes pandas-dev#25772
@bashtage bashtage force-pushed the improve-cat-conversion-error branch from 1c9c196 to 479a82a Compare April 4, 2019 21:18
@bashtage
Copy link
Contributor Author

bashtage commented Apr 4, 2019

@WillAyd green.

@jreback jreback merged commit 35156dc into pandas-dev:master Apr 5, 2019
@jreback
Copy link
Contributor

jreback commented Apr 5, 2019

thanks @bashtage

@bashtage bashtage deleted the improve-cat-conversion-error branch December 19, 2019 22:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
IO Stata read_stata, to_stata
Projects
None yet
Development

Successfully merging this pull request may close these issues.

can't read large stata file
5 participants