ENH: Add columns argument to read_feather() (#24025) #24034

nixphix · 2018-12-01T10:16:11Z

closes #24025 (Add columns-parameter like in feather.read_dataframe)
tests added / ~~passed~~
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

I have added a test case, but when running

pytest pandas/tests/io/test_feather.py

the test cases (including some already existing ones) fail with the error

NotImplementedError: > 1 ndim Categorical are not supported at this time

Let me know if this is expected or if I'm missing something.

pep8speaks · 2018-12-01T10:16:14Z

Hello @nixphix! Thanks for submitting the PR.

There are no PEP8 issues in the file pandas/io/feather_format.py !
There are no PEP8 issues in the file pandas/tests/io/test_feather.py !

codecov · 2018-12-01T10:53:44Z

Codecov Report

Merging #24034 into master will not change coverage.
The diff coverage is 66.66%.

@@           Coverage Diff           @@
##           master   #24034   +/-   ##
=======================================
  Coverage   42.46%   42.46%           
=======================================
  Files         161      161           
  Lines       51557    51557           
=======================================
  Hits        21892    21892           
  Misses      29665    29665

Flag	Coverage Δ
#single	`42.46% <66.66%> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/io/feather_format.py	`89.74% <66.66%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5b6b346...43afb8d. Read the comment docs.

codecov · 2018-12-01T10:53:44Z

Codecov Report

Merging #24034 into master will increase coverage by 49.86%.
The diff coverage is 66.66%.

@@             Coverage Diff             @@
##           master   #24034       +/-   ##
===========================================
+ Coverage   42.38%   92.25%   +49.86%     
===========================================
  Files         161      161               
  Lines       51701    51701               
===========================================
+ Hits        21914    47696    +25782     
+ Misses      29787     4005    -25782

Flag	Coverage Δ
#multiple	`90.65% <33.33%> (?)`
#single	`42.38% <66.66%> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/io/feather_format.py	`89.74% <66.66%> (ø)`	⬆️
pandas/core/computation/pytables.py	`92.37% <0%> (+0.3%)`	⬆️
pandas/io/pytables.py	`92.3% <0%> (+0.92%)`	⬆️
pandas/util/_test_decorators.py	`93.24% <0%> (+4.05%)`	⬆️
pandas/compat/__init__.py	`58.36% <0%> (+8.17%)`	⬆️
pandas/core/config_init.py	`99.24% <0%> (+9.84%)`	⬆️
pandas/core/reshape/util.py	`100% <0%> (+11.53%)`	⬆️
pandas/compat/numpy/__init__.py	`92.85% <0%> (+14.28%)`	⬆️
pandas/core/computation/common.py	`85.71% <0%> (+14.28%)`	⬆️
pandas/core/api.py	`100% <0%> (+14.81%)`	⬆️
... and 120 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 08395af...8a997fc. Read the comment docs.

pandas/tests/io/test_feather.py

TomAugspurger · 2018-12-01T18:10:24Z

Any reason to not just specify the columns as a list? It’d make the test easier to read.

In reply to @nixphix's comment on this pull request, in pandas/tests/io/test_feather.py:

@@ -74,6 +77,18 @@ def test_stringify_columns(self):
        df = pd.DataFrame(np.arange(12).reshape(4, 3)).copy()
        self.check_error_on_write(df, ValueError)

+    def test_read_columns(self):
+
+        df = pd.DataFrame({'col1': list('abc'),
+                           'col2': list(range(1, 4)),
+                           'col3': list('xyz'),
+                           'col4': list(range(4, 7))})
+        self.check_round_trip(df, columns=None)
+        self.check_round_trip(df, columns=df.columns)
+        random_cols = np.random.choice(df.columns, 2)

@nixphix wrote: @TomAugspurger I missed the replace=False argument, which ensures we get unique columns. This should work:

random_cols = np.random.choice(df.columns, 2, replace=False)
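For reference, a minimal sketch of the difference the `replace` argument makes when picking column names with `np.random.choice`. The column names mirror the test above; the variable names are illustrative only and are not part of the PR.

```python
import numpy as np

# Column labels mirroring the test DataFrame discussed above.
columns = np.array(["col1", "col2", "col3", "col4"])

# Default behaviour (replace=True): the same label may be drawn twice,
# so the two picks are not guaranteed to be distinct.
maybe_duplicated = np.random.choice(columns, 2)

# replace=False samples without replacement, so the picks are distinct.
unique_pick = np.random.choice(columns, 2, replace=False)

# Without replacement, asking for more labels than exist raises ValueError.
try:
    np.random.choice(columns, 5, replace=False)
except ValueError as exc:
    oversample_error = str(exc)
```

Even with `replace=False` the selection is random on every run, which is part of why a fixed list of columns makes the test easier to read and reproduce.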

nixphix · 2018-12-01T18:16:41Z

@TomAugspurger It's just that I'm not a fan of hard-coded values; I thought it might make the test case more dynamic and robust. I will change it to a list for better readability.

gfyoung

cc @jreback @TomAugspurger

TomAugspurger · 2018-12-02T21:25:21Z

pandas/tests/io/test_feather.py

@@ -74,6 +77,21 @@ def test_stringify_columns(self):
        df = pd.DataFrame(np.arange(12).reshape(4, 3)).copy()
        self.check_error_on_write(df, ValueError)

+    @pytest.mark.parametrize("columns", [


I hate to harp on this, but what are we testing by parametrizing this test? IMO, we should only be concerned with testing that pandas passes through the columns argument, so these tests seem redundant. I'd only keep the ['col1', 'col2'] test.

Are we worried about pyarrow breaking something, so this is some kind of integration test?

nixphix · 2018-12-03

Sure, let's not worry about how the integration works. I removed None and ['col1', 'col2', 'col3', 'col4'].
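A minimal sketch of what "testing that the argument is passed through" can look like, using only the standard library. `read_table` and `backend_reader` are hypothetical names invented for this illustration; they are not pandas or pyarrow APIs, and this is not the test that was merged.

```python
from unittest import mock


def read_table(path, columns=None, backend_reader=None):
    """Hypothetical thin wrapper: forwards `columns` to the backend unchanged."""
    return backend_reader(path, columns=columns)


def check_columns_passed_through():
    # Stand-in for the real backend; it records how it was called.
    fake_reader = mock.Mock(return_value="sentinel-frame")

    result = read_table("data.feather", columns=["col1", "col2"],
                        backend_reader=fake_reader)

    # The wrapper's only job is to forward the argument as given.
    fake_reader.assert_called_once_with("data.feather",
                                        columns=["col1", "col2"])
    return result
```

Under this view a single representative column list is enough, since how the backend handles None or the full column set is the backend's own responsibility to test.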

jreback · 2018-12-04T03:57:20Z

lgtm. @TomAugspurger

TomAugspurger · 2018-12-04T12:23:56Z

Thanks @nixphix!


TomAugspurger requested changes Dec 1, 2018

pandas/tests/io/test_feather.py (outdated, resolved)

TomAugspurger added the "IO Data" label (IO issues that don't fit into a more specific label) Dec 1, 2018

TomAugspurger added this to the 0.24.0 milestone Dec 1, 2018

gfyoung reviewed Dec 2, 2018

pandas/tests/io/test_feather.py (outdated, resolved)

gfyoung reviewed Dec 2, 2018

pandas/tests/io/test_feather.py (outdated, resolved)

gfyoung reviewed Dec 2, 2018

pandas/io/feather_format.py (resolved)

jreback removed this from the 0.24.0 milestone Dec 2, 2018

gfyoung approved these changes Dec 2, 2018

TomAugspurger reviewed Dec 2, 2018

nixphix added 5 commits December 4, 2018 00:36

ENH: Add columns argument to read_feather() (#24025)

8e419d3

Fix test case

12a42ea

Add Github issue number

3f5382a

Parameterize test case and shorten doc string

99d4aee

Remove unnecessary test cases

8a997fc

jreback added this to the 0.24.0 milestone Dec 4, 2018

TomAugspurger approved these changes Dec 4, 2018

TomAugspurger merged commit 72980fb into pandas-dev:master Dec 4, 2018

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

ENH: Add columns argument to read_feather() (pandas-dev#24025) (pandas-dev#24034)

20b902d

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

ENH: Add columns argument to read_feather() (pandas-dev#24025) (pandas-dev#24034)

373fa14
