added a compression argument to to_csv to be sent to _get_handle #2636

yoavram · 2013-01-04T08:09:45Z

Writing csv files with gzip compression is very useful as it both minimized the disk size taken by large data files, and can be easily read by R, often faster than the same csv file without compression.
Reading compressed csv files in R is done without any additional input by the user, and is also already supported in Pandas

Writing `csv` with `gzip` is very useful as it both minimized the disk size taken by large data files, and can be often read by *R* faster than the same `csv` file without compression. Also, reading by *R* is done without any effort by the user, and is also already supported in *Pandas*

wesm · 2013-01-19T23:39:56Z

Needs a test case

yoavram · 2013-01-20T04:50:29Z

Can you suggest where a test case should be written?

ghost · 2013-01-20T05:10:01Z

pandas/tests/test_frame.py has all the test_to_csv_* tests.

ghost · 2013-02-04T12:23:13Z

I intended to write a roundtrip test, but from_csv doesn't support compression
despite it's hiding down the call stack.
Should probably keep things consistent and update from_csv the same way.

yoavram · 2013-02-04T16:03:06Z

OK. This is on my task list, but unfortunately not high enough...
Hopefully I will get to it soon.

ghost · 2013-07-29T05:15:16Z

MIA.

yoavram · 2015-10-02T06:04:40Z

@y-p I'd like to reopen this and complete the PR.
I notice that read_csv quietly reads gzipped csv files, so if I understand correctly, I need to add a test for both to_csv using a roundtrip with read_csv in pandas/tests/test_frame.py.
Let me know if this is true and if this is still relevant.
Also - should I do anything because so much time passed, like rebase or just fork again and apply my minor changes, or would this be handled by you during merging?

jreback · 2015-10-02T10:59:20Z

you should rebase on master and open a new PR
quite a lot has changed since this issue came up

contributing docs are here

and this issue is #7615

jreback mentioned this pull request Jun 10, 2013

ENH: add ujson support in pandas.io.json #3804

Merged

ghost closed this Jul 29, 2013

yoavram mentioned this pull request Oct 2, 2015

ENH: added compression kw to to_csv GH7615 #11219

Merged

jreback modified the milestones: 0.17.1, Someday Oct 12, 2015

jreback added Enhancement IO CSV read_csv, to_csv labels Oct 12, 2015

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added a compression argument to to_csv to be sent to _get_handle #2636

added a compression argument to to_csv to be sent to _get_handle #2636

yoavram commented Jan 4, 2013

wesm commented Jan 19, 2013

yoavram commented Jan 20, 2013

ghost commented Jan 20, 2013

ghost commented Feb 4, 2013

yoavram commented Feb 4, 2013

ghost commented Jul 29, 2013

yoavram commented Oct 2, 2015

jreback commented Oct 2, 2015

added a compression argument to to_csv to be sent to _get_handle #2636

added a compression argument to to_csv to be sent to _get_handle #2636

Conversation

yoavram commented Jan 4, 2013

wesm commented Jan 19, 2013

yoavram commented Jan 20, 2013

ghost commented Jan 20, 2013

ghost commented Feb 4, 2013

yoavram commented Feb 4, 2013

ghost commented Jul 29, 2013

yoavram commented Oct 2, 2015

jreback commented Oct 2, 2015