SAS chunksize / iteration issues #14743

kshedden · 2016-11-25T15:10:12Z

tests added / passed
passes git diff upstream/master | flake8 --diff
whatsnew entry

jreback · 2016-11-25T16:17:33Z

pandas/io/tests/sas/test_sas7bdat.py

@@ -65,6 +65,32 @@ def test_from_iterator(self):
                df = rdr.read(3)
                tm.assert_frame_equal(df, df0.iloc[2:5, :])

+    def test_iterator_loop(self):
+        for j in 0, 1:


Can you add the issue references here?

jreback · 2016-11-25T16:17:51Z

pandas/io/tests/sas/test_sas7bdat.py

+                    y = 0
+                    for x in rdr:
+                        y += x.shape[0]
+                    assert(y == rdr.row_count)


Use self.assertTrue here instead of a bare assert.
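For readers without a SAS file at hand, the check under review can be sketched with a toy reader. This is only an illustration: `ToyChunkReader` and its internals are hypothetical and are not the pandas SAS reader; only the names `read` and `row_count` mirror the diff above. The sketch shows the property the test is after (iterating in chunks terminates and the chunk lengths add up to `row_count`) and uses a unittest assertion method rather than a bare `assert`.

```python
import unittest


class ToyChunkReader:
    """Hypothetical stand-in for a chunked file reader (not pandas code)."""

    def __init__(self, rows, chunksize):
        self._rows = list(rows)
        self.row_count = len(self._rows)
        self.chunksize = chunksize
        self._pos = 0

    def read(self, nrows=None):
        """Return up to nrows rows; an empty list once the data is exhausted."""
        if nrows is None:
            nrows = self.row_count
        chunk = self._rows[self._pos:self._pos + nrows]
        self._pos += len(chunk)
        return chunk

    def __iter__(self):
        return self

    def __next__(self):
        chunk = self.read(self.chunksize)
        if not chunk:
            # Stop cleanly instead of yielding empty chunks forever.
            raise StopIteration
        return chunk


class TestIteratorLoop(unittest.TestCase):
    def test_iterator_loop(self):
        # Chunk sizes that do and do not divide the row count evenly,
        # including one larger than the whole data set.
        for chunksize in 3, 5, 10, 11:
            rdr = ToyChunkReader(range(10), chunksize)
            total = 0
            for chunk in rdr:
                total += len(chunk)
            # Unlike a bare assert, this is not stripped under `python -O`
            # and it reports both values on failure.
            self.assertEqual(total, rdr.row_count)
```

`assertEqual` is used here as one option; `self.assertTrue(y == rdr.row_count)`, as requested in the review, would pass or fail on the same condition but with a less informative failure message.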

jreback · 2016-11-25T16:18:26Z

pls add to the whatsnew. ping when ready / green.

jorisvandenbossche · 2016-11-25T21:28:36Z

Question: this is not a problem in the xport reader? If so, is there a test that confirms this?

kshedden · 2016-11-25T22:04:03Z

It works there but I added a few tests.

jorisvandenbossche · 2016-11-25T22:05:10Z

@kshedden Thanks. You can move the whatsnew notice to the 0.19.2 file

jorisvandenbossche · 2016-11-25T22:08:07Z

pandas/io/tests/sas/test_sas7bdat.py

+                    fname = os.path.join(self.dirpath, "test%d.sas7bdat" % k)
+                    with open(fname, 'rb') as f:
+                        byts = f.read()
+                    buf = io.BytesIO(byts)


Is there a reason you read the file into a bytes object instead of just passing fname to read_sas?

kshedden · 2016-11-25

It seems to work either way, so I simplified all but one of the BytesIO tests to a direct file read.
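The two ways of feeding a reader discussed here can be compared with a small helper. This is a sketch under assumptions: `load_into_buffer` and `read_both_ways` are hypothetical names introduced for illustration, and `read_func` stands for any reader that accepts either a path or a file-like object.

```python
import io


def load_into_buffer(path):
    """Read a file fully into memory and wrap it in a seekable buffer."""
    with open(path, "rb") as f:
        return io.BytesIO(f.read())


def read_both_ways(path, read_func, **kwargs):
    """Call read_func once with the path and once with an in-memory buffer.

    Returns both results so the caller can compare them.
    """
    from_path = read_func(path, **kwargs)
    from_buffer = read_func(load_into_buffer(path), **kwargs)
    return from_path, from_buffer
```

With pandas this might be called as `read_both_ways(fname, pd.read_sas, format="sas7bdat", encoding="utf-8")`, followed by `tm.assert_frame_equal` on the two results. The explicit `format` matters for the buffer case, since there is no file extension to infer it from.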

codecov-io · 2016-11-26T08:05:18Z

Current coverage is 85.22% (diff: 0.00%)

Merging #14743 into master will increase coverage by <.01%

@@             master     #14743   diff @@
==========================================
  Files           143        143          
  Lines         50807      50857    +50   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits          43297      43344    +47   
- Misses         7510       7513     +3   
  Partials          0          0

Powered by Codecov. Last update d8e427b...28d4038

jorisvandenbossche · 2016-11-26T09:22:08Z

pandas/io/tests/sas/test_sas7bdat.py

-                with open(fname, 'rb') as f:
-                    byts = f.read()
-                buf = io.BytesIO(byts)
-                df = pd.read_sas(buf, format="sas7bdat", encoding='utf-8')


I think it is more logical to keep the buffer reading here: this test is called "test_from_buffer", so I suppose that is exactly what it is testing (and then maybe use plain reading from a file in "test_iterator_read_too_much").

jorisvandenbossche · 2016-11-28T09:49:08Z

@kshedden Thanks for the quick fix!

closes #14734 closes #13654 (cherry picked from commit c5f219a)

jreback added this to the Next Major Release milestone Nov 25, 2016

jreback added the Bug and IO SAS (SAS: read_sas) labels Nov 25, 2016

jreback changed the title from "Fix 14734" to "SAS chunksize / iteration issues" Nov 25, 2016

jreback reviewed Nov 25, 2016

jreback modified the milestones: 0.19.2, Next Major Release Nov 25, 2016

jorisvandenbossche reviewed Nov 25, 2016

kshedden added 5 commits November 25, 2016 17:28

Fix 14734

9d6b61f

Added to whatsnew

e8327e0

Add iterator tests for xport

4504df5

Moved whatsnew to 19.2

a7b7da8

Bypass ioreader

8c1e17e

kshedden force-pushed the sas_iterator branch from 18c1c75 to 8c1e17e Compare November 25, 2016 22:30

jorisvandenbossche reviewed Nov 26, 2016

Minor change to tests

28d4038

jorisvandenbossche merged commit c5f219a into pandas-dev:master Nov 28, 2016

jorisvandenbossche pushed a commit that referenced this pull request Dec 15, 2016

[Backport #14743] BUG: SAS chunksize / iteration issues (#14743)

6c688b9

closes #14734 closes #13654 (cherry picked from commit c5f219a)
