Bug: Made it so that 0 was included in uint8 #14412

verhalenn · 2016-10-13T04:25:29Z

closes BUG: pd.to_numeric(..., downcast='unsigned') should accept 0 #14401
tests added / passed
passes git diff upstream/master | flake8 --diff
whatsnew entry

Decided to restart. Sorry for the inconvenience. -> #14472

sinhrks · 2016-10-13T07:51:57Z

Thanks for the PR. Please rebase the existing PR from next time.

sinhrks · 2016-10-13T07:53:32Z

doc/source/whatsnew/v0.19.1.txt

@@ -45,3 +45,4 @@ Bug Fixes

 - Bug in ``pd.concat`` where names of the ``keys`` were not propagated to the resulting ``MultiIndex`` (:issue:`14252`)
 - Bug in ``MultiIndex.set_levels`` where illegal level values were still set after raising an error (:issue:`13754`)
+- Bug in ``pd.to_numeric`` where it would not downcast a 0 to a uint8 (:issue:`14404`)


Please reference #14401 here. Also, the issue applies to unsigned dtypes in general, not only to uint8.
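For reference, a minimal sketch of the behaviour under discussion, assuming a pandas build that contains the fix. The variable names are made up for illustration. It only shows that a 0 in the data should not block an unsigned downcast, and that the target is the smallest unsigned dtype that can hold the values.

```python
import numpy as np
import pandas as pd

# Data containing 0: with the fix, 0 counts as a valid unsigned value,
# so the smallest unsigned dtype is chosen.
with_zero = pd.to_numeric([0, 1, 2, 3], downcast='unsigned')
print(with_zero.dtype)

# The same applies to wider unsigned dtypes, not only uint8:
# 70000 does not fit in uint8 or uint16.
wide = pd.to_numeric([0, 70000], downcast='unsigned')
print(wide.dtype)

# Negative values cannot be represented as unsigned, so the data is
# left as a signed integer dtype.
negative = pd.to_numeric([-1, 0, 1], downcast='unsigned')
print(negative.dtype)
```

The last case is there only as a contrast: an unsigned downcast is skipped when any value is negative.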

codecov-io · 2016-10-13T08:51:59Z

Current coverage is 85.26% (diff: 100%)

Merging #14412 into master will increase coverage by <.01%

@@             master     #14412   diff @@
==========================================
  Files           140        140          
  Lines         50631      50634     +3   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits          43167      43173     +6   
+ Misses         7464       7461     -3   
  Partials          0          0

Powered by Codecov. Last update 7cad3f1...2b2622c

jreback · 2016-10-13T10:07:30Z

pandas/tools/tests/test_util.py

@@ -391,6 +391,14 @@ def test_downcast(self):
            res = pd.to_numeric(data, downcast=downcast)
            tm.assert_numpy_array_equal(res, expected)

+        #check that 0 works as a unsigned downcast
+
+        data = [0, 1, 2, 3]


check this for all of the downcast parameters (where it should be included)

Did you want me to just edit everything between line 327 and line 368 in that test to include zeros in the arrays? Trying it before and after the fix, it fails before and passes after. That said, if you'd rather I not touch the existing cases, it's relatively easy to add a new test. Also, is there a reason that lines 340 and 341 are redundant?

verhalenn · 2016-10-14T01:58:55Z

Added a series of tests that pass a pandas Series containing the minimum and maximum values of each dtype to to_numeric with a downcast, to check that it downcasts to the appropriate dtype. It did not work for uint64, but I'm not sure whether there were plans to include uint64, so I'll open a new issue. Also edited the whatsnew.

jreback · 2016-10-14T10:25:07Z

doc/source/whatsnew/v0.19.1.txt

@@ -45,3 +45,4 @@ Bug Fixes

 - Bug in ``pd.concat`` where names of the ``keys`` were not propagated to the resulting ``MultiIndex`` (:issue:`14252`)
 - Bug in ``MultiIndex.set_levels`` where illegal level values were still set after raising an error (:issue:`13754`)
+- Bug in ``pd.to_numeric`` where it would not downcast a 0 properly. (:issue:`14401`)


say that this happens when downcast='unsigned' is passed as a kwarg

jreback · 2016-10-14T10:25:50Z

pandas/tools/tests/test_util.py

@@ -401,6 +401,32 @@ def test_downcast(self):
            res = pd.to_numeric(data, downcast=downcast)
            tm.assert_numpy_array_equal(res, expected)

+        # check that the smallest and largest values in each integer type pass to each type.
+        int8 = [-128, 127]


don't do this explicitly, use np.iinfo(dtype).min/max as these vary by platform

actually on second thought, might be ok if appveyor passes

can you do this in a loop instead (eg. a loop of tuples of the start,stop, dtype)

I was actually thinking of a dictionary, if that's okay. I believe the numpy types are standard across platforms, but I'm not positive, which is why I didn't bother with np.iinfo(dtype).min/max; for safety's sake I'll include it.

I also added a series of other tests, including ones that make sure the result shifts to the next dtype once a value goes past the current dtype's maximum, and once again edited the whatsnew.
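A rough sketch of the boundary check described above, not the code from the PR. The dictionary `next_wider` and the `results` mapping are hypothetical names; the bounds come from np.iinfo rather than hard-coded literals.

```python
import numpy as np
import pandas as pd

# Hypothetical mapping: unsigned dtype name -> the next wider dtype name.
next_wider = {'uint8': 'uint16', 'uint16': 'uint32'}

results = {}
for name, wider in next_wider.items():
    top = np.iinfo(np.dtype(name)).max  # largest value the dtype can hold

    # At the maximum, the values still fit in the current dtype.
    at_max = pd.to_numeric([0, top], downcast='unsigned')
    # One past the maximum, the result should shift to the next dtype.
    past_max = pd.to_numeric([0, top + 1], downcast='unsigned')

    results[name] = (at_max.dtype.name, past_max.dtype.name)
    assert at_max.dtype == name
    assert past_max.dtype == wider

print(results)
```

uint32 and uint64 are left out of this sketch on purpose, since uint64 handling was an open question in this thread.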

jreback · 2016-10-19T12:35:54Z

doc/source/whatsnew/v0.19.1.txt

@@ -48,3 +48,6 @@ Bug Fixes
 - Bug in ``MultiIndex.set_levels`` where illegal level values were still set after raising an error (:issue:`13754`)
 - Bug in ``DataFrame.to_json`` where ``lines=True`` and a value contained a ``}`` character (:issue:`14391`)
 - Bug in ``df.groupby`` causing an ``AttributeError`` when grouping a single index frame by a column and the index level (:issue`14327`)
+- Bug in ``pd.to_numeric`` where it would not downcast a 0 to a uint8 (:issue:`14404`)


just make a single entry

jreback · 2016-10-19T12:38:20Z

pandas/tools/tests/test_util.py

@@ -401,6 +401,62 @@ def test_downcast(self):
            res = pd.to_numeric(data, downcast=downcast)
            tm.assert_numpy_array_equal(res, expected)

+        # check that the smallest and largest values in each integer type pass to each type.


this is still overly verbose. do something like:

checks = [('int8', 'integer', [np.iinfo(np.int8).min, np.iinfo(np.int8).max]), ..... ]
for dtype, downcast, min_max in checks:
    ....
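One possible way to fill in that pattern, as a sketch only. Every entry after the int8 one and the loop body are assumptions made for illustration, not the code that was eventually merged; uint64 is omitted because its handling was unresolved in this thread.

```python
import numpy as np
import pandas as pd

# (expected dtype, downcast kwarg, [min, max] for that dtype)
# Only the int8 entry comes from the review comment; the rest are assumed.
checks = [
    ('int8', 'integer', [np.iinfo(np.int8).min, np.iinfo(np.int8).max]),
    ('int16', 'integer', [np.iinfo(np.int16).min, np.iinfo(np.int16).max]),
    ('int32', 'integer', [np.iinfo(np.int32).min, np.iinfo(np.int32).max]),
    ('uint8', 'unsigned', [np.iinfo(np.uint8).min, np.iinfo(np.uint8).max]),
    ('uint16', 'unsigned', [np.iinfo(np.uint16).min, np.iinfo(np.uint16).max]),
    ('uint32', 'unsigned', [np.iinfo(np.uint32).min, np.iinfo(np.uint32).max]),
]

observed = []
for dtype, downcast, min_max in checks:
    # The Series starts out as int64; the downcast should shrink it to
    # the smallest dtype that holds both extremes.
    series = pd.to_numeric(pd.Series(min_max), downcast=downcast)
    observed.append(series.dtype.name)
    assert series.dtype == dtype, (dtype, series.dtype)
```

Each unsigned entry has 0 as its minimum, so the loop also covers the original 0 case from #14401.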

verhalenn changed the title ~~Issue14401~~ Bug: Made it so that 0 was included in uint8 Oct 13, 2016

sinhrks reviewed Oct 13, 2016

sinhrks added the Bug and Dtype Conversions labels Oct 13, 2016

jorisvandenbossche added this to the 0.19.1 milestone Oct 13, 2016

jreback reviewed Oct 13, 2016

verhalenn force-pushed the issue14401 branch from 2af6bfc to 2b2622c Compare October 14, 2016 01:49

jreback reviewed Oct 14, 2016

Nicholas Ver Halen added 7 commits October 17, 2016 18:21

Made it so that 0 was included in uint8

3d2ce5b

Added a test to check uint8 with 0

81b4965

Added release note to issue 14401 resolve.

b6331a5

Edited mistakes in whatsnew

8a836b2

Added tests for the max and min values of all dtypes to to_numeric

3427e4f

Changed the tests so that it iterated through a dictionary.

0da1918

I also added a series of other tests that included making sure that a dtype shifted dtype once it reached the next value and once again edited the whatsnew.

Changed the test to work with python 3.x

c6be0eb

verhalenn force-pushed the issue14401 branch from 7fd0a16 to c6be0eb Compare October 18, 2016 00:16

jreback reviewed Oct 19, 2016

verhalenn closed this Oct 21, 2016

verhalenn deleted the issue14401 branch October 21, 2016 23:55

verhalenn mentioned this pull request Oct 22, 2016

BUG: downcast = 'unsigend' on 0 would would not downcast to unsigned. #14472

Closed

4 tasks
