Random bad asserts for stat ops when running tests. #6982

dalejung · 2014-04-27T16:12:57Z

======================================================================
FAIL: test_sum (pandas.tests.test_frame.TestDataFrame)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/travis/virtualenv/python2.6_with_system_site_packages/lib/python2.6/site-packages/pandas/tests/test_frame.py", line 10590, in test_sum
    has_numeric_only=True, check_dtype=False, check_less_precise=True)
  File "/home/travis/virtualenv/python2.6_with_system_site_packages/lib/python2.6/site-packages/pandas/tests/test_frame.py", line 10780, in _check_stat_op
    check_less_precise=check_less_precise)  # HACK: win32
  File "/home/travis/virtualenv/python2.6_with_system_site_packages/lib/python2.6/site-packages/pandas/util/testing.py", line 513, in assert_series_equal
    assert_almost_equal(left.values, right.values, check_less_precise)
  File "testing.pyx", line 58, in pandas._testing.assert_almost_equal (pandas/src/testing.c:2554)
  File "testing.pyx", line 93, in pandas._testing.assert_almost_equal (pandas/src/testing.c:1796)
  File "testing.pyx", line 140, in pandas._testing.assert_almost_equal (pandas/src/testing.c:2387)
AssertionError: expected 0.00144 but got 0.00144

dalejung · 2014-04-27T16:31:50Z

@jreback
It looks like the issue is that nanops specifies dtype_max when calling the ops. ff7bb2c seems to be the culprit.

In the case that I found, the unit test is checking a float32 frame against that same frame upcast to float64.

http://nbviewer.ipython.org/gist/anonymous/11349526

jreback · 2014-04-27T16:47:40Z

it could be but this error has been around a while actually

the issue is that np.sum is used as the comparison which should be passing .sum(dtype='float32') in this case

(the actual pandas routines are correct) after the fix above

jreback · 2014-04-27T16:48:20Z

pls submit a PR for this if you can (you can pass lambda x: np.sum(x, dtype='float32') instead of np.sum, I think). This is sort of a 'numpy' issue really, as np.sum is doing the wrong thing here
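For the lambda suggested above to work, the array has to be passed through to np.sum along with the dtype. A minimal sketch of that idea follows; `sum_as` and `values` are hypothetical names used only for illustration, and this is not the code that ended up in #6985.

```python
import numpy as np


def sum_as(dtype):
    """Return a reducer that sums with an explicit accumulator/result dtype."""
    # Hypothetical helper: wraps np.sum so the dtype is fixed up front.
    return lambda x: np.sum(x, dtype=dtype)


# Small float32 input whose sum is exactly representable in both dtypes.
values = np.array([0.25, 0.5, 1.0], dtype="float32")

default_sum = np.sum(values)          # no dtype given: result stays float32
sum32 = sum_as("float32")(values)     # explicit float32 accumulator
sum64 = sum_as("float64")(values)     # explicit float64 accumulator

print(default_sum.dtype, sum32.dtype, sum64.dtype)
```

A reducer built this way can be handed to a test helper wherever a bare np.sum was passed before, so both sides of the comparison use the same dtype.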

jreback · 2014-04-27T19:12:59Z

@dalejung I put up #6985, I *think* this should fix it... can you reproduce reliably?

jreback · 2014-04-27T22:17:02Z

@dalejung if you notice any more pls lmk

jreback · 2014-04-28T00:51:14Z

@dalejung not sure my fix actually fixed this....!

dalejung · 2014-05-02T16:31:22Z

@jreback Hey, the last PR fixes this reliably. Just to clarify, if I have an array of float32, using a float64 accumulator is the correct behavior?

jreback · 2014-05-02T16:33:54Z

you can get away with the default accumulator on 64-bit systems because the default actually IS float64; however on 32-bit it breaks, but it will STILL work as long as it doesn't overflow.

so you need an overflow on 32-bit to fail, BUT using a 64-bit accumulator is always safe, I think (and that's what I did)
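The overflow case can be sketched as follows, reading "32-bit" here as a float32 accumulator (an assumption on my part; the comment may refer to the platform instead). The array `big` is made up for illustration: each value fits in float32, but their sum does not.

```python
import numpy as np

# Each element is below the float32 maximum (about 3.4e38),
# but the sum of the two is above it.
big = np.array([3e38, 3e38], dtype="float32")

with np.errstate(over="ignore"):  # silence the expected overflow warning
    narrow = np.sum(big, dtype="float32")   # overflows to inf

wide = np.sum(big, dtype="float64")         # stays finite in float64

print(narrow, wide)
```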

dalejung · 2014-05-02T16:57:02Z

hm, I wasn't even thinking about overflow :/. I was more concerned about the output being different based on the accumulator. Not sure which output is technically correct. Like, the float64 result is obviously more accurate, but I wasn't sure if there was an expectation of using float32 throughout the process.

jreback · 2014-05-02T17:10:51Z

I think it should be the same (though precision could affect it), so they could be slightly different if accumulating really small numbers (that barely fit in float32). I would just always use float64, unless you have a really good reason.
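A small sketch of how the accumulator dtype alone can change the result when the terms are tiny relative to the running total. It uses an explicit Python loop rather than np.sum, since np.sum may add elements in a different order internally; the values `tiny` and `n` are chosen purely for illustration.

```python
import numpy as np

# 2**-25 is below half the float32 spacing just above 1.0 (which is 2**-23),
# so adding it to a float32 value of 1.0 rounds straight back to 1.0.
tiny = np.float32(2.0 ** -25)
n = 1024

acc32 = np.float32(1.0)
acc64 = np.float64(1.0)
for _ in range(n):
    acc32 = acc32 + tiny               # float32 accumulator: every add is lost
    acc64 = acc64 + np.float64(tiny)   # float64 accumulator: every add is kept

print(acc32, acc64)
```

The same inputs give 1.0 with the float32 accumulator and 1.0 + 2**-15 with the float64 one, which is the kind of gap an exact or tight comparison in a test would trip over.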

dalejung · 2014-05-02T17:13:04Z

Agreed. I always use float64 throughout so this is the first time I've given it any thought.

jreback added the Testing label Apr 27, 2014

jreback added this to the 0.14.0 milestone Apr 27, 2014

jreback mentioned this issue Apr 27, 2014

TST: test_frame/test_sum not comparing correctly on smaller sized dtypes (GH6982) #6985

Merged

jreback closed this as completed in #6985 Apr 27, 2014

jreback mentioned this issue Apr 28, 2014

TST: fix checking for less_precise in floats #6990

Merged
