CI: Fix flaky test_value_counts_null #32449

SaturnFromTitan · 2020-03-04T21:21:24Z

As mentioned by @simonjayhawkins in #32438

Trace of the failing test:
https://dev.azure.com/pandas-dev/pandas/_build/results?buildId=29965&view=logs&j=077026cf-93c0-54aa-45e0-9996ba75f6f7&t=e95cf409-86ae-5b4d-6c5f-79395ef75e8f

simonjayhawkins · 2020-03-05T11:31:09Z

pandas/tests/base/test_ops.py

-        tm.assert_series_equal(obj.value_counts(), expected)
+        # sort_index to avoid switched order when values share the same count
+        expected = expected.sort_index()
+        result = obj.value_counts().sort_index()


I think the result should just be obj.value_counts() for the test to be considered a valid test.

is it feasible for expected to be constructed to create the expected sort order or is the sort order tested elsewhere?

I generally agree, but this is the only solution I found to work consistently on CI.

What I've found so far is the following:

Usually, value_counts preserves the order from obj if multiple values share the same count

This scenario is covered by the current test design/implementation

However, this seems to break on CI (I could only see it breaking for float16 on Windows py36_np15, but there might be more)

Locally (mac, py36) I cannot reproduce the flakiness, even when running it many times

So this might actually be a bug. I never worked with Cython, so it's hard for me to trace this deeper. It might be an issue in value_count_float64 in pandas/_libs/hashtable_func_helper.pxi

My suggestion would be to

adjust the current code to only call sort_index for float16 for now

merge this PR

open an issue for the potential bug

By seeing if the test is still flaky afterwards we can gather more data about its relation to float16 etc.

Wdyt @simonjayhawkins?

sounds good with a TODO

could it be not related to float16, just duplicate values are more likely due to decreased resolution.

I don't think so. If it's about duplicated values alone then it should always fail for repeats, see here

This time it failed with int32 as well: https://dev.azure.com/pandas-dev/pandas/_build/results?buildId=30037&view=logs&j=3a03f79d-0b41-5610-1aa4-b4a014d0bc70&t=4d05ed0e-1ed3-5bff-dd63-1e957f2766a9&l=66

How should we go about this?

could just revert #32281 for now instead @jbrockmendel ?

I come to wonder if the order of values with the same count is actually deterministic. If not, is there an alternative to using sort_index on result and expected?

could just revert #32281 for now instead @jbrockmendel ?

I think merging this PR is better than reverting the #32281. In the previous version we just skipped all tests with duplicated values. Here we at least test that the values are correct, even though we don't validate that they are ordered consistently.

pandas/tests/base/test_ops.py

… on CI

simonjayhawkins

Thanks @SaturnFromTitan lgtm pending green

WillAyd · 2020-03-06T16:59:21Z

pandas/tests/base/test_ops.py

-        # sort_index to avoid switched order when values share the same count
-        result = result.sort_index()
-        expected = expected.sort_index()
+        # TODO: Order of entries with the same count is inconsistent on CI (gh-32449)


Is there an open issue for this?

xref: #32514

SaturnFromTitan · 2020-03-07T09:34:01Z

@WillAyd @simonjayhawkins I added a follow-up issue and will come back to it later. Otherwise I think I addressed all comments, so this should be good to merge

simonjayhawkins · 2020-03-07T09:40:20Z

Thanks @SaturnFromTitan

TomAugspurger · 2020-03-09T19:23:34Z

Generally fixes for flaky tests should be backported. Does the 1.0.x branch suffer from the original flakiness?

simonjayhawkins · 2020-03-09T19:31:54Z

no. #32281 was not backported.

TomAugspurger · 2020-03-09T19:32:12Z

Great, thanks.

sorting by index before comparing results in test_value_counts_null

2113521

simonjayhawkins mentioned this pull request Mar 5, 2020

CLN: Replaced "bool_t" with "builtins.bool" #32365

Closed

5 tasks

simonjayhawkins reviewed Mar 5, 2020

View reviewed changes

simonjayhawkins added the CI Continuous Integration label Mar 5, 2020

simonjayhawkins added this to the 1.1 milestone Mar 5, 2020

SaturnFromTitan added 2 commits March 5, 2020 17:19

using sort_index for value_counts tests only for float16

cde9df6

Merge branch 'master' into fix-flaky-tests-test_value_counts_null

f04d255

simonjayhawkins mentioned this pull request Mar 5, 2020

TYP/CLN: Optional[Hashable] -> pandas._typing.Label #32371

Merged

jbrockmendel reviewed Mar 5, 2020

View reviewed changes

pandas/tests/base/test_ops.py Outdated Show resolved Hide resolved

SaturnFromTitan added 2 commits March 5, 2020 20:14

using sort_index for all fixture values to handle all inconsistencies…

ab04a73

… on CI

only using sort_index if there are duplicated values

72b4da1

simonjayhawkins approved these changes Mar 5, 2020

View reviewed changes

WillAyd reviewed Mar 6, 2020

View reviewed changes

SaturnFromTitan mentioned this pull request Mar 7, 2020

TST: Seemingly non-deterministic order of value_counts #32514

Closed

simonjayhawkins merged commit 45d412f into pandas-dev:master Mar 7, 2020

SeeminSyed pushed a commit to CSCD01-team01/pandas that referenced this pull request Mar 22, 2020

CI: Fix flaky test_value_counts_null (pandas-dev#32449)

0749715

BaruchYoussin mentioned this pull request Aug 23, 2020

BUG: value_counts() is non-reproducible from one run to another #35862

Closed

1 task

theemathas mentioned this pull request Oct 6, 2020

BUG: Joining data frames with MultiIndex results in non-deterministic level order. #36910

Closed

3 tasks

jreback mentioned this pull request Oct 6, 2020

ENH: guarantee pandas.Series.value_counts "sort=False" to be original ordering #12679

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI: Fix flaky test_value_counts_null #32449

CI: Fix flaky test_value_counts_null #32449

SaturnFromTitan commented Mar 4, 2020

simonjayhawkins Mar 5, 2020

SaturnFromTitan Mar 5, 2020

simonjayhawkins Mar 5, 2020

SaturnFromTitan Mar 5, 2020 •

edited

Loading

SaturnFromTitan Mar 5, 2020

simonjayhawkins Mar 5, 2020

SaturnFromTitan Mar 5, 2020

SaturnFromTitan Mar 5, 2020

simonjayhawkins left a comment

WillAyd Mar 6, 2020

SaturnFromTitan Mar 7, 2020

SaturnFromTitan commented Mar 7, 2020

simonjayhawkins commented Mar 7, 2020

TomAugspurger commented Mar 9, 2020

simonjayhawkins commented Mar 9, 2020

TomAugspurger commented Mar 9, 2020

CI: Fix flaky test_value_counts_null #32449

CI: Fix flaky test_value_counts_null #32449

Conversation

SaturnFromTitan commented Mar 4, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SaturnFromTitan Mar 5, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simonjayhawkins left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SaturnFromTitan commented Mar 7, 2020

simonjayhawkins commented Mar 7, 2020

TomAugspurger commented Mar 9, 2020

simonjayhawkins commented Mar 9, 2020

TomAugspurger commented Mar 9, 2020

SaturnFromTitan Mar 5, 2020 •

edited

Loading