CI: azure timeouts #43643

mzeitlin11 · 2021-09-18T17:40:12Z

Based on the logging in #43611, in both timeout cases, the last test gw0 ran was before the hypothesis test in test_parse_dates (this is a giant test - 56 parameterizations, hypothesis does 100 examples by default).

Not sure why this would be the cause, couldn't find any issues about it that might explain a deadlock or something like that, but maybe this will help? Regardless of fixing timeouts, it makes sense for these to be slow

mzeitlin11 · 2021-09-18T17:48:40Z

Maybe something similar to HypothesisWorks/hypothesis#2340?

mzeitlin11 · 2021-09-18T19:01:38Z

Hmm this didn't work - new guess - the logging seems to stop around parser tests - maybe the fact that we use the same parser object across tests can cause deadlock (or issue is some mutation is happening?)

mzeitlin11 · 2021-09-19T02:03:53Z

We're at ~5 in a row on azure not timing out. Will keep running azure, but this is ready for review from my side. Summary of changes:

Mark hypothesis tests as slow
For parser fixtures, ensure we generate new objects instead of sharing them
skip pyarrow tests which can deadlock, xref CI/BUG: pyarrow read_csv deadlock #43650
(EDIT: just noticed more timeout cases on deadlock in CI: debug azure timeouts #43611. Another option might be to just replace all pyarrow xfails with skips)

jreback · 2021-09-20T13:17:22Z

Mark hypothesis tests as slow

is there some option that can set on hypthosesis instead? the point of these tests is to find holes in our tests, which are almost all fast, so this just remove this entirely (which maybe ok). but then we should just do that.

jreback · 2021-09-20T13:17:43Z

pandas/tests/io/parser/common/test_index.py

@@ -16,6 +16,7 @@
 import pandas._testing as tm

 xfail_pyarrow = pytest.mark.usefixtures("pyarrow_xfail")
+skip_pyarrow = pytest.mark.usefixtures("pyarrow_skip")


can you add some comments here on when to xfail vs skip

Yep, will do

mzeitlin11 · 2021-09-20T16:07:00Z

is there some option that can set on hypthosesis instead? the point of these tests is to find holes in our tests, which are almost all fast, so this just remove this entirely (which maybe ok). but then we should just do that.

We could set fewer examples to run (but at some point that defeats the purpose of hypothesis). I thought on running on slow was a good compromise since it will still run on some builds, just fewer.

Regardless, the important change here is skipping pyarrow for the hypothesis parse_dates test - will just remove the slow markers for now since it turned out pyarrow was the cause of the timeout, not the hypothesis tests sometimes running extremely slowly

jreback · 2021-09-20T16:24:01Z

kk lgtm. ping when ready to merge (or just go ahead)

jreback · 2021-09-20T16:55:29Z

note that I think we are seeing something similar on 1.3.4, but we didn't merge the pyarrow csv reader so prob something else.

mzeitlin11 · 2021-09-21T02:12:00Z

note that I think we are seeing something similar on 1.3.4, but we didn't merge the pyarrow csv reader so prob something else.

Good to know - certainly possible there are other potential timeout-causing issues unrelated to pyarrow. Will keep running azure pipelines on #43611 to see if anything else comes up. This should at least make timeouts less frequently hopefully

jreback · 2021-09-21T12:46:51Z

thanks @mzeitlin11 nice improvement here. Yeah let's keep an eye on 1.3.4

CI: mark hypothesis tests as slow

40f61e1

mzeitlin11 added Testing pandas testing functions or related to the test suite CI Continuous Integration labels Sep 18, 2021

mzeitlin11 marked this pull request as draft September 18, 2021 19:03

Don't share parsers

797922e

mzeitlin11 mentioned this pull request Sep 18, 2021

CI/BUG: pyarrow read_csv deadlock #43650

Closed

Add some pyarrow skips

1f6f837

mzeitlin11 marked this pull request as ready for review September 18, 2021 22:56

pandas-dev deleted a comment from azure-pipelines bot Sep 19, 2021

mzeitlin11 changed the title ~~CI: mark hypothesis tests as slow~~ CI: azure timeouts Sep 19, 2021

mzeitlin11 added 2 commits September 18, 2021 22:11

Skip more

4daa511

Skip more int tests

acc4ca4

jreback requested changes Sep 20, 2021

View reviewed changes

mzeitlin11 added 3 commits September 20, 2021 12:10

Remove hypothesis slow marks

f121ab8

Add comment explaining skip

52c0b3f

Fix test_ticks diff

94c8cdd

jreback added this to the 1.4 milestone Sep 20, 2021

jreback approved these changes Sep 20, 2021

View reviewed changes

jreback merged commit f9b6290 into pandas-dev:master Sep 21, 2021

mzeitlin11 deleted the mark_hypothesis_slow branch September 21, 2021 16:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI: azure timeouts #43643

CI: azure timeouts #43643

mzeitlin11 commented Sep 18, 2021 •

edited

Loading

mzeitlin11 commented Sep 18, 2021

mzeitlin11 commented Sep 18, 2021

mzeitlin11 commented Sep 19, 2021 •

edited

Loading

jreback commented Sep 20, 2021

jreback Sep 20, 2021

mzeitlin11 Sep 20, 2021

mzeitlin11 commented Sep 20, 2021

jreback commented Sep 20, 2021

jreback commented Sep 20, 2021

mzeitlin11 commented Sep 21, 2021

jreback commented Sep 21, 2021

CI: azure timeouts #43643

CI: azure timeouts #43643

Conversation

mzeitlin11 commented Sep 18, 2021 • edited Loading

mzeitlin11 commented Sep 18, 2021

mzeitlin11 commented Sep 18, 2021

mzeitlin11 commented Sep 19, 2021 • edited Loading

jreback commented Sep 20, 2021

jreback Sep 20, 2021

Choose a reason for hiding this comment

mzeitlin11 Sep 20, 2021

Choose a reason for hiding this comment

mzeitlin11 commented Sep 20, 2021

jreback commented Sep 20, 2021

jreback commented Sep 20, 2021

mzeitlin11 commented Sep 21, 2021

jreback commented Sep 21, 2021

mzeitlin11 commented Sep 18, 2021 •

edited

Loading

mzeitlin11 commented Sep 19, 2021 •

edited

Loading