-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
CI: failing timedelta64 test on Linux py37_locale build #33890
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
xref #33898 looks like in this circumstance
|
@seberg any guess why |
@jbrockmendel I somewhat thought we had removed locale stuff, but not sure, I will look into it. The Just to be sure, is the failure newer, or just something that came up now randomly? |
I've been seeing it for a few days, but its not in the npdev build. Its py37, Linux, numpy 1.18.1 |
@jbrockmendel you have a setup where you can reproduce this, right? For you The current dtypes work by having a C-side singleton instance. I assume that singleton instance is modified somewhere before this test is executed. I.e. you are not allowed to modify that singleton, but somewhere it does get modified. Seems tricky to track down where it happens before this test if this is the case. Is there a minimal test run I can do to reproduce this? |
I haven't been able to reproduce this locally; it is in the build that sets the locale to "zh_CN.utf8", which my machine wont let me do |
I poked at it for a while, and I honestly have no clue where to search and I am unable to get things running far enough to hope to reproduce it. I see there were a couple of changes e.g. in the json code a few weeks back. Could it be related to that? I tried looking at them and did not see anything obvious, but its pretty tricky and complex C code... |
Thanks for taking a look. I think we're going to end up xfailing it for the time being. |
I ran the pandas tests through valgrind over night. There is one fishy spot in that json code, although I doubt that it caused this. I will open a PR for that I think, does the test fail fairly reliably on CI? There were two or so other slightly fishy things seen in valgrind. One a possible one in What worried me a bit more is:
in the hashtable implementation. Although likely also just an immediate use-after-free which is unlikely to cause actual problems in practice, it may be nice to clean up? |
it was failing pretty often, is now xfailed.
Can you describe this?
I'll take a look at this, though it is likely past the limits of my C-fu. |
Hmmm, the take one (sorry, I did not use a debug python version here, so the output is not super useful:
I had missed the section part, which seems to occur within |
Wow, some other random change I made in NumPy made me run into a segfault in a semi-recent change to NumPy, which seems like it is probably related. Still have to investigate more, but I think the bug is probably in NumPy, just so hidden we did not notice it yet. EDIT: Sorry, false flag :(, it just happened that there was a header issue in the datetime file due to my change... |
Closable? |
This just stopped failing somehow? Or did we actually do something? |
xfailed the affected test in the relevant conditions |
This is seen on a few PRs (eg https://dev.azure.com/pandas-dev/pandas/_build/results?buildId=34508&view=logs&jobId=c4100758-acc4-5cd0-72d8-90828968cf00&j=c4100758-acc4-5cd0-72d8-90828968cf00)
The text was updated successfully, but these errors were encountered: