DataFrame.query produces inconsistent results on Windows 8.1 and higher #12055
Labels
Compat
pandas objects compatability with Numpy or Python functions
Duplicate Report
Duplicate issue or pull request
I have had issues with DataFrame.query() producing erroneous results on machines with Window 8.1, Windows Server 2012 R2 and Windows 10.
The issue can be re-produced with the code below which is creating a simple data frame and then querying it successively and checking the results. The issue appears to be intermittent: running the script multiple times will show the query failing on different tests. The issue also seems to depend on the data frame size: the the issue does not seem to existing for data frames below ~1,000 rows, data frames with ~1000 to 100,000 rows return the wrong results, and data frames with more than 100k rows commonly return no results.
I believe the perhaps lies with the numexpr evaluation. In the above script, changing the evaluation engine to python, does not return any errors. Also, the script below demonstrates the issue occur when doing numexpr by itself:
The text was updated successfully, but these errors were encountered: