BUG: skipna parameter in series.any() returns wrong result #23109

kaybhutani · 2018-10-12T14:28:28Z

# Import pandas and NumPy
import pandas as pd
import numpy as np

data = pd.DataFrame({'A': [1, 2, 3, 4, 0, np.nan, 3],
                     'B': [3, 1, 4, 5, 0, np.nan, 5]})

data.any(axis=1, skipna=True)

Expected output:
0 True
1 True
2 True
3 True
4 False
5 True
6 True
dtype: bool

Returned output:

0 True
1 True
2 True
3 True
4 False
5 False
6 True
dtype: bool

The documentation states: "If an entire row/column is NA, the result will be NA."
However, NA is not returned in either case (skipna=True or skipna=False).


TomAugspurger · 2018-10-12T15:28:43Z

As written in documentation, If an entire row/column is NA, the result will be NA

I think the docs at

pandas/pandas/core/generic.py

Lines 9728 to 9730 in 12a0dc4

    
    skipna : boolean, default True
        Exclude NA/null values. If an entire row/column is NA, the result
        will be NA.

are incorrect. Skipna should be the same as the operation on the values with NAs removed (is that right @jorisvandenbossche?).
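To make the "operation on the values with NAs removed" reading concrete, here is a minimal sketch that compares `any(axis=1, skipna=True)` on the DataFrame from the report with a manual version that drops NAs from each row first. The names `reduced` and `manual` are illustrative only, and this checks the observed behaviour rather than describing how pandas implements it.

```python
import numpy as np
import pandas as pd

# Same data as in the original report.
data = pd.DataFrame({"A": [1, 2, 3, 4, 0, np.nan, 3],
                     "B": [3, 1, 4, 5, 0, np.nan, 5]})

# Built-in reduction with NAs skipped (skipna=True is the default).
reduced = data.any(axis=1, skipna=True)

# Manual equivalent: drop the NAs in each row, then reduce what is left.
# Row 5 is entirely NaN, so any() runs over an empty row there.
manual = data.apply(lambda row: row.dropna().any(), axis=1)

print(reduced.tolist() == manual.tolist())
```

If the two agree, the all-NaN row 5 is simply the "any of nothing" case, which is why it comes back as False rather than NA.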

kaybhutani · 2018-10-12T16:24:59Z

Isn't this a better way? ↓
If skipna is None, which is the default, then it returns NA for rows/columns that are entirely NaN.
If skipna is True or False, then it returns True or False, respectively, for rows/columns that are entirely NA.

Because if the docs are incorrect, then there is probably no way to return NA for all-null rows/columns.

TomAugspurger · 2018-10-12T16:31:12Z

min_count, I think.


dsaxton · 2018-10-14T04:58:12Z

My opinion: I think the problem is the documentation; the result is actually correct. If you ask if any of an empty set of statements is True, the answer is no. This is consistent with numpy:

In [1]: import numpy as np

In [2]: np.any([])
Out[2]: False
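The same empty-set argument can be checked on the pandas side. The sketch below assumes a float Series containing only NaN (the variable `all_na` is illustrative) and compares it with the Python builtins and NumPy on empty input. With `skipna=False` the NaNs are kept and, being nonzero, count as truthy.

```python
import numpy as np
import pandas as pd

all_na = pd.Series([np.nan, np.nan])

# Reductions over an empty collection: any -> False, all -> True.
print(any([]), all([]))
print(np.any([]), np.all([]))

# skipna=True (default) drops the NaNs first, leaving nothing to reduce.
print(all_na.any(), all_na.all())

# skipna=False keeps the NaNs; NaN is not equal to zero, so it is truthy.
print(all_na.any(skipna=False))
```

None of these calls returns NA, which matches the observation in the report that NA is not produced with either setting of `skipna`.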

jorisvandenbossche · 2018-10-15T08:39:55Z

Skipna should be the same as the operation on the values with NAs removed (is that right @jorisvandenbossche?).

I think so as well. any/all can be seen as reductions like sum or prod, so we should probably follow their design.

So I think @dsaxton is right that it is only the documentation that is incorrect.
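For comparison, this is roughly how `sum` and `prod` treat all-NA data, assuming pandas 0.22 or later, where these reductions return their identity element after NAs are skipped. The `min_count` parameter shown here belongs to `sum` and `prod`; the thread does not establish that `any` or `all` accept it, so it is not applied to them.

```python
import numpy as np
import pandas as pd

s = pd.Series([np.nan, np.nan])

# After skipping NAs nothing is left, so each reduction returns its identity.
total = s.sum()      # additive identity
product = s.prod()   # multiplicative identity

# min_count asks for at least that many valid values, otherwise NaN.
total_min1 = s.sum(min_count=1)

print(total, product, total_min1)
```

By the same logic, False is the identity for `any` and True is the identity for `all`, which is what the all-NA cases return.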

dsaxton · 2018-10-17T00:51:29Z

@jorisvandenbossche What would you say is the appropriate fix for this? If the documentation is a general statement about the skipna parameter, maybe it makes sense to just remove the claim that the result will be NA (since it's not true for any, but presumably would be true in other contexts)?


jorisvandenbossche added Docs good first issue labels Oct 15, 2018

jorisvandenbossche added this to the 0.24.0 milestone Oct 15, 2018

jreback modified the milestones: 0.24.0, Contributions Welcome Dec 2, 2018

jamesmyatt added a commit to jamesmyatt/pandas that referenced this issue Dec 3, 2018

BUG/DOC: Correct docstrings for any and all when skipna == True…

e5a0f61

… and data are all NA (pandas-dev#23109) Include examples with NA values and describe treatement of NA with `skipna == False`

jamesmyatt added a commit to jamesmyatt/pandas that referenced this issue Dec 3, 2018

DOC: Correct/update skipna docstrings for any and all (pandas-dev…

f95cef4

…#23109) Also include examples with NA values and clarify treatment of NA with `skipna == False`

jamesmyatt added a commit to jamesmyatt/pandas that referenced this issue Dec 3, 2018

DOC: Correct/update skipna docstrings for any and all (pandas-dev…

df91bcb

…#23109) Also include examples with NA values and clarify treatment of NA with `skipna == False`

jamesmyatt mentioned this issue Dec 3, 2018

DOC: Correct/update skipna docstrings for any and all (#23109) #24069

Merged

4 tasks

jreback modified the milestones: Contributions Welcome, 0.24.0 Dec 4, 2018

jreback closed this as completed in #24069 Dec 10, 2018

jreback pushed a commit that referenced this issue Dec 10, 2018

DOC: Correct/update skipna docstrings for any and all (#23109) (#…

5389987

…24069)

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this issue Feb 28, 2019

DOC: Correct/update skipna docstrings for any and all (pandas-dev…

97f14aa

…#23109) (pandas-dev#24069)

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this issue Feb 28, 2019

DOC: Correct/update skipna docstrings for any and all (pandas-dev…

3df8a6a

…#23109) (pandas-dev#24069)
