REGR: fix rank algo for read-only data #37439

jorisvandenbossche · 2020-10-27T08:05:41Z

jreback · 2020-10-27T12:49:21Z

Hmm, I think this is actually failing; see the Travis CI job: https://travis-ci.org/github/pandas-dev/pandas/jobs/739214922

>   ranked_mat[:, i] = rank_1d(mat[:, i])
E   TypeError: Argument 'in_arr' has incorrect type (expected numpy.ndarray, got pandas._libs.algos._memoryviewslice)
pandas/_libs/algos.pyx:347: TypeError
_ TestDataFrameCorr.test_corr_nullable_integer[spearman-other_column2-nullable_column1] _
[gw0] linux -- Python 3.7.9 /home/travis/miniconda3/envs/pandas-dev/bin/python
self = <pandas.tests.frame.methods.test_cov_corr.TestDataFrameCorr object at 0x7fcbc4f5a4d0>
nullable_column = <IntegerArray>
[1, 2, <NA>]
Length: 3, dtype: Int64
other_column = array([ 1.,  2., nan]), method = 'spearman'
    @td.skip_if_no_scipy
    @pytest.mark.parametrize(
        "nullable_column", [pd.array([1, 2, 3]), pd.array([1, 2, None])]
    )
    @pytest.mark.parametrize(
        "other_column",
        [pd.array([1, 2, 3]), np.array([1.0, 2.0, 3.0]), np.array([1.0, 2.0, np.nan])],
    )
    @pytest.mark.parametrize("method", ["pearson", "spearman", "kendall"])
    def test_corr_nullable_integer(self, nullable_column, other_column, method):
        # https://github.com/pandas-dev/pandas/issues/33803
        data = DataFrame({"a": nullable_column, "b": other_column})
>       result = data.corr(method=method)
pandas/tests/frame/methods/test_cov_corr.py:190: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
pandas/core/frame.py:8300: in corr
    correl = libalgos.nancorr_spearman(mat, minp=min_periods)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
>   ranked_mat[:, i] = rank_1d(mat[:, i])
E   TypeError: Argument 'in_arr' has incorrect type (expected numpy.ndarray, got pandas._libs.algos._memoryviewslice)

jbrockmendel · 2020-10-27T15:31:42Z

Looks like you need to update the argument types in nancorr_spearman and rank_2d, both of which call rank_1d.

jorisvandenbossche · 2020-10-27T20:51:46Z

Thanks for the note. Indeed, I needed to update the other places where rank_1d is called.
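
For readers unfamiliar with the read-only buffer issue behind this thread, here is a minimal standard-library sketch. It is not the pandas Cython code: `rank_average` and the `readonly` variable are hypothetical names used only for illustration, and the average-rank tie handling is an assumption chosen for the example. It shows that a ranking routine which only reads its input can accept a read-only buffer, whereas any write to that buffer raises `TypeError`.

```python
from array import array


def rank_average(values):
    """Return 1-based average ranks of `values` without writing to it.

    Hypothetical helper for illustration only; not pandas' rank_1d.
    """
    n = len(values)
    # Sort indices by value; the input buffer itself is never modified.
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    start = 0
    while start < n:
        # Extend `end` over the run of tied values.
        end = start
        while end + 1 < n and values[order[end + 1]] == values[order[start]]:
            end += 1
        # Tied values share the mean of the positions they occupy.
        average = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = average
        start = end + 1
    return ranks


# A memoryview over immutable bytes is read-only, much like a NumPy
# array whose writeable flag has been cleared.
raw = array("d", [3.0, 1.0, 2.0, 1.0]).tobytes()
readonly = memoryview(raw).cast("d")

print(readonly.readonly)        # True
print(rank_average(readonly))   # [4.0, 1.5, 3.0, 1.5]
```

Attempting `readonly[0] = 9.0` raises `TypeError`, which is the general class of failure that appears when code requests a writable view of data it only needs to read.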

jreback · 2020-10-27T22:09:10Z

thanks @jorisvandenbossche

jreback · 2020-10-28T02:23:25Z

@meeseeksdev backport 1.1.x

Co-authored-by: Joris Van den Bossche <[email protected]>

REGR: fix rank algo for read-only data

01018d1

jorisvandenbossche added the Regression and Algos labels Oct 27, 2020

jorisvandenbossche added this to the 1.1.4 milestone Oct 27, 2020

jorisvandenbossche mentioned this pull request Oct 27, 2020

BUG: rank raises error with read-only data #37290

Closed

3 tasks

update call sites of rank_1d

1c6248e

jreback merged commit 9c5500e into pandas-dev:master Oct 27, 2020

meeseeksmachine mentioned this pull request Oct 28, 2020

Backport PR #37439 on branch 1.1.x (REGR: fix rank algo for read-only data) #37459

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Oct 28, 2020

Backport PR pandas-dev#37439: REGR: fix rank algo for read-only data

782ddee

jorisvandenbossche deleted the gh-37290-rank-readonly branch October 28, 2020 07:23

jorisvandenbossche added a commit that referenced this pull request Oct 28, 2020

Backport PR #37439: REGR: fix rank algo for read-only data (#37459)

dc39ee2

Co-authored-by: Joris Van den Bossche <[email protected]>

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

REGR: fix rank algo for read-only data (pandas-dev#37439)

f57b0ea

ukarroum pushed a commit to ukarroum/pandas that referenced this pull request Nov 2, 2020

REGR: fix rank algo for read-only data (pandas-dev#37439)

1a08ab0
