BUG: Series.rank modifies inplace with NaT #18576
Changes from 4 commits
e82b128
d1cc880
9c07153
ca5d28c
aad8fb8
caa3fa9
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -84,6 +84,9 @@ def rank_1d_{{dtype}}(object in_arr, ties_method='average', ascending=True, | |
mask = np.isnan(values) | ||
{{elif dtype == 'int64'}} | ||
mask = values == iNaT | ||
# create copy in case of iNaT | ||
Review comment: add that we are mutating the values in-place here |
||
if mask.any(): | ||
values = values.copy() | ||
{{endif}} | ||
|
||
# double sort first by mask and then by values to ensure nan values are | ||
|
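The hunk above copies `values` only when the mask finds an `iNaT` entry, so that the later in-place fill does not leak back into the caller's array. A minimal NumPy sketch of that copy-before-mutate pattern follows. The helper name `fill_missing_for_rank`, the `INAT` constant, and the fill step are assumptions made for illustration; they are not the actual template code.

```python
import numpy as np

# Assumed sentinel mirroring pandas' iNaT (the minimum int64 value).
INAT = np.iinfo(np.int64).min


def fill_missing_for_rank(values, fill_value):
    """Hypothetical helper: fill sentinel entries without touching the input."""
    mask = values == INAT
    if mask.any():
        # Copy only when a sentinel is present, so the common case
        # (no missing values) avoids an extra allocation.
        values = values.copy()
        # This in-place write now lands on the copy, not the caller's data.
        values[mask] = fill_value
    return values, mask


original = np.array([5, INAT, 3], dtype=np.int64)
filled, mask = fill_missing_for_rank(original, np.iinfo(np.int64).max)
```

After the call, `original` still holds the sentinel while `filled` carries the replacement value. Without the copy, both names would refer to the same mutated buffer.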
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2214,3 +2214,11 @@ def test_series_broadcasting(self): | |
df_nan.clip_lower(s, axis=0) | ||
for op in ['lt', 'le', 'gt', 'ge', 'eq', 'ne']: | ||
getattr(df, op)(s_nan, axis=0) | ||
|
||
def test_series_nat_conversion(self): | ||
# GH 18521 | ||
Review comment: add a one-liner explaining that this is testing non-mutation of the input data |
||
df = DataFrame(np.random.randn(10, 3), dtype='float64') | ||
expected = df.copy() | ||
df.rank() | ||
result = df | ||
tm.assert_frame_equal(result, expected) |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,5 @@ | ||
# -*- coding: utf-8 -*- | ||
from pandas import compat | ||
from pandas import compat, Timestamp | ||
|
||
import pytest | ||
|
||
|
@@ -368,3 +368,12 @@ def test_rank_object_bug(self): | |
# smoke tests | ||
Series([np.nan] * 32).astype(object).rank(ascending=True) | ||
Series([np.nan] * 32).astype(object).rank(ascending=False) | ||
|
||
def test_rank_modify_inplace(self): | ||
# GH 18521 | ||
Review comment: same as above (add a one-liner explaining that this is testing non-mutation of the input data) |
||
s = Series([Timestamp('2017-01-05 10:20:27.569000'), NaT]) | ||
expected = s.copy() | ||
|
||
Review comment: add a test for an all-float DataFrame, in pandas/tests/frame/test_analytics |
||
s.rank() | ||
result = s | ||
assert_series_equal(result, expected) |
Review comment: can you move this to the reshaping section (in bug fixes)
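For readers who want to check the behaviour outside the test suite, the scenario in `test_rank_modify_inplace` can be reproduced with the public pandas API. This is a sketch only: it assumes a pandas version that includes the fix and uses `pd.testing.assert_series_equal` in place of the test module's own import.

```python
import pandas as pd

# Same data as the test in the diff: one valid timestamp and one NaT.
s = pd.Series([pd.Timestamp('2017-01-05 10:20:27.569000'), pd.NaT])
expected = s.copy()

# rank() should return a new Series of ranks and leave `s` untouched.
ranks = s.rank()

# Raises AssertionError if rank() modified the input in place.
pd.testing.assert_series_equal(s, expected)
```

Before the fix, the `NaT` entry in `s` could be overwritten by the fill value used internally for ranking, which is the mutation described in the PR title.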