Skip to content

BUG: drop_duplicates raising for boolean arrow dtype with missing values #54670

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Aug 22, 2023

Conversation

phofl
Copy link
Member

@phofl phofl commented Aug 21, 2023

cc @mroeschke

More generally, how do we want to proceed with Arrow EA fixes after 2.1 is out?

@phofl phofl added the Arrow pyarrow functionality label Aug 21, 2023
@@ -249,3 +249,10 @@ def test_drop_duplicates_ignore_index(self):
result = ser.drop_duplicates(ignore_index=True)
expected = Series([1, 2, 3])
tm.assert_series_equal(result, expected)

def test_duplicated_arrow_dtype(keep):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you intend to parameterize over keep?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nope, that was a lesson that you shouldn't copy paste without double checking

@mroeschke
Copy link
Member

More generally, how do we want to proceed with Arrow EA fixes after 2.1 is out?

Since Arrow support isn't new anymore I suppose we shouldn't be backporting bug fixes as aggressively anymore, but I would still be okay with it

@phofl
Copy link
Member Author

phofl commented Aug 22, 2023

Ok then maybe lets backport low risk fixes and decide on the others on a case by case basis

@mroeschke mroeschke modified the milestones: 2.2, 2.1 Aug 22, 2023
@mroeschke mroeschke merged commit 47ae3f0 into pandas-dev:main Aug 22, 2023
meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Aug 22, 2023
@phofl phofl deleted the 54667 branch August 22, 2023 18:48
@mroeschke
Copy link
Member

Thanks @phofl

mroeschke pushed a commit that referenced this pull request Aug 22, 2023
…boolean arrow dtype with missing values) (#54692)

Backport PR #54670: BUG: drop_duplicates raising for boolean arrow dtype with missing values

Co-authored-by: Patrick Hoefler <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Arrow pyarrow functionality
Projects
None yet
Development

Successfully merging this pull request may close these issues.

BUG: drop_duplicates raises unexpected error for bool[pyarrow] with null value
2 participants