BUG: Fix for convert_dtypes with mix of int and string #32126

Dr-Irv · 2020-02-20T13:52:45Z

- closes #32117 (convert_dtypes fails with int and str)
- tests added / passed
  - added new cases for test_convert_dtypes
- passes black pandas
- passes git diff upstream/master -u -- "*.py" | flake8 --diff
- whatsnew entry
  - placed in v1.0.2 whatsnew

jorisvandenbossche

Thanks! Small question

jorisvandenbossche · 2020-02-20T15:04:24Z

pandas/tests/series/methods/test_convert_dtypes.py

@@ -144,11 +156,23 @@ class TestSeriesConvertDtypes:
                [1, 2.0],


It's a bit hard to interpret the tests / diff below, but does pd.Series([1, 2.0], dtype=object).convert_dtypes() still give Int64?

Dr-Irv · 2020-02-20

Yes. Here's the interpretation of the tests for that case. The code reads:

    (
        [1, 2.0],
        object,
        {
            ((True,), (True, False), (True,), (True, False)): "Int64",
            ((True,), (True, False), (False,), (True, False)): np.dtype("float"),
            ((False,), (True, False), (True, False), (True, False)): np.dtype("object"),
        },
    ),

This means the following:

1) Create a Series with [1, 2.0] as the entries, with dtype object.
2) Consider the 16 possible combinations of the 4 arguments infer_objects, convert_string, convert_integer and convert_boolean.
3) If infer_objects==True and convert_integer==True, the result should be Int64.
4) If infer_objects==True and convert_integer==False, the result should be float.
5) If infer_objects==False, the result is always object.

Prior to this PR, the tests for items 3 to 5 were as follows:

p3) If convert_integer==True, the result should be Int64, independent of the value of infer_objects.
p4) If infer_objects==True and convert_integer==False, the result should be float (unchanged).
p5) If infer_objects==False and convert_integer==False, the result is object.
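To make the key format easier to read, here is a minimal sketch that expands the dictionary from the test into the 16 concrete argument combinations. It uses only the standard library. The helper name expand, the PARAMS tuple and the use of plain strings in place of np.dtype objects are illustrative assumptions, not part of the pandas test suite.

```python
from itertools import product

# Argument order assumed from the discussion above.
PARAMS = ("infer_objects", "convert_string", "convert_integer", "convert_boolean")

# Expected dtypes for pd.Series([1, 2.0], dtype=object), keyed as in the test.
# Dtype names are plain strings here to keep the sketch dependency-free.
expected = {
    ((True,), (True, False), (True,), (True, False)): "Int64",
    ((True,), (True, False), (False,), (True, False)): "float",
    ((False,), (True, False), (True, False), (True, False)): "object",
}


def expand(spec):
    """Map every concrete argument combination to its expected dtype name."""
    table = {}
    for allowed_values, dtype_name in spec.items():
        # Each key holds one tuple of allowed values per argument;
        # their Cartesian product is the set of combinations it covers.
        for combo in product(*allowed_values):
            assert combo not in table, "a combination is covered twice"
            table[combo] = dtype_name
    return table


table = expand(expected)
for combo in sorted(table, reverse=True):
    print(dict(zip(PARAMS, combo)), "->", table[combo])
```

The three keys cover 4, 4 and 8 combinations respectively, so together they account for all 16 without overlap.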

I think the new version is what we want the behavior to be, i.e., if you start with object dtype and you don't do the infer-objects step, the result remains object.
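The behavior described in this thread can be checked directly. This is a small sketch that assumes a pandas version containing this fix (1.0.2 or later). With convert_integer=False, the exact float dtype may differ between pandas versions (NumPy float64 or a nullable float type), so the sketch only checks that the result is a float dtype. The variable names are illustrative.

```python
import pandas as pd

s = pd.Series([1, 2.0], dtype=object)

# Default arguments: infer_objects=True and convert_integer=True.
default_result = s.convert_dtypes()

# Skipping the infer_objects step leaves the object dtype untouched.
no_infer_result = s.convert_dtypes(infer_objects=False)

# Inferring objects but not converting to integer yields a float dtype.
no_int_result = s.convert_dtypes(convert_integer=False)

print(default_result.dtype)
print(no_infer_result.dtype)
print(no_int_result.dtype)
```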

jorisvandenbossche · 2020-02-21T14:49:05Z

@Dr-Irv Thanks!


Fix for convert_dtypes with mix of int and string

c99a844

jorisvandenbossche changed the title ~~Fix for convert_dtypes with mix of int and string~~ BUG: Fix for convert_dtypes with mix of int and string Feb 20, 2020

jorisvandenbossche added this to the 1.0.2 milestone Feb 20, 2020

jorisvandenbossche added the Bug label Feb 20, 2020

jorisvandenbossche reviewed Feb 20, 2020


jorisvandenbossche approved these changes Feb 20, 2020


jorisvandenbossche merged commit c05ef6f into pandas-dev:master Feb 21, 2020

meeseeksmachine mentioned this pull request Feb 21, 2020

Backport PR #32126 on branch 1.0.x (BUG: Fix for convert_dtypes with mix of int and string) #32153

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Feb 21, 2020

Backport PR pandas-dev#32126: BUG: Fix for convert_dtypes with mix of int and string

3969ccf

simonjayhawkins pushed a commit that referenced this pull request Feb 21, 2020

Backport PR #32126: BUG: Fix for convert_dtypes with mix of int and string (#32153) Co-authored-by: Irv Lustig

de6020e

roberthdevries pushed a commit to roberthdevries/pandas that referenced this pull request Mar 2, 2020

BUG: Fix for convert_dtypes with mix of int and string (pandas-dev#32126)

0b35ec0

Dr-Irv deleted the issue32117 branch February 13, 2023 20:50
