API: CategoricalDtype str, repr #17782

TomAugspurger · 2017-10-04T14:48:17Z

I think that str(cat.dtype) should be changed back to always being 'category'. I've seen several places where people use str(thing.dtype) == 'category' as a way to check for Categoricals. Even subtle things like this arrow PR would break.

So instead of

In [6]: str(pd.Categorical([1, 2, 3]).dtype)
Out[6]: 'CategoricalDtype(categories=[1, 2, 3], ordered=False)'

it would be

Out[6]: 'category'

We can leave __repr__ to be unambiguous.

The text was updated successfully, but these errors were encountered:

jreback · 2017-10-04T23:38:33Z

hmm, I only think you do do that for a CategoricalDtype(None, ordered=False).

TomAugspurger · 2017-10-05T02:09:33Z

Any reason you would do it just for when categories is None categories?

To be clear, in most places like the output in the console, in the Series / DataFrame repr, you'll still have the informative CategoricalDtype(categories=...) repr. It's only when you call str(x.dtype) that you get 'category'.

jorisvandenbossche · 2017-10-05T07:16:11Z

Possible option is also to only change str back to 'category', and keeping repr as it is now. That gives the more informative repr in the console, but doesn't break code that used str(dtype).
But, that of course hides a bit that you can do str(dtype) == 'category' (but that is maybe not a bad thing? as we want them to use pd.api.types.is_categorical ?)

jorisvandenbossche · 2017-10-05T07:16:54Z

Ah, I see that is what you have done in the PR (and stated in the top post)! :-) Should have looked there first and read better.

TomAugspurger added the Categorical Categorical Data Type label Oct 4, 2017

TomAugspurger added this to the 0.21.0 milestone Oct 4, 2017

This was referenced Oct 4, 2017

API: Change str for CategoricalDtype to category #17783

Merged

API: CategoricalDtype str, repr #17781

Closed

jreback closed this as completed in #17783 Oct 5, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: CategoricalDtype str, repr #17782

API: CategoricalDtype str, repr #17782

TomAugspurger commented Oct 4, 2017

jreback commented Oct 4, 2017

TomAugspurger commented Oct 5, 2017

jorisvandenbossche commented Oct 5, 2017

jorisvandenbossche commented Oct 5, 2017 •

edited

Loading

API: CategoricalDtype str, repr #17782

API: CategoricalDtype str, repr #17782

Comments

TomAugspurger commented Oct 4, 2017

jreback commented Oct 4, 2017

TomAugspurger commented Oct 5, 2017

jorisvandenbossche commented Oct 5, 2017

jorisvandenbossche commented Oct 5, 2017 • edited Loading

jorisvandenbossche commented Oct 5, 2017 •

edited

Loading