BUG: CategoricalIndex.format #35440

topper-123 · 2020-07-28T22:28:20Z

closes BUG: output of df.to_string depends on whether columns is a CategoricalIndex or not #35439
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

I've temporarily put the whatsnewentry in the v.1.1.0 release note, because there isn't a v.1.1.1 version yet. I'll move it, before this is merged.

simonjayhawkins · 2020-07-30T20:34:13Z

Thanks @topper-123 for the PR.

The regression was caused by #35118. Categorical types other than object were also affected. maybe need to parameterise test with other values for cols

>>> pd.__version__
'1.2.0.dev0+10.g3b1d4f1ee'
>>>
>>> data = [[4, 2], [3, 2], [4, 3]]
>>> cols = [1, None]
>>> res = pd.DataFrame(data, columns=cols)
>>> print(res)
   1  NaN
0  4    2
1  3    2
2  4    3
>>>
>>> res = pd.DataFrame(data, columns=pd.CategoricalIndex(cols))
>>> print(res)
   1    NaN
0    4    2
1    3    2
2    4    3
>>>

>>> pd.__version__
'1.0.5'
>>>
>>> data = [[4, 2], [3, 2], [4, 3]]
>>> cols = [1, None]
>>> res = pd.DataFrame(data, columns=cols)
>>> print(res)
   1  NaN
0  4    2
1  3    2
2  4    3
>>>
>>> res = pd.DataFrame(data, columns=pd.CategoricalIndex(cols))
>>> print(res)
   1.0  NaN
0    4    2
1    3    2
2    4    3
>>>

I've temporarily put the whatsnewentry in the v.1.1.0 release note, because there isn't a v.1.1.1 version yet.

doc\source\whatsnew\v1.1.1.rst now merged to master

topper-123 · 2020-08-02T12:51:41Z

pandas/core/indexes/range.py

@@ -197,9 +197,6 @@ def _format_data(self, name=None):
        # we are formatting thru the attributes
        return None

-    def _format_with_header(self, header, na_rep="NaN") -> List[str]:
-        return header + [pprint_thing(x) for x in self._range]
-


The added tests revealed that this method in master made the output from RangeIndex.format different than for Int64Index.format:

>>> pd.RangeIndex(0, 18, 2).format() ['0', '2', '4', '6', '8', '10', '12', '14', '16'] >>> pd.Int64Index(range(0, 18, 2)).format() ['0 ', '2 ', '4 ', '6 ', '8 ', '10', '12', '14', '16']

Notice the extra space for one-digit scalars in the Int64Index case. The outputs from the two methods are identical after merging this PR.

topper-123 · 2020-08-02T14:00:34Z

Updated.

jreback · 2020-08-03T23:49:01Z

thanks @topper-123 very nice!

simonjayhawkins · 2020-08-04T09:54:38Z

@meeseeksdev backport to 1.1.x

Co-authored-by: Terji Petersen <[email protected]>

simonjayhawkins added Categorical Categorical Data Type Output-Formatting __repr__ of pandas objects, to_string labels Jul 30, 2020

simonjayhawkins added this to the 1.1.1 milestone Jul 30, 2020

topper-123 added 2 commits August 2, 2020 12:40

BUG: CategoricalIndex.format

c23e424

Add tests for Index.format

cd8a9ee

topper-123 force-pushed the categorical_df_to_string branch from eb920f9 to cd8a9ee Compare August 2, 2020 12:47

topper-123 commented Aug 2, 2020

View reviewed changes

topper-123 added 2 commits August 2, 2020 13:54

add GH numbers

309573c

flake8 cleanup

2da31af

remove whatsnew entry in v.1.1.0 + add new entry in v.1.1.1

5c37450

jreback merged commit cda8284 into pandas-dev:master Aug 3, 2020

topper-123 deleted the categorical_df_to_string branch August 4, 2020 06:07

simonjayhawkins added the Still Needs Manual Backport label Aug 4, 2020

meeseeksmachine mentioned this pull request Aug 4, 2020

Backport PR #35440 on branch 1.1.x (BUG: CategoricalIndex.format) #35539

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Aug 4, 2020

Backport PR pandas-dev#35440: BUG: CategoricalIndex.format

abd6492

simonjayhawkins removed the Still Needs Manual Backport label Aug 4, 2020

simonjayhawkins pushed a commit that referenced this pull request Aug 4, 2020

Backport PR #35440: BUG: CategoricalIndex.format (#35539)

1ab59a8

Co-authored-by: Terji Petersen <[email protected]>

topper-123 mentioned this pull request Aug 13, 2020

PERF: RangeIndex.format performance #35712

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: CategoricalIndex.format #35440

BUG: CategoricalIndex.format #35440

Uh oh!

topper-123 commented Jul 28, 2020 •

edited

Loading

Uh oh!

simonjayhawkins commented Jul 30, 2020

Uh oh!

topper-123 Aug 2, 2020 •

edited

Loading

Uh oh!

topper-123 commented Aug 2, 2020

Uh oh!

jreback commented Aug 3, 2020

Uh oh!

simonjayhawkins commented Aug 4, 2020

Uh oh!

Uh oh!

Uh oh!

BUG: CategoricalIndex.format #35440

BUG: CategoricalIndex.format #35440

Uh oh!

Conversation

topper-123 commented Jul 28, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonjayhawkins commented Jul 30, 2020

Uh oh!

topper-123 Aug 2, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

topper-123 commented Aug 2, 2020

Uh oh!

jreback commented Aug 3, 2020

Uh oh!

simonjayhawkins commented Aug 4, 2020

Uh oh!

Uh oh!

topper-123 commented Jul 28, 2020 •

edited

Loading

topper-123 Aug 2, 2020 •

edited

Loading