ENH: pprint index/columns? #6295

jseabold · 2014-02-07T16:21:46Z

I haven't looked at the source yet, but do columns ever get pretty printed? It looks like indices do if it's a DatetimeIndex, but not otherwise.

I often find myself doing

df.columns.tolist()

otherwise it's not really readable.

The text was updated successfully, but these errors were encountered:

jreback · 2014-03-22T21:35:16Z

IIRC display.max_seq_items controls this....

@cpcloud ?

jseabold · 2014-03-22T21:40:58Z

It looks to me like it just truncates the columns with a .... I was thinking more of one column name per row or something that's easier to parse with the eyes.

jreback · 2014-03-22T21:43:00Z

can you post an example with how you think it should look?

jseabold · 2014-03-22T21:51:16Z

List printing usually works for me. Even the nice lined up columns of tab completion are easier to parse though.

[~/]                                                                            
[3]: df = pd.DataFrame(np.random.rand(20,4), columns=['A really long name', 'Another really long name', 'A third really long name', 'The last column name'])

[~/]
[4]: df.columns
[4]: Index([u'A really long name', u'Another really long name', u'A third really long name', u'The last column name'], dtype='object')

[~/]
[5]: df.columns.to<TAB>
df.columns.to_datetime      df.columns.tofile
df.columns.to_native_types  df.columns.tolist
df.columns.to_series        df.columns.tostring

[~/]
[5]: df.columns.tolist()
[5]: 
['A really long name',
'Another really long name',
'A third really long name',
'The last column name']

jankatins · 2014-03-24T14:21:52Z

Please, no linebreaks after each element: I usually use the output to copy it and use it as a row selection df[["col", "col2"]] and I was quite happy with this (I seem to remember that df.columns printed column names without quotes, so I had to always use list(df.columns))

This is what ipython currently produces:

In [7]: l = ["a", "b", "c"]

In [8]: l
Out[8]: ['a', 'b', 'c']

In [16]: df = pd.DataFrame(np.random.rand(40,8), columns=['A really long name', 'Another really long name', 'A third really long name', 'The last column name', "a","b","c","d"]
)

In [17]: list(df.columns)
Out[17]: 
['A really long name',
 'Another really long name',
 'A third really long name',
 'The last column name',
 'a',
 'b',
 'c',
 'd']

In [18]: df.columns
Out[18]: Index([u'A really long name', u'Another really long name', u'A third really long name', u'The last column name', u'a', u'b', u'c', u'd'], dtype='object')

jseabold · 2014-03-24T14:46:39Z

The counterargument is that you can always slice .columns, so you really don't need to do any copy and pasting. I'm mainly interested in some solution for readability. It's nearly impossible to see what's actually in the data and where, as I use this day to day. The counterargument to me I guess is just use info. Maybe a column_info property would be nice. I don't think it's worth crowing the namespace for though.

Both sides of this are really arguments for some kind of IDE with a variable browser window pane and tab-completion for selecting variable names... It's still the only thing I miss from using Stata.

jreback · 2014-03-24T14:56:42Z

This is what ipythons interact can do! (in ipython 2.0)....now to get someone to write it......(for frames)

jseabold · 2014-03-24T15:39:16Z

Well that's interesting. They'll be qtconsole/notebooks-only I take it? Is this the link? I'm way behind on ipython-dev.

TomAugspurger · 2014-03-24T15:45:01Z

It may be notebook only. I'm guessing the qt console doesn't handle javascript.

jreback · 2014-03-24T15:50:56Z

yah..i think notebook only

jreback · 2014-03-24T15:52:38Z

http://jakevdp.github.io/blog/2013/12/05/static-interactive-widgets/

I don't think it would be very hard to have as default a couple of sliders that show the extent of a frame (and then allow hiding/showing of various columns).

jreback · 2014-03-29T17:40:46Z

http://nbviewer.ipython.org/gist/rossant/9463955

proof of concept!

jreback · 2015-04-04T18:55:17Z

xref #9741

To do this right is actually a bit non-trivial. and will require some refactoring to gather all of the scattered bits of code.

jreback added the Output-Formatting label Mar 22, 2014

jreback added Enhancement Indexing Related to indexing on series/frames, not to indexes themselves labels Mar 8, 2015

jreback added this to the 0.16.1 milestone Mar 8, 2015

jreback added Difficulty Intermediate labels Apr 4, 2015

jreback mentioned this issue Apr 21, 2015

Index repr changes to make them consistent #9901

Merged

jreback closed this as completed in #9901 May 9, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: pprint index/columns? #6295

ENH: pprint index/columns? #6295

jseabold commented Feb 7, 2014

jreback commented Mar 22, 2014

jseabold commented Mar 22, 2014

jreback commented Mar 22, 2014

jseabold commented Mar 22, 2014

jankatins commented Mar 24, 2014

jseabold commented Mar 24, 2014

jreback commented Mar 24, 2014

jseabold commented Mar 24, 2014

TomAugspurger commented Mar 24, 2014

jreback commented Mar 24, 2014

jreback commented Mar 24, 2014

jreback commented Mar 29, 2014

jreback commented Apr 4, 2015

ENH: pprint index/columns? #6295

ENH: pprint index/columns? #6295

Comments

jseabold commented Feb 7, 2014

jreback commented Mar 22, 2014

jseabold commented Mar 22, 2014

jreback commented Mar 22, 2014

jseabold commented Mar 22, 2014

jankatins commented Mar 24, 2014

jseabold commented Mar 24, 2014

jreback commented Mar 24, 2014

jseabold commented Mar 24, 2014

TomAugspurger commented Mar 24, 2014

jreback commented Mar 24, 2014

jreback commented Mar 24, 2014

jreback commented Mar 29, 2014

jreback commented Apr 4, 2015