Proposal: Shorter default Series/DataFrame repr when truncated #27000

jorisvandenbossche · 2019-06-22T13:41:00Z

There haven been previous attempts to reduce the length of the Series/DataFrame repr (pandas.options.display.max_rows), eg #20514. Related pandas-dev email: https://mail.python.org/pipermail/pandas-dev/2018-March/000732.html

In that discussion, I once made the following proposal to introduce two thresholds:

We have 2 thresholds instead of 1 (the current 'max_rows'): a number of
rows to show in a truncated repr, and a max number of rows to show
without truncating
For 'big' dataframes, we show a truncated repr. And then I would go even
lower than 20 and only show first/last 5 (so like a max_rows of 10)
For 'small' dataframes, we show the full dataframe without truncating, up
to the threshold.

We would still need to define those two thresholds. But for example, using the current max_rows of 60: we could show a full repr up to 60 rows, and once the number of rows > 60, we only show 10 (first/last 5).

You can then still set both thresholds at the same number (like 20, as in the linked PR above) to not get this variable behaviour.

This is actually similar to what numpy arrays do (but with a bigger threshold: eg np.random.randn(1000) shows all 1000 elements, np.random.randn(1001) shows the first/lst 3).
And it is also very similar to what R tibbles do: they have a "print_min" and "print_max" options with exactly this behaviour, only their "print_max" is lower (it's 10 and 20, respectively):

options(tibble.print_max = n, tibble.print_min = m): if there are more than
n rows, print only the first m rows. Use options(tibble.print_max = Inf)
to always show all rows.

The text was updated successfully, but these errors were encountered:

simonjayhawkins · 2019-06-23T08:18:47Z

so this would add two display options pandas.options.display.min_rows and pandas.options.display.min_columns

and two arguments, to to_string and to_html(notebook=True); min_rows and min_cols?

personally, since this proposal is display related and not so relevant to IO, I would prefer not to see the additional arguments to to_string and to_html

jorisvandenbossche · 2019-06-23T11:23:14Z

I would personally only start with min_rows (we could always add the columns one later if there is demand for it).

And also personally for me, I am fine with only adding it as a general display option for now, and not necessarily to to_string / to_html.

simonjayhawkins · 2019-06-23T11:54:05Z

+1 in that case.

TomAugspurger · 2019-06-23T19:31:12Z

+1 from me as well.

jorisvandenbossche · 2019-07-03T03:51:02Z

cc @pandas-dev/pandas-core we might still include this in 0.25.0. Any concerns about the above proposal?

shoyer · 2019-07-03T03:53:32Z

+1 sounds great to me

toobaz · 2019-07-03T08:44:16Z

+1 for me too

topper-123 · 2019-07-03T17:08:18Z

I'm +1.

jorisvandenbossche added the Output-Formatting __repr__ of pandas objects, to_string label Jun 22, 2019

jorisvandenbossche mentioned this issue Jun 28, 2019

Shorter truncated Series/DataFrame repr: introduce min_rows #27095

Merged

jreback added this to the 0.25.0 milestone Jun 28, 2019

jreback closed this as completed in #27095 Jul 3, 2019

jreback mentioned this issue Aug 2, 2021

ENH: A new method that will more efficiently display 'tall' df #42837

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: Shorter default Series/DataFrame repr when truncated #27000

Proposal: Shorter default Series/DataFrame repr when truncated #27000

jorisvandenbossche commented Jun 22, 2019

simonjayhawkins commented Jun 23, 2019

jorisvandenbossche commented Jun 23, 2019

simonjayhawkins commented Jun 23, 2019

TomAugspurger commented Jun 23, 2019

jorisvandenbossche commented Jul 3, 2019

shoyer commented Jul 3, 2019

toobaz commented Jul 3, 2019

topper-123 commented Jul 3, 2019

Proposal: Shorter default Series/DataFrame repr when truncated #27000

Proposal: Shorter default Series/DataFrame repr when truncated #27000

Comments

jorisvandenbossche commented Jun 22, 2019

simonjayhawkins commented Jun 23, 2019

jorisvandenbossche commented Jun 23, 2019

simonjayhawkins commented Jun 23, 2019

TomAugspurger commented Jun 23, 2019

jorisvandenbossche commented Jul 3, 2019

shoyer commented Jul 3, 2019

toobaz commented Jul 3, 2019

topper-123 commented Jul 3, 2019