REGR: Notebook (html) repr of DataFrame no longer follows min_rows/max_rows settings #37363

ivanovmg · 2020-10-23T11:16:39Z

closes REGR: Notebook (html) repr of DataFrame no longer follows min_rows/max_rows settings #37359
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

Fix the display logic for the number of rows shown,
as explained here: https://pandas.pydata.org/docs/dev/user_guide/options.html#frequently-used-options
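
For reference, the rule described on the linked options page can be sketched in plain Python. This is only an illustrative sketch, not the code in this PR: the function name `rows_to_show` and its arguments are hypothetical, and it assumes `max_rows` and `min_rows` behave like the `display.max_rows` and `display.min_rows` options.

```python
from typing import Optional


def rows_to_show(
    frame_length: int, max_rows: Optional[int], min_rows: Optional[int]
) -> Optional[int]:
    """Hypothetical helper: row limit for a (possibly truncated) repr.

    Assumed rule: a frame that fits within max_rows is shown in full;
    once it exceeds max_rows, the truncated repr shows min_rows rows
    (never more than max_rows).
    """
    if max_rows and min_rows and frame_length > max_rows:
        return min(min_rows, max_rows)
    return max_rows


print(rows_to_show(20, 30, 10))   # frame fits: limit stays at max_rows
print(rows_to_show(100, 60, 10))  # frame too long: limit drops to min_rows
```

Under this reading, a frame only slightly longer than `max_rows` (say 61 rows with `max_rows=60`) is truncated to `min_rows`, which is the edge case the added tests exercise.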

ivanovmg · 2020-10-23T11:17:36Z

@jorisvandenbossche can you please confirm it works? I am still unable to install a dev environment with Jupyter (even with conda).

ivanovmg · 2020-10-23T17:02:07Z

I managed to install a dev env with Jupyter. Everything appears to work, but please check in your environment as well.

jorisvandenbossche · 2020-10-23T19:09:30Z

I can confirm this works, thanks!

jorisvandenbossche

Thanks for the quick follow-up!

jorisvandenbossche · 2020-10-23T19:10:00Z

pandas/io/formats/format.py

        else:
            max_rows = self.max_rows

+        return self._adjust_max_rows(max_rows)
+
+    def _adjust_max_rows(self, max_rows: Optional[int]) -> Optional[int]:


I don't think it is worth creating a separate function for those few lines; I think it fits fine in the function above, as it was before.

I thought some explanation should be added of why max_rows is still adjusted and compared with the frame length. It could be a comment, but a separate function with a docstring is better, isn't it?

jorisvandenbossche · 2020-10-23T19:12:21Z

pandas/io/formats/format.py

+                return height - self._get_number_of_auxillary_rows()
+
+            max_rows: Optional[int]
+            if self._is_screen_short(height):


What is this screen_short doing? I think it is rather about the dataframe being longer than the screen, and that we want to show all rows possible

    def _is_screen_short(self, max_height) -> bool:
        return bool(self.max_rows == 0 and len(self.frame) > max_height)

When in a terminal (max_rows == 0), if the screen (terminal) height is not enough to accommodate the whole dataframe, then the screen is short.

If the logic can be improved, please suggest how. My refactoring mainly consisted of removing comments and extracting methods with the same or similar names. Apparently some logic got lost along the way, and that was not properly covered by the tests.
I think we can add a system test with the string representation as well, once we see that the present behavior is correct.

Code snippet before refactoring (regarding _is_screen_short).

    # Format only rows and columns that could potentially fit the
    # screen
    if max_cols == 0 and len(self.frame.columns) > w:
        max_cols = w
    if max_rows == 0 and len(self.frame) > h:
        max_rows = h
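
The row check in the pre-refactoring snippet can be read as a small standalone function. The following is a sketch only: `clamp_to_terminal` is a hypothetical name, the terminal height is passed in explicitly, and the subtraction of auxiliary rows seen in the diff is left out for brevity.

```python
import shutil
from typing import Optional


def clamp_to_terminal(
    max_rows: Optional[int], frame_length: int, terminal_height: int
) -> Optional[int]:
    """Hypothetical helper mirroring the old inline check.

    max_rows == 0 is taken to mean "fit the terminal": if the frame
    has more rows than the terminal can show, cap at the terminal height.
    """
    if max_rows == 0 and frame_length > terminal_height:
        return terminal_height
    return max_rows


# shutil.get_terminal_size() falls back to (80, 24) when no terminal is attached.
height = shutil.get_terminal_size().lines
print(clamp_to_terminal(0, 1000, height))
```

Any non-zero `max_rows` passes through unchanged here, so the terminal cap only applies in the `max_rows == 0` case discussed in this thread.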

jorisvandenbossche · 2020-10-23T19:14:57Z

pandas/tests/io/formats/test_format.py

+            min_rows=min_rows,
+        )
+        result = formatter.max_rows_fitted
+        assert result == expected


This is actually already tested (see #37359 (comment)), so I am not sure it is necessary to add additional tests. But the problem is that the existing test was not catching the regression (and I suppose the tests above would not have caught it either?)

I will try to think about another way to test it

These tests do fail on master

pandas\tests\io\formats\test_format.py:2059: AssertionError
=========================== short test summary info ===========================
FAILED pandas/tests/io/formats/test_format.py::TestDataFrameFormatting::test_max_rows_fitted[50-30-10-10] - assert 30 == 10
FAILED pandas/tests/io/formats/test_format.py::TestDataFrameFormatting::test_max_rows_fitted[100-60-10-10] - assert 60 == 10
FAILED pandas/tests/io/formats/test_format.py::TestDataFrameFormatting::test_max_rows_fitted[61-60-10-10] - assert 60 == 10
======================== 3 failed, 206 passed in 4.35s ========================

jreback · 2020-10-23T19:14:52Z

pandas/io/formats/format.py

+                # rows available to fill with actual data
+                return height - self._get_number_of_auxillary_rows()
+
+            max_rows: Optional[int]


can you type this at the top to make it less cluttered

I also think that splitting even further would be better.

    if self._is_in_terminal():
        return self._get_max_rows_fitted_in_terminal()
    else:
        return self._adjust_max_rows(self.max_rows)

Probably in another PR?

jreback · 2020-10-31T19:03:58Z

also pls rebase

jreback · 2020-11-04T02:56:09Z

@ivanovmg can you merge master.

@jorisvandenbossche if any comments.

simonjayhawkins · 2020-11-12T10:50:47Z

@ivanovmg can you merge master one more time and this can be merged cc @jorisvandenbossche

ivanovmg · 2020-11-12T13:43:26Z

@ivanovmg can you merge master one more time and this can be merged cc @jorisvandenbossche

Done, it's green.

jreback · 2020-11-13T05:45:57Z

thanks @ivanovmg

ivanovmg added 2 commits October 23, 2020 18:07

TST: add failing test

1b252d8

FIX: fix display logic

74dd429

simonjayhawkins added the IO HTML, Output-Formatting, and Regression labels Oct 23, 2020

simonjayhawkins changed the title ~~FIX:~~ REGR: Notebook (html) repr of DataFrame no longer follows min_rows/max_rows settings Oct 23, 2020

simonjayhawkins added this to the 1.2 milestone Oct 23, 2020

TST: add test cases

2bb25cb

ivanovmg requested a review from jorisvandenbossche October 23, 2020 14:22

jorisvandenbossche reviewed Oct 23, 2020

ivanovmg requested a review from jorisvandenbossche October 28, 2020 16:27

jreback requested changes Oct 31, 2020

ivanovmg added 2 commits November 1, 2020 03:05

Merge branch 'master' into bug_37359

59bf494

TYP: type max_rows on top of the method

6ff4f91

ivanovmg requested a review from jreback November 1, 2020 04:05

jreback approved these changes Nov 4, 2020

Merge branch 'master' into bug_37359

fd20881

Merge branch 'master' into bug_37359

fca7749

jreback merged commit b966657 into pandas-dev:master Nov 13, 2020

ivanovmg deleted the bug_37359 branch November 13, 2020 05:53
