BUG: to_latex outputs string with missing second index level values #14484

the-alleged-car · 2016-10-24T18:55:28Z

I am using pandas to generate a LaTeX string using the to_latex() method on a DataFrame, which is indexed using a MultiIndex object. Running the code snippet produces an incorrect list of strings: the LaTeX table is missing two index numbers.

Code Snippet

import pandas as pd

outliers_lst = [(23240, 0),
                 (23240, 15),
                 (23240, 23),
                 (23240, 31),
                 (23240, 85),
                 (38661, 85),
                 (41231, 85),
                 (41231, 92),
                 (46371, 0)]

headers = (['max', 'EC 1', 'S'],
             ['max', 'EC 1', 'A'],
             ['max', 'EC 2', 'S'])

table = pd.DataFrame("",index = pd.MultiIndex.from_tuples(sorted(outliers_lst)), columns = pd.MultiIndex.from_tuples(headers))
table.to_latex(index = True, longtable = True, column_format = 'c'*5).split('\n')

Incorrect Output

[u'\\begin{longtable}{cccccccccccccccccccccccccc}',
 u'\\toprule',
 u'      &    &  max &   &      \\\\',
 u'      &    & EC 1 &   & EC 2 \\\\',
 u'      &    &    S & A &    S \\\\',
 u'\\midrule',
 u'\\endhead',
 u'\\midrule',
 u'\\multicolumn{3}{r}{{Continued on next page}} \\\\',
 u'\\midrule',
 u'\\endfoot',
 u'',
 u'\\bottomrule',
 u'\\endlastfoot',
 u'23240 & 0  &      &   &      \\\\',
 u'      & 15 &      &   &      \\\\',
 u'      & 23 &      &   &      \\\\',
 u'      & 31 &      &   &      \\\\',
 u'      & 85 &      &   &      \\\\',
 u'38661 &    &      &   &      \\\\',
 u'41231 &    &      &   &      \\\\',
 u'      & 92 &      &   &      \\\\',
 u'46371 & 0  &      &   &      \\\\',
 u'\\end{longtable}',
 u'']

Correct Output

[u'\\begin{longtable}{cccccccccccccccccccccccccc}',
 u'\\toprule',
 u'      &    &  max &   &      \\\\',
 u'      &    & EC 1 &   & EC 2 \\\\',
 u'      &    &    S & A &    S \\\\',
 u'\\midrule',
 u'\\endhead',
 u'\\midrule',
 u'\\multicolumn{3}{r}{{Continued on next page}} \\\\',
 u'\\midrule',
 u'\\endfoot',
 u'',
 u'\\bottomrule',
 u'\\endlastfoot',
 u'23240 & 0  &      &   &      \\\\',
 u'      & 15 &      &   &      \\\\',
 u'      & 23 &      &   &      \\\\',
 u'      & 31 &      &   &      \\\\',
 u'      & 85 &      &   &      \\\\',
 u'38661 & 85 &      &   &      \\\\',
 u'41231 & 85 &      &   &      \\\\',
 u'      & 92 &      &   &      \\\\',
 u'46371 & 0  &      &   &      \\\\',
 u'\\end{longtable}',
 u'']

Note that in the correct output LaTeX strings, the rows with indices (38661, 85) and (41231, 85) correctly include the second index (the number 85), but in the incorrect LaTeX strings the rows do not include the number 85.

Could this be because the row (23240, 85) above (38661, 85) includes 85 in its second index?

commit: None python: 2.7.12.final.0 python-bits: 64 OS: Windows OS-release: 7 machine: AMD64 processor: Intel64 Family 6 Model 60 Stepping 3, GenuineIntel byteorder: little LC_ALL: None LANG: None

pandas: 0.18.1
nose: 1.3.7
pip: 8.1.2
setuptools: 23.0.0
Cython: 0.24
numpy: 1.11.1
scipy: 0.17.1
statsmodels: 0.6.1
xarray: None
IPython: 4.2.0
sphinx: 1.4.1
patsy: 0.4.1
dateutil: 2.5.3
pytz: 2016.4
blosc: None
bottleneck: 1.1.0
tables: 3.2.2
numexpr: 2.6.0
matplotlib: 1.5.1
openpyxl: 2.3.2
xlrd: 1.0.0
xlwt: 1.1.2
xlsxwriter: 0.9.2
lxml: 3.6.0
bs4: 4.4.1
html5lib: None
httplib2: None
apiclient: None
sqlalchemy: 1.0.13
pymysql: 0.7.6.None
psycopg2: None
jinja2: 2.8
boto: 2.40.0
pandas_datareader: None

The text was updated successfully, but these errors were encountered:

jorisvandenbossche · 2016-10-24T21:22:39Z

@the-alleged-car That indeed looks like a bug in the multi-index handling (not printing consecutive values should only happen for the same values of the previous level). Thanks for the report!

Smaller reproducible example:

In [18]: df = pd.DataFrame(index=pd.MultiIndex.from_tuples([('A', 'c'), ('B', 'c')]), columns=['col'])

In [19]: print(df.to_latex())
\begin{tabular}{lll}
\toprule
  &   &  col \\
\midrule
A & c &  NaN \\
B &   &  NaN \\
\bottomrule
\end{tabular}

jorisvandenbossche · 2016-10-24T21:29:54Z

@the-alleged-car If you want to take a look how to fix it, always welcome!

enriquefernandez · 2017-10-28T23:38:12Z

This just bit me as well.
Any known workarounds for the moment?

Closes pandas-devgh-14484 Closes pandas-devgh-17499

* BUG: LatexFormatter.write_result multi-index Fixed GH issue 14484: `LatexFormatter.write_result`` now does not print blanks if a higher-order index differs from the previous row. Also added testcase for this. * MAINT: Address reviewer comments Closes gh-14484 Closes gh-17499

* BUG: LatexFormatter.write_result multi-index Fixed GH issue 14484: `LatexFormatter.write_result`` now does not print blanks if a higher-order index differs from the previous row. Also added testcase for this. * MAINT: Address reviewer comments Closes pandas-devgh-14484 Closes pandas-devgh-17499

* BUG: LatexFormatter.write_result multi-index Fixed GH issue 14484: `LatexFormatter.write_result`` now does not print blanks if a higher-order index differs from the previous row. Also added testcase for this. * MAINT: Address reviewer comments Closes gh-14484 Closes gh-17499

jorisvandenbossche added Bug MultiIndex IO LaTeX to_latex labels Oct 24, 2016

jorisvandenbossche changed the title ~~DataFrame.to_latex() outputs string with missing second index values~~ BUG: to_latex outputs string with missing second index level values Oct 24, 2016

MaximilianKoestler mentioned this issue Sep 12, 2017

BUG: LatexFormatter.write_result multi-index #17499

Closed

4 tasks

jreback mentioned this issue Oct 23, 2017

to_latex wrong \multicolumn count #17959

Closed

rdturnermtl mentioned this issue Nov 16, 2017

to_latex bug for midrule location #18326

Closed

gfyoung added a commit to forking-repos/pandas that referenced this issue Dec 8, 2017

MAINT: Address reviewer comments

3d8275e

Closes pandas-devgh-14484 Closes pandas-devgh-17499

gfyoung mentioned this issue Dec 8, 2017

BUG: LatexFormatter.write_result multi-index #18685

Merged

jreback added this to the 0.21.1 milestone Dec 8, 2017

jreback closed this as completed in #18685 Dec 8, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: to_latex outputs string with missing second index level values #14484

BUG: to_latex outputs string with missing second index level values #14484

the-alleged-car commented Oct 24, 2016 •

edited

Loading

jorisvandenbossche commented Oct 24, 2016

jorisvandenbossche commented Oct 24, 2016

enriquefernandez commented Oct 28, 2017

BUG: to_latex outputs string with missing second index level values #14484

BUG: to_latex outputs string with missing second index level values #14484

Comments

the-alleged-car commented Oct 24, 2016 • edited Loading

Code Snippet

Incorrect Output

Correct Output

jorisvandenbossche commented Oct 24, 2016

jorisvandenbossche commented Oct 24, 2016

enriquefernandez commented Oct 28, 2017

the-alleged-car commented Oct 24, 2016 •

edited

Loading