Excelfancy #2370

jassinm · 2012-11-27T20:23:26Z

adds to export dataframe for excel:
- multiindex (merge cells similar to htmlformatter)
- border
- bold header
- ability to add dataframe in same sheet (startrow, startcol)

http://cl.ly/image/2r102L0E1l23

solves Issue #2294

Conflicts: pandas/src/parse_helper.h pandas/src/parser/tokenizer.c

wesm · 2012-11-27T20:28:07Z

Oh that's very nice. Would you might sprinkling in some test cases?

changhiskhan · 2012-11-27T20:30:29Z

+1

jassinm · 2012-11-27T20:31:28Z

cool. sure I tried to add but ./test_fast.h

======================================================================
ERROR: Failure: ImportError (C extensions not built: if you installed already verify that you are not importing from the source directory)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/site-packages/nose/loader.py", line 390, in loadTestsFromName
    addr.filename, addr.module)
  File "/usr/local/lib/python2.7/site-packages/nose/importer.py", line 39, in importFromPath
    return self.importFromDir(dir_path, fqname)
  File "/usr/local/lib/python2.7/site-packages/nose/importer.py", line 86, in importFromDir
    mod = load_module(part_fqname, fh, filename, desc)
  File "/Users/locojay/Documents/Dev/github/pandas/pandas/__init__.py", line 16, in <module>
    raise ImportError('C extensions not built: if you installed already '
ImportError: C extensions not built: if you installed already verify that you are not importing from the source directory

----------------------------------------------------------------------
Ran 1 test in 0.001s

FAILED (errors=1)

not sure if related to the osx build issue which was solved over weekend

wesm · 2012-11-27T20:37:57Z

try building the extensions in place : python setup.py build_ext --inplace

wesm · 2012-11-27T20:38:11Z

Shortcut for that is make tseries

jassinm · 2012-11-27T20:46:51Z

thanks works wil add some tests

jassinm · 2012-11-28T23:51:37Z

all frame test are green
excel export looks like html export when dealing with index_labels (not in same row as columns)
small fix to reader, functionality to deal with new index label style

What is missing is to add to the Reader the functionality to read multindex columns (pivot dumps....). Maybe a feature for the future

wesm · 2012-11-29T00:49:19Z

Great, thanks. I'll go ahead and merge this-- if you get a chance to write some docs, go for it in a new PR

wesm · 2012-11-29T01:12:16Z

There are test failures in this on my machine. I'll hack it to work-- just checking you ran all ~2600 tests from test_fast.sh? (or nosetests pandas)

jassinm · 2012-11-29T02:19:47Z

was just running nosetests -s pandas.tests.test_frame apologies
I get 2 fails:
pandas.stats.tests.test_ols.TestOLS (not sure if related)
pandas.tests.test_panel.TestPanel:test_to_excel.

This one is due to the handling of the Excel Reader when dealing with index labels :

The new dump does it like the HTML dumper and offset by 1 row if an index has a label.
I assumed that if a row has no values in the columns its an indexname...
Any thoughts one what would be a good way to deal with this so one can handle parsing frames having no index label but an empty first value row? add an argument to the parser (has_indexname?), check type of column if first element is different must be an indexname

…ls not in the same row as columnnames has_index_labels: boolean, default False True if the cols defined in index_col have an index name and are not in the header

jassinm · 2012-11-29T17:01:34Z

added argument has_index_labels to the reader to handle index_labels not in the same row as the column header.
people using the parser don't need to add any argument if they have files dumped in the previous layout.
for the new layout (index labels offseted by one row) one needs to add has_index_labels=True
all but one tests passes when running nosetets pandas. This is unrelated has it occurs in master (fbd77d5) (pandas.stats.tests.test_ols.TestOLS:testWLS)

wesm · 2012-11-29T17:06:46Z

Thanks a ton. You can get rid of that error by upgrading to the latest development version of statsmodels

jassinm · 2012-11-29T17:11:11Z

great thanks all green now

wesm · 2012-11-29T20:08:53Z

If you get a chance to write some documentation (look in docs/source), that'd be great!

ghost · 2012-11-29T22:57:58Z

@wesm, Looks like xlwt was introduced as a hard dependency, xlwt doesn't install on python3,
and the port to python3 xlwt3 is abandonware. release-blocker?

changhiskhan · 2012-11-30T18:11:53Z

So there seems to be a bug in openpyxl 1.5.8 where the cell styles don't get read in correctly but they do get written correctly still.

blounsbury-usbr · 2013-04-27T20:03:46Z

Just wanted to say that although the original posters screenshot (though missing the right border of the last column header) looks alright, it shouldn't mean that everyone else be forced to use his style preference.

I would request that the default output be changed back the way it has been, and an additional to_excel parameter be added for those who wish to use this style preference. I don't want my output formatted this way.

locojaydev added 6 commits November 21, 2012 16:14

excel format

b1e916e

excel format

bce9118

excel format

b178066

adding na_repl, cols argument to excel formatter

afde3f2

adding float_format to ExcelFormatter

d354267

Merge branch 'master' into excelfancy

c99dc49

Conflicts: pandas/src/parse_helper.h pandas/src/parser/tokenizer.c

locojaydev added 4 commits November 28, 2012 18:30

excelformatter handles multiindex, aliases

f13d093

hadling all attributes

9ae35f8

reader bug fix (colnames was None.1,....), datetime hadling, period

5138bdc

adding styling test

c1708b2

adding argument has index_labels to excel reader to handle index_labe…

389da90

…ls not in the same row as columnnames has_index_labels: boolean, default False True if the cols defined in index_col have an index name and are not in the header

ghost assigned wesm Nov 29, 2012

wesm added a commit that referenced this pull request Nov 29, 2012

ENH: rename index_labels to index_names. #2370

749318f

wesm merged commit 389da90 into pandas-dev:master Nov 29, 2012

changhiskhan mentioned this pull request Nov 30, 2012

tests fail if xlwt is not installed. #2395

Closed

This was referenced Dec 8, 2012

New Excel changes cause an extra line to be generated in the Excel file #2396

Closed

New Excel functionality #2478

Closed

ghost mentioned this pull request May 18, 2013

ENH: allow to_csv to write multi-index columns, read_csv to read with header=list arg #3575

Merged

ghost mentioned this pull request Feb 7, 2014

BUG: misplaced index_label with DF.to_excel() #6260

Closed

Uh oh!

Excelfancy #2370

Excelfancy #2370

Uh oh!

Conversation

jassinm commented Nov 27, 2012

Uh oh!

wesm commented Nov 27, 2012

Uh oh!

changhiskhan commented Nov 27, 2012

Uh oh!

jassinm commented Nov 27, 2012

Uh oh!

wesm commented Nov 27, 2012

Uh oh!

wesm commented Nov 27, 2012

Uh oh!

jassinm commented Nov 27, 2012

Uh oh!

jassinm commented Nov 28, 2012

Uh oh!

wesm commented Nov 29, 2012

Uh oh!

wesm commented Nov 29, 2012

Uh oh!

jassinm commented Nov 29, 2012

Uh oh!

jassinm commented Nov 29, 2012

Uh oh!

wesm commented Nov 29, 2012

Uh oh!

jassinm commented Nov 29, 2012

Uh oh!

wesm commented Nov 29, 2012

Uh oh!

ghost commented Nov 29, 2012

Uh oh!

changhiskhan commented Nov 30, 2012

Uh oh!

blounsbury-usbr commented Apr 27, 2013

Uh oh!

Uh oh!