PERF: refactor ExcelFormatter #11355

chris-b1 · 2015-10-17T15:02:57Z

use to_native_types by block to yield already formatted cells
yield cells in row-oriented way Openpyxl22 #11144 (comment)

The text was updated successfully, but these errors were encountered:

jreback · 2015-10-17T15:07:30Z

FYI, you might want to reach out o see if they can input these on a columnar basis to make this faster

xlsxwriter cc @jmcnamara
openpyxl cc @Themanwithoutaplan (already working on this)

Themanwithoutaplan · 2015-10-18T06:26:10Z

The output is row based so there has to be a conversion at some point from columns to rows. See https://bitbucket.org/snippets/openpyxl/jgbak for a barebones approach.

Will look to add support for Numpy types in a future version of openpyxl and using named styles for faster formatting.

calvinwyoung · 2018-02-09T05:36:32Z

Are there any plans to resolve this issue? We'd really love to be able to use constant_memory to output large XLSX files.

Themanwithoutaplan · 2018-02-09T10:50:40Z

@calvinwyoung this is possible in openpyxl using the dataframe_to_rows() function, which should also work with xlsxwriter. Having looked at the code for the Excelformatter I'm pretty sure that this will always be faster than using to_excel()

jmcnamara · 2018-02-09T15:21:39Z

@calvinwyoung One problem with using xlsxwriter's constant_memory mode from Pandas is that xlsxwriter doesn't support merged ranges across rows in that mode. As a result it would break the formatting for merged indices. If you specifically need constant_memory support you should probably look at writing the data from the dataframe yourself using xlsxwriter directly.

calvinwyoung · 2018-02-12T19:42:05Z

@Themanwithoutaplan @jmcnamara Thank you both for the feedback. It seems like neither dataframe_to_rows() nor constant_memory supports merged ranges across rows, but this is something I'd like to support if at all possible.

Is there a generally accepted strategy for supporting merged ranges and keeping memory consumption low? For our application, we're okay if the processing takes longer — it just so happens that the machines we're using are constrained by memory, and we aren't able to use swap.

Themanwithoutaplan · 2018-02-13T09:18:55Z

@calvinwyoung if you want to merge cells when writing straight to XML then you'll have to manage more of this yourself. Apart from the formatting it's not difficult. Details are out of scope for this ticket I think and should be discussed on the openpyxl mailing list.

calvinwyoung · 2018-02-14T00:54:07Z

Grea, thank you!

mroeschke · 2024-01-27T22:42:19Z

Seems like there's not been much development here and there doesn't seem to be a clear action item here so closing

jreback added Performance Memory or execution speed performance IO Excel read_excel, to_excel labels Oct 17, 2015

jreback added this to the Next Major Release milestone Oct 17, 2015

chris-b1 mentioned this issue Nov 30, 2015

ExcelWriter won't write any row but the last when used with XlsxWriter in constant_memory mode #11703

Closed

chris-b1 mentioned this issue Feb 14, 2017

DataFrame.to_excel with xlsxwriter and constant_memory makes most of the cells empty #15392

Open

mroeschke removed this from the Contributions Welcome milestone Oct 13, 2022

mroeschke closed this as completed Jan 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PERF: refactor ExcelFormatter #11355

PERF: refactor ExcelFormatter #11355

chris-b1 commented Oct 17, 2015

jreback commented Oct 17, 2015

Themanwithoutaplan commented Oct 18, 2015

calvinwyoung commented Feb 9, 2018

Themanwithoutaplan commented Feb 9, 2018

jmcnamara commented Feb 9, 2018

calvinwyoung commented Feb 12, 2018 •

edited

Loading

Themanwithoutaplan commented Feb 13, 2018

calvinwyoung commented Feb 14, 2018

mroeschke commented Jan 27, 2024

PERF: refactor ExcelFormatter #11355

PERF: refactor ExcelFormatter #11355

Comments

chris-b1 commented Oct 17, 2015

jreback commented Oct 17, 2015

Themanwithoutaplan commented Oct 18, 2015

calvinwyoung commented Feb 9, 2018

Themanwithoutaplan commented Feb 9, 2018

jmcnamara commented Feb 9, 2018

calvinwyoung commented Feb 12, 2018 • edited Loading

Themanwithoutaplan commented Feb 13, 2018

calvinwyoung commented Feb 14, 2018

mroeschke commented Jan 27, 2024

calvinwyoung commented Feb 12, 2018 •

edited

Loading