Commit 1ad2eff

PERF: StataWriter is slow
StataWriter wrote data using scalar operations. This has been replaced with numpy's internal binary writer (tofile). Changes needed to improve performance include:
* Vectorized pandas date to Stata date conversion
* Vectorized null padding
* Conversion of string columns to fixed width numpy string arrays
* Conversion to record array
1 parent e7e02e6 commit 1ad2eff
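
The approach described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: `to_records`, `str_widths`, `date_cols`, and the output file name are hypothetical, it assumes ASCII strings and daily dates (Stata counts daily dates from 1960-01-01), and it writes only the packed data rows, not a valid `.dta` header.

```python
import os
import tempfile

import numpy as np
import pandas as pd

# Stata daily dates are stored as days elapsed since this date.
STATA_EPOCH = np.datetime64("1960-01-01", "D")


def to_records(df, str_widths, date_cols):
    """Hypothetical helper: pack a DataFrame into a fixed-width record array.

    str_widths maps string column names to a byte width (assumed ASCII);
    date_cols is the set of datetime columns to store as Stata day counts.
    """
    fields = []
    for name in df.columns:
        if name in str_widths:
            # Fixed-width byte strings: numpy null-pads short values itself,
            # which replaces per-value padding in a Python loop.
            arr = np.array(df[name].tolist(), dtype="S%d" % str_widths[name])
        elif name in date_cols:
            # Vectorized date conversion: whole-column subtraction.
            days = df[name].to_numpy().astype("datetime64[D]") - STATA_EPOCH
            arr = days.astype("int64").astype("<i4")
        else:
            arr = df[name].to_numpy()
        fields.append((name, arr))

    # One packed record per row; the dtype fixes the on-disk layout.
    rec = np.empty(len(df), dtype=[(n, a.dtype) for n, a in fields])
    for n, a in fields:
        rec[n] = a
    return rec


df = pd.DataFrame({
    "id": np.array([1, 2], dtype="int32"),
    "name": ["ab", "cde"],
    "when": pd.to_datetime(["1960-01-02", "1960-01-11"]),
})
rec = to_records(df, str_widths={"name": 5}, date_cols={"when"})

# A single tofile call writes every row, instead of one write per value.
path = os.path.join(tempfile.mkdtemp(), "rows.bin")
rec.tofile(path)
```

The expected gain comes from moving the per-value work (date arithmetic, padding, byte packing) out of Python loops and into whole-column numpy operations, so the final write is one contiguous buffer.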

5 files changed: +164 -192 lines changed

doc/source/v0.15.0.txt (+2)
@@ -482,6 +482,8 @@ Performance
 - Performance improvements in ``Period`` creation (and ``PeriodIndex`` setitem) (:issue:`5155`)
 - Improvements in Series.transform for significant performance gains (revised) (:issue:`6496`)
 - Performance improvements in ``StataReader`` when reading large files (:issue:`8040`, :issue:`8073`)
+- Performance improvements in ``StataWriter`` when writing large files (:issue:`8079`)
+
 
 
 