Extremely slow construction of large DataFrames out of ndarrays? #8161
Labels
API Design
Duplicate Report
Duplicate issue or pull request
Performance
Memory or execution speed performance
I was trying to figure out what's the most efficient way to create a dataframe out of a large numpy record array (I initially naively thought this could be done in a zero-copy way) and ran some simple tests:
I wonder what's the DataFrame constructor doing for several seconds straight even if it's provided with a materialized
ndarray
(note that copying both arrays takes < 50ms)? This is on v0.14.1.P.S. Is there a more efficientway to make a dataframe out of a recarray?
The text was updated successfully, but these errors were encountered: