PERF: concat of pyarrow string array does not rechunk #42357
Labels
Arrow
pyarrow functionality
Enhancement
Needs Discussion
Requires discussion from core team before further action
Performance
Memory or execution speed performance
Strings
String extension data type and string data
If you naively concat series, then the pyarrow storage is not-rechunked. This can lead to dramatic performance issues.
e.g. you get 100k chunks above, vs 1 in the astype operation.
The text was updated successfully, but these errors were encountered: