Skip to content

[ArrowStringArray] REF: _str_startswith/_str_endswith #41487

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 17, 2021

Conversation

simonjayhawkins
Copy link
Member

using pyarrow native functions for _str_startswith/_str_endswith #41222 were merged before _str_contains #41217 so we can de-duplication some logic.

and also a change to _str_match for perf gain on object fallback

-      32.8±0.2ms       25.5±0.1ms     0.78  strings.Methods.time_match('arrow_string')

@simonjayhawkins simonjayhawkins added Refactor Internal refactoring of code Strings String extension data type and string data labels May 15, 2021
@simonjayhawkins simonjayhawkins added this to the 1.3 milestone May 15, 2021
@simonjayhawkins
Copy link
Member Author

AFAICT the coverage build uses pyarrow 4.0.0 so the extra loc for the fallback won't be hit but are tested in other envs

@jreback jreback merged commit 31ea5ed into pandas-dev:master May 17, 2021
@jreback
Copy link
Contributor

jreback commented May 17, 2021

thanks!

@simonjayhawkins simonjayhawkins deleted the tidy branch May 17, 2021 15:53
TLouf pushed a commit to TLouf/pandas that referenced this pull request Jun 1, 2021
JulianWgs pushed a commit to JulianWgs/pandas that referenced this pull request Jul 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Refactor Internal refactoring of code Strings String extension data type and string data
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants