[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype #40998

simonjayhawkins · 2021-04-17T11:41:39Z

this also fixes Interval, but that test doesn't fail in #39908 for a single StringDtype parameterized by storage.

…type

…2Dtype]

jreback · 2021-04-19T14:04:26Z

pandas/core/arrays/string_arrow.py

+            # numerical issues with Float32Dtype
+            na_values = scalars._mask
+            result = scalars._data
+            result = lib.ensure_string_array(result, copy=False, convert_na_value=False)


umm this doesn't copy? (its fine but then you are mutating)

that's not a change in this PR. will need to look back why we do this.

updated to be consistent with StringArray. I guess that the assumption was that a copy would probably have been created in most cases (unless an object array was passed) and explicitly avoid the additional copy made in ensure_string_array since that does not track if a copy/new array has been already been created before making a copy. The default for copy is False.

jorisvandenbossche · 2021-04-20T08:05:13Z

pandas/core/arrays/string_arrow.py

+            na_values = scalars._mask
+            result = scalars._data
+            result = lib.ensure_string_array(result, copy=False, convert_na_value=False)
+            result[na_values] = ArrowStringDtype.na_value


Alternatively, might also be able to do pa.array(result, mask=mask, type=pa.string()), then pyarrow will use the mask for missing values, and we don't need the setitem operation.

jreback · 2021-04-30T17:24:29Z

thanks @simonjayhawkins

…as-dev#40998)

simonjayhawkins added 14 commits March 29, 2021 14:51

TST: [ArrowStringArray] more parameterised testing - part 1

3bb9750

Merge remote-tracking branch 'upstream/master' into nullable_string_d…

acfb5f5

…type

revert changes to pandas/tests/frame/methods/test_astype.py

98b3a5f

Merge remote-tracking branch 'upstream/master' into nullable_string_d…

56d3717

…type

undo inference change

c095cd4

Merge remote-tracking branch 'upstream/master' into nullable_string_d…

88b05e8

…type

undo unrelated changes

f337018

StringArray

d861895

Merge remote-tracking branch 'upstream/master' into test_astype_string

1f35f06

revert changes to StringArray.astype. fixed in pandas-dev#40450

dd59832

test_floating.py::TestCasting::test_astype_string[arrow_string-Float3…

8dddaef

…2Dtype]

test_interval.py::TestCasting::test_astype_string[arrow_string]

770b018

Merge remote-tracking branch 'upstream/master' into test_astype_string

e391075

tidy diff

2d6c835

simonjayhawkins added the Strings String extension data type and string data label Apr 17, 2021

simonjayhawkins added this to the 1.3 milestone Apr 17, 2021

simonjayhawkins mentioned this pull request Apr 17, 2021

[ArrowStringArray] API: StringDtype parameterized by storage (python or pyarrow) #39908

Merged

4 tasks

jreback requested changes Apr 19, 2021

View reviewed changes

jorisvandenbossche reviewed Apr 20, 2021

View reviewed changes

jorisvandenbossche changed the title ~~[ArrowStringArray] fix test_astype_string for Float32Dtype~~ [ArrowStringArray] BUG: fix test_astype_string for Float32Dtype Apr 20, 2021

simonjayhawkins added 3 commits April 29, 2021 13:55

Merge remote-tracking branch 'upstream/master' into test_astype_string

c28036c

copy=copy

e211b75

pass mask to pyarrow constructor

45b93c6

jorisvandenbossche approved these changes Apr 29, 2021

View reviewed changes

jreback approved these changes Apr 30, 2021

View reviewed changes

jreback merged commit f72f02f into pandas-dev:master Apr 30, 2021

simonjayhawkins deleted the test_astype_string branch April 30, 2021 17:57

martinfleis mentioned this pull request May 2, 2021

CI: ERROR at setup of TestCasting.test_astype_string with pandas master geopandas/geopandas#1930

Closed

yeshsurya pushed a commit to yeshsurya/pandas that referenced this pull request May 6, 2021

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype (pand…

864060f

…as-dev#40998)

JulianWgs pushed a commit to JulianWgs/pandas that referenced this pull request Jul 3, 2021

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype (pand…

bff3ec1

…as-dev#40998)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype #40998

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype #40998

simonjayhawkins commented Apr 17, 2021

jreback Apr 19, 2021

simonjayhawkins Apr 19, 2021

simonjayhawkins Apr 29, 2021

jorisvandenbossche Apr 20, 2021

simonjayhawkins Apr 29, 2021

jreback commented Apr 30, 2021

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype #40998

[ArrowStringArray] BUG: fix test_astype_string for Float32Dtype #40998

Conversation

simonjayhawkins commented Apr 17, 2021

jreback Apr 19, 2021

Choose a reason for hiding this comment

simonjayhawkins Apr 19, 2021

Choose a reason for hiding this comment

simonjayhawkins Apr 29, 2021

Choose a reason for hiding this comment

jorisvandenbossche Apr 20, 2021

Choose a reason for hiding this comment

simonjayhawkins Apr 29, 2021

Choose a reason for hiding this comment

jreback commented Apr 30, 2021