API/BUG: pd.array(float_ndarray_with_nans) casts np.nan to pd.NA #39039

jbrockmendel · 2021-01-08T18:35:53Z

I was under the impression we meant to keep these as distinct concepts

jbrockmendel · 2021-01-08T18:36:59Z

setting data[-3] = np.nan also gets cast to pd.NA

phofl · 2021-01-19T20:03:43Z

Docs sound like this is intended? https://pandas.pydata.org/docs/reference/api/pandas.array.html

Changed in version 1.2.0: Pandas now also infers nullable-floating dtype for float-like input data

jbrockmendel · 2021-01-19T20:07:05Z

@jorisvandenbossche is the one to ask on this

jorisvandenbossche · 2021-01-20T13:50:16Z

Yes, this is not a bug but intentional behaviour.

See the 4th bullet points in the top post of the PR implementing it (#34307), but this just mentions it. Some actual discussion about is scattered through the issue on this: #32265

The main reason is that any code that currently deals with numpy arrays and pandas (also internally in pandas) assumes that NaNs are treated as missing data. So for compatibility with this (preserving the "missing" semantics), we convert any NaN to NA by default when converting numpy array to a masked/nullable array.

Now, this certainly is a topic that still needs more attention (eg adding an option to be able to preserve NaNs on input instead of converting to NA, we still need to discuss how to treat NaNs that get introduced by computations, default conversion to numpy, ..)

jbrockmendel · 2021-01-20T18:48:13Z

Thanks @jorisvandenbossche. Is there (supposed to be?) a way to actually set data[3] = np.nan and actually get np.nan instead of pd.NA?

simonjayhawkins · 2021-01-22T14:51:09Z

duplicate of/covered by #32265

mroeschke · 2021-08-15T02:43:37Z

Does look similar to #32265, continued discussion can happen on that issue

jbrockmendel added Bug Needs Triage Issue that has not been reviewed by a pandas team member labels Jan 8, 2021

simonjayhawkins added NA - MaskedArrays Related to pd.NA and nullable extension arrays API Design Needs Discussion Requires discussion from core team before further action and removed Needs Triage Issue that has not been reviewed by a pandas team member Bug labels Jan 22, 2021

simonjayhawkins added the Closing Candidate May be closeable, needs more eyeballs label Jan 22, 2021

jbrockmendel mentioned this issue Jan 29, 2021

BUG/API: make setitem-inplace preserve dtype when possible with PandasArray, IntegerArray, FloatingArray #39044

Closed

4 tasks

mroeschke closed this as completed Aug 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API/BUG: pd.array(float_ndarray_with_nans) casts np.nan to pd.NA #39039

API/BUG: pd.array(float_ndarray_with_nans) casts np.nan to pd.NA #39039

jbrockmendel commented Jan 8, 2021

jbrockmendel commented Jan 8, 2021

phofl commented Jan 19, 2021

jbrockmendel commented Jan 19, 2021

jorisvandenbossche commented Jan 20, 2021

jbrockmendel commented Jan 20, 2021

simonjayhawkins commented Jan 22, 2021

mroeschke commented Aug 15, 2021

API/BUG: pd.array(float_ndarray_with_nans) casts np.nan to pd.NA #39039

API/BUG: pd.array(float_ndarray_with_nans) casts np.nan to pd.NA #39039

Comments

jbrockmendel commented Jan 8, 2021

jbrockmendel commented Jan 8, 2021

phofl commented Jan 19, 2021

jbrockmendel commented Jan 19, 2021

jorisvandenbossche commented Jan 20, 2021

jbrockmendel commented Jan 20, 2021

simonjayhawkins commented Jan 22, 2021

mroeschke commented Aug 15, 2021