BUG: Parse missing values using read_json with dtype=False to NaN instead of None (GH28501) #37834

avinashpancham · 2020-11-14T16:52:13Z

closes read_json with dtype=False infers Missing Values as None #28501
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

…tead of None (GH28501)

jreback

yep this is way more consistent, thanks @avinashpancham
cc @WillAyd merge when ready

WillAyd · 2020-11-15T01:53:01Z

Nice PR! Looks like there is a merge conflict in the whatsnew - if you can fix that up we can get this merged

avinashpancham · 2020-11-15T10:33:37Z

Merged master, but CI fails due to unrelated reasons. Will retry with new updates from master later today

jreback · 2020-11-15T17:15:48Z

@avinashpancham can you merge master once again

jreback · 2020-11-15T17:16:09Z

doc/source/whatsnew/v1.2.0.rst

@@ -624,6 +624,7 @@ I/O
 - Bug in :func:`read_html` was raising a ``TypeError`` when supplying a ``pathlib.Path`` argument to the ``io`` parameter (:issue:`37705`)
 - :meth:`to_excel` and :meth:`to_markdown` support writing to fsspec URLs such as S3 and Google Cloud Storage (:issue:`33987`)
 - Bug in :meth:`read_fw` was not skipping blank lines (even with ``skip_blank_lines=True``) (:issue:`37758`)
+- Parse missing values using :func:`read_json` with ``dtype=False`` to NaN instead of None (:issue:`28501`)


use double backticks around NaN and None

avinashpancham · 2020-11-15T17:20:01Z

@jreback merged master and updated whatsnew entry

jreback · 2020-11-18T19:06:38Z

hmm precommit checks are failing, can you merge master and ping on green

avinashpancham · 2020-11-18T22:52:02Z

Still some issues, will try again tomorrow

jreback · 2020-11-20T02:51:33Z

thanks @avinashpancham

simonjayhawkins · 2020-11-20T12:02:35Z

I'm not sure that there was consensus in the issue discussion on the correct behaviour or whether it is a bug. Maybe worth considering moving the release note into the breaking changes section in case anyone is depending on this behaviour.

jreback · 2020-11-20T12:18:14Z

I'm not sure that there was consensus in the issue discussion on the correct behaviour or whether it is a bug. Maybe worth considering moving the release note into the breaking changes section in case anyone is depending on this behaviour.

type dtype option is actually pretty odd anyhow (and should deprecate)

jorisvandenbossche · 2020-11-20T13:55:16Z

I think most voices in the issue actually argued that the current behaviour was correct, and so IMO we should not have changed it to start with.

jreback · 2020-11-20T14:29:47Z

there was exactly 1 comment

jorisvandenbossche · 2020-11-20T14:36:34Z

Will opened the issue, and 2 people commented on it arguing not to change it (and a third one commented as well, but off-topic), no one else argued in favor of the change.

Anyway, I don't think counting the numbers is that important, but I agree with @simonjayhawkins that there was certainly no clear consensus

jorisvandenbossche · 2020-11-20T14:41:20Z

On the None vs NaN, I am personally quite ambivalent, but when disabling parsing I think we should not return float dtype for this case.

jreback · 2020-11-20T14:49:39Z

you can revert if u want
not that important

this was removing a special case though - it was very odd

BUG: Parse missing values using read_json with dtype=False to NaN ins…

1e1f301

…tead of None (GH28501)

jreback added the IO JSON read_json, to_json, json_normalize label Nov 14, 2020

jreback added this to the 1.2 milestone Nov 14, 2020

jreback added the Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate label Nov 14, 2020

jreback approved these changes Nov 14, 2020

View reviewed changes

Merge remote-tracking branch 'upstream/master' into GH28501

9ac0c72

jreback requested changes Nov 15, 2020

View reviewed changes

avinashpancham added 2 commits November 15, 2020 18:17

Merge remote-tracking branch 'upstream/master' into GH28501

f03b4a0

Update whatsnew entry

5370b83

Merge remote-tracking branch 'upstream/master' into GH28501

9fc5989

avinashpancham force-pushed the GH28501 branch from 14c9f3f to 9fc5989 Compare November 19, 2020 23:02

jreback approved these changes Nov 20, 2020

View reviewed changes

jreback merged commit 5c35871 into pandas-dev:master Nov 20, 2020

jorisvandenbossche mentioned this pull request Nov 20, 2020

read_json with dtype=False infers Missing Values as None #28501

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Parse missing values using read_json with dtype=False to NaN instead of None (GH28501) #37834

BUG: Parse missing values using read_json with dtype=False to NaN instead of None (GH28501) #37834

avinashpancham commented Nov 14, 2020

jreback left a comment

WillAyd commented Nov 15, 2020

avinashpancham commented Nov 15, 2020

jreback commented Nov 15, 2020

jreback Nov 15, 2020

avinashpancham Nov 15, 2020

avinashpancham commented Nov 15, 2020

jreback commented Nov 18, 2020

avinashpancham commented Nov 18, 2020

jreback commented Nov 20, 2020

simonjayhawkins commented Nov 20, 2020

jreback commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jreback commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jreback commented Nov 20, 2020

BUG: Parse missing values using read_json with dtype=False to NaN instead of None (GH28501) #37834

BUG: Parse missing values using read_json with dtype=False to NaN instead of None (GH28501) #37834

Conversation

avinashpancham commented Nov 14, 2020

jreback left a comment

Choose a reason for hiding this comment

WillAyd commented Nov 15, 2020

avinashpancham commented Nov 15, 2020

jreback commented Nov 15, 2020

jreback Nov 15, 2020

Choose a reason for hiding this comment

avinashpancham Nov 15, 2020

Choose a reason for hiding this comment

avinashpancham commented Nov 15, 2020

jreback commented Nov 18, 2020

avinashpancham commented Nov 18, 2020

jreback commented Nov 20, 2020

simonjayhawkins commented Nov 20, 2020

jreback commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jreback commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jorisvandenbossche commented Nov 20, 2020

jreback commented Nov 20, 2020