BUG: datetime64 series reduces to nan when empty instead of nat #11245

llllllllll · 2015-10-05T18:41:09Z

I ran into some strange behavior with an empty series of dtype datetime64[ns]: calling max returned nan. I think the correct behavior here is to return NaT. I looked through test_nanops, but I am not sure where the right place for the test is.

The new behavior is:

In [1]: pd.Series(dtype='datetime64[ns]').max()
Out[1]: NaT

where the old behavior was:

In [1]: pd.Series(dtype='datetime64[ns]').max()
Out[1]: nan
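
For comparison, a minimal sketch of the intended behavior next to the float case. It assumes a pandas version that already includes this fix; the variable names are only illustrative.

```python
import pandas as pd

# Empty float series: there is nothing to reduce, so the result is a float NaN.
empty_float_max = pd.Series(dtype="float64").max()

# Empty datetime64 series: with this fix the missing value is NaT, not a float NaN.
empty_dt_max = pd.Series(dtype="datetime64[ns]").max()

print(repr(empty_float_max), repr(empty_dt_max))
```

The point of the change is that the missing-value marker matches the dtype being reduced, so downstream code does not have to special-case a float coming out of a datetime column.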

jreback · 2015-10-05T21:58:46Z

can you add a test in test_nanops and/or test_series

jreback · 2015-10-05T21:59:42Z

also want tests for timedelta64 and datetime64[ns, tz]
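
A rough sketch of what such a test could look like, covering the three dtypes mentioned here. The function name and the choice of UTC as the time zone are assumptions for illustration, not necessarily what was committed.

```python
import pandas as pd

# Dtypes named in the review; "UTC" is just an example time zone.
DTYPES = ["datetime64[ns]", "timedelta64[ns]", "datetime64[ns, UTC]"]


def check_empty_reductions_return_nat():
    for dtype in DTYPES:
        empty = pd.Series([], dtype=dtype)
        for name in ("min", "max"):
            result = getattr(empty, name)()
            # NaT is a singleton, so an identity check is the strictest assertion.
            assert result is pd.NaT, (dtype, name, result)


check_empty_reductions_return_nat()
```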

llllllllll · 2015-10-05T22:00:30Z

Sure, I will update this tonight when I get home

jreback · 2015-10-05T22:27:00Z

pls also add a note in the 0.17.1 whatsnew bug section

llllllllll · 2015-10-07T17:00:38Z

ping @jreback

jreback · 2015-10-07T18:52:03Z

pandas/core/nanops.py

@@ -637,7 +619,7 @@ def _maybe_null_out(result, axis, mask):
            else:
                result = result.astype('f8')
            result[null_mask] = np.nan
-    else:
+    elif result is not tslib.NaT:


if we started with M8/m8 and then do a .view('i8') this needs to be compared to pd.lib.iNaT, not sure why you are not hitting this here

so this should be elif not (result is tslib.NaT or result is tslib.iNaT)?

== tslib.iNaT

but puzzled why a NaT is there
as I don't think it's wrapped yet

If you take a ts and take a view as an int and then reduce, I think you still want nan. The only reason that you would want a NaT is that the dtype of the sequence being reduced is datetime64.

Oh, I see what you are saying, you were suggesting:

elif result.view('i8') == tslib.iNaT

It looks like result is still just a timestamp at this point so it will be the NaT object. I don't know if this is a guarantee or not.

no I mean result should already be an int if it's M8 as wrapping is the last step
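
To illustrate the integer case being discussed: when M8 data is viewed as i8, a missing value shows up as the integer sentinel iNaT rather than as a NaT object. The sketch below uses only numpy and rebuilds the sentinel as the minimum int64 value; the names `I_NAT`, `values`, and `as_int` are illustrative.

```python
import numpy as np

# iNaT is the minimum int64 value; it is rebuilt here from numpy for illustration.
I_NAT = np.iinfo(np.int64).min

values = np.array(["NaT", "2015-10-05"], dtype="M8[ns]")
as_int = values.view("i8")

# In the integer view, the missing entry is only recognizable
# by comparing against the integer sentinel.
is_missing = as_int == I_NAT
print(is_missing)
```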

jreback · 2015-10-07T18:53:37Z

question for you. otherwise lgtm, pls squash to a single commit (I know different from other projects, just convention here)

jreback · 2015-10-07T18:55:16Z

pandas/core/nanops.py

+            fill_value_typ=fill_value_typ,
+        )
+
+        # numpy 1.6.1 workaround in Python 3.x


hmm, you might be able to take this workaround out entirely, as we don't support 1.6 any longer (you can try and if travis passes, then ok!)

llllllllll · 2015-10-07T19:57:51Z

pandas/core/nanops.py

-    result = _wrap_results(result, dtype)
-    return _maybe_null_out(result, axis, mask)
+        result = _wrap_results(result, dtype)
+        return _maybe_null_out(result, axis, mask)


We have already wrapped the types by the time we call maybe_null_out. The result will already be coerced so I think the is check is safe.

ok, then your check is good. thxs
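
A short sketch of why an identity check is the right tool once the result has been wrapped: NaT, like NaN, never compares equal to anything, itself included, so `==` cannot detect it. This uses the public `pd.NaT` rather than the internal `tslib.NaT` from the diff.

```python
import pandas as pd

# Constructing a missing timestamp yields the NaT singleton.
wrapped = pd.Timestamp("NaT")

identity_matches = wrapped is pd.NaT   # singleton, so identity works
equality_matches = wrapped == pd.NaT   # NaT compares unequal to everything

print(identity_matches, equality_matches)
```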

llllllllll · 2015-10-07T20:01:01Z

Also I removed the workaround branch so hopefully the tests pass, otherwise we can put that branch back.

jreback · 2015-10-07T20:44:39Z

ok looks good. I'll be merging things on friday after releasing 0.17.0

llllllllll · 2015-10-07T20:47:59Z

Thank you very much

jreback · 2015-10-07T20:53:01Z

@llllllllll no thank you!

llllllllll · 2015-10-10T23:07:05Z

@jreback just rebased against master, good to merge?

jreback · 2015-10-10T23:08:24Z

doc/source/whatsnew/v0.17.1.txt

@@ -45,32 +48,10 @@ Bug Fixes

 - Bug in ``.to_latex()`` output broken when the index has a name (:issue: `10660`)
 - Bug in ``HDFStore.append`` with strings whose encoded length exceded the max unencoded length (:issue:`11234`)
-
-
-


all of this space is here on purpose
pls revert

Okay, sorry. What is the space for though?

jreback · 2015-10-10T23:39:01Z

we have lots of prs that are worked on at the same time
so when someone adds a note in whatsnew if they are all at the end
merging one requires everyone to rebase
but if the notes are inserted in a big list they are resolved by git
so we can merge many things w/o conflicts
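
The reasoning can be sketched with a toy model. This is not git's actual merge algorithm: it only assumes that two branches conflict when the regions of the base file they change overlap or touch. The helper names and the sample notes are hypothetical, and the distinct lines stand in for the spread-out slots that the blank lines provide.

```python
import difflib


def changed_base_ranges(base, branch):
    """Return the (start, end) ranges of base lines that a branch modifies."""
    matcher = difflib.SequenceMatcher(a=base, b=branch, autojunk=False)
    return [(i1, i2) for tag, i1, i2, _, _ in matcher.get_opcodes() if tag != "equal"]


def would_conflict(base, branch_a, branch_b):
    """Toy rule: conflict when any changed ranges overlap or touch."""
    for a_start, a_end in changed_base_ranges(base, branch_a):
        for b_start, b_end in changed_base_ranges(base, branch_b):
            if a_start <= b_end and b_start <= a_end:
                return True
    return False


base = ["- fix A", "- fix B", "- fix C", "- fix D"]

# Two PRs that both append their note at the end of the list.
pr_end_1 = base + ["- new fix 1"]
pr_end_2 = base + ["- new fix 2"]

# A PR that inserts its note in the middle of the list instead.
pr_middle = base[:1] + ["- new fix 3"] + base[1:]

print(would_conflict(base, pr_end_1, pr_end_2))   # both touch the same spot
print(would_conflict(base, pr_end_1, pr_middle))  # edits are far apart
```

Under this model, appending at the end makes every pair of PRs collide, while scattering insertions across the file keeps the changed regions apart.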

llllllllll · 2015-10-10T23:41:42Z

Ah, that's a cool trick

jreback · 2015-10-11T00:12:09Z

doc/source/whatsnew/v0.17.1.txt

@@ -74,3 +74,7 @@ Bug Fixes


 - Bugs in ``to_excel`` with duplicate columns (:issue:`11007`, :issue:`10982`, :issue:`10970`)
+- min and max reductions on ``datetime64`` and ``timedelta64`` dtyped series now
+  result in ``NaT`` and not ``nan`` (:issue:`11245`).


I would move this top comment to the API changes section (it's not a big deal, but in theory it gets highlighted to the user in a slightly more useful way).
(leave the empty series of dtype here)

jreback · 2015-10-11T00:12:47Z

minor comment, ping when green

llllllllll · 2015-10-11T04:09:14Z

tests are passing

jreback · 2015-10-11T15:17:56Z

thank you sir!

llllllllll force-pushed the dt64-reduce-to-nat branch from b10133a to dbf3824 Compare October 5, 2015 18:42

llllllllll mentioned this pull request Oct 5, 2015

ENH: Adds nan->pd.NaT edge blaze/odo#331

Merged

jreback added the Bug, Datetime, Dtype Conversions, Numeric Operations, and Compat labels Oct 5, 2015

jreback added this to the 0.17.1 milestone Oct 5, 2015

llllllllll force-pushed the dt64-reduce-to-nat branch from 631ba34 to e8a8d56 Compare October 6, 2015 07:38

jreback reviewed Oct 7, 2015

llllllllll force-pushed the dt64-reduce-to-nat branch from 1decd2e to 26a46a6 Compare October 7, 2015 19:49

llllllllll reviewed Oct 7, 2015

llllllllll force-pushed the dt64-reduce-to-nat branch from 26a46a6 to 4e290d7 Compare October 10, 2015 23:06

jreback reviewed Oct 10, 2015

llllllllll force-pushed the dt64-reduce-to-nat branch from 4e290d7 to af5e201 Compare October 10, 2015 23:34

jreback reviewed Oct 11, 2015

llllllllll force-pushed the dt64-reduce-to-nat branch from af5e201 to 40c8fcf Compare October 11, 2015 00:35

BUG: datetime64 series reduces to nan when empty instead of nat

40c8fcf

Fixes the parser for datetimetz to also allow the `M8[ns, tz]` alias.

jreback added a commit that referenced this pull request Oct 11, 2015

Merge pull request #11245 from llllllllll/dt64-reduce-to-nat

a4843cb

BUG: datetime64 series reduces to nan when empty instead of nat

jreback merged commit a4843cb into pandas-dev:master Oct 11, 2015
