ENH: evaluate datetime ops in python with eval #4924

cpcloud · 2013-09-21T21:47:41Z

Also adds DatetimeIndex comparisons in query expressions and a corresponding vbench for a simple Timestamp vs either Series or DatetimeIndex.
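For readers skimming the thread, here is a minimal sketch of the kind of query this enables. It assumes a recent pandas, where local variables are referenced with `@`. The frame, the column name `dates`, and the variable `ts` are made up for illustration and do not come from the PR's tests.

```python
import pandas as pd

# Hypothetical frame: a datetime64 column plus a DatetimeIndex with the same values.
rng = pd.date_range("2013-01-01", periods=5)
df = pd.DataFrame({"dates": rng, "x": range(5)}, index=rng)

ts = pd.Timestamp("2013-01-03")

# Compare a Timestamp against a datetime64 column ...
by_column = df.query("dates > @ts")

# ... and against the (unnamed) DatetimeIndex, referenced as `index`.
by_index = df.query("index > @ts")
```

Both queries should select the same rows here, since the column and the index hold identical dates.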

jreback · 2013-09-21T22:21:39Z

maybe add some tests where the user is comparing a date column with a non-date? that should raise

as an aside, in the future min/max/diff/shift on dates will have to be done in python land

cpcloud · 2013-09-21T23:52:06Z

pandas/computation/expr.py

@@ -493,8 +495,14 @@ def _possibly_evaluate_binop(self, op, op_class, lhs, rhs,
                                 maybe_eval_in_python=('==', '!=')):
        res = op(lhs, rhs)

-        # "in"/"not in" ops are always evaluated in python
+        if (res.op in _cmp_ops_syms and
+            lhs.kind in _date_kinds or lhs.kind in _date_kinds):


that should be rhs
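A minimal sketch of the intended check, with `rhs` in the second operand position. The contents of `_cmp_ops_syms` and `_date_kinds` below are placeholders assumed for illustration, `is_datetime_comparison` is a hypothetical helper, and the parentheses are my addition. Without them Python parses `a and b or c` as `(a and b) or c`, so a date on the right-hand side would match for any operator.

```python
# Placeholder values; the real sets live in pandas/computation and may differ.
_cmp_ops_syms = frozenset([">", "<", ">=", "<=", "==", "!="])
_date_kinds = frozenset(["datetime64", "timestamp", "datetime"])


def is_datetime_comparison(op, lhs_kind, rhs_kind):
    """Return True if `op` is a comparison and either operand is date-like."""
    return op in _cmp_ops_syms and (
        lhs_kind in _date_kinds or rhs_kind in _date_kinds
    )
```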

cpcloud · 2013-09-24T21:04:13Z

@jreback @jtratner comments?

cpcloud · 2013-09-25T01:33:05Z

this should eventually handle timedelta as well

jreback · 2013-09-25T01:39:43Z

under 1.7! they are like ints, so I bet ne can handle them, but it's probably better to handle them in python because of NaT
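To illustrate the NaT concern, a small sketch assuming a recent NumPy, where comparisons involving NaT evaluate to False. Viewed as int64, NaT is the minimum integer, so an integer-only engine would treat it as the smallest value instead of as missing.

```python
import numpy as np

# timedelta64 array with a missing value in the first slot
td = np.array([0, 1, 2], dtype="m8[D]")
td[0] = np.timedelta64("NaT")

# The raw int64 view: NaT is stored as the minimum int64
as_int = td.view("i8")

# Comparing the integer view treats NaT as a very small number ...
naive = as_int < 1

# ... while the native timedelta64 comparison treats NaT as "not comparable"
native = td < np.timedelta64(1, "D")
```

Here `naive` flags the NaT slot as less than one day, while `native` does not.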

cpcloud · 2013-09-25T01:41:07Z

I think I'll do a new PR for that

cpcloud · 2013-09-25T01:43:09Z

💥

In [5]: td = timedelta64(1, 'D')

In [6]: ne.evaluate('td + 1')
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-6-f2c950d373ae> in <module>()
----> 1 ne.evaluate('td + 1')

/home/phillip/.virtualenvs/pandas/lib/python2.7/site-packages/numexpr/necompiler.pyc in evaluate(ex, local_dict, global_dict, out, order, casting, **kwargs)
    728
    729     # Create a signature
--> 730     signature = [(name, getType(arg)) for (name, arg) in zip(names, arguments)]
    731
    732     # Look up numexpr if possible.

/home/phillip/.virtualenvs/pandas/lib/python2.7/site-packages/numexpr/necompiler.pyc in getType(a)
    627     if kind == 'S':
    628         return bytes
--> 629     raise ValueError("unkown type %s" % a.dtype.name)
    630
    631

ValueError: unkown type timedelta64[D]
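Since numexpr rejects the timedelta64 dtype outright, one way to decide up front whether an expression has to stay in Python is to inspect the NumPy dtype kind of each operand. This is only a sketch: `needs_python_eval` is a hypothetical helper, not something from pandas or numexpr.

```python
import numpy as np


def needs_python_eval(*operands):
    """Return True if any operand is datetime64 ('M') or timedelta64 ('m').

    Hypothetical helper for illustration; the actual pandas dispatch differs.
    """
    return any(np.asarray(o).dtype.kind in "mM" for o in operands)


td = np.timedelta64(1, "D")
use_python = needs_python_eval(td, 1)             # timedelta64 operand present
use_numexpr = not needs_python_eval(np.arange(3), 2.0)  # plain numeric operands
```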

jreback · 2013-09-25T12:05:52Z

pandas/computation/ops.py

-            return pd.Timestamp(self._value)
-        elif kind == 'timestamp':
-            return self._value.asm8.view('i8')
+            return np.datetime64(self._value)


are you setting the kind somewhere? I wouldn't do this; instead use Timestamp(self._value).value if you really need the i8 value. why would these need to be i8 anyhow? are they evaluated in python space (for which you are calling functions)?

I'm not actually getting the i8 value. That's what I was doing previously, to evaluate datetime ops using numexpr. kind is just a string repr of either the class name or the dtype, it's a computed property

ok... I just wouldn't allow a datetime type at all (instead just wrap Timestamp around it), which keeps things consistent
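A short sketch of the "wrap everything in Timestamp" idea discussed here, assuming a recent pandas. The three input values are made up for illustration. `Timestamp.value` is the i8 nanosecond count mentioned in the review comment.

```python
import datetime

import numpy as np
import pandas as pd

# Three datetime-like spellings of the same instant
candidates = [
    datetime.datetime(2013, 9, 25),
    np.datetime64("2013-09-25"),
    "2013-09-25",
]

# Wrapping each in Timestamp gives one consistent type to compare against
stamps = [pd.Timestamp(c) for c in candidates]

# If the raw i8 value is ever needed, it is available as nanoseconds since the epoch
i8_value = stamps[0].value
```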

cpcloud · 2013-09-27T16:23:31Z

@jreback comments?

jreback · 2013-09-27T16:25:10Z

pandas/core/frame.py

@@ -1898,6 +1898,7 @@ def _get_index_resolvers(self, axis):
        # index or columns
        axis_index = getattr(self, axis)
        d = dict()
+        prefix = axis[0]


in the future maybe this function should be in core/generic?

jreback · 2013-09-27T16:25:37Z

looks fine otherwise

cpcloud · 2013-09-27T16:55:05Z

moved to generic and added a smoke test for failing panel/panel4d with multiindex

cpcloud · 2013-09-27T18:00:19Z

slight performance hit when using a DatetimeIndex vs a column of a Series, need to clear that up before merging

cpcloud · 2013-09-27T18:18:01Z

turns out it's because a datetime64 Series with an Int64Index compares faster than a datetime64 Series with a DatetimeIndex. This is the case before 8a9a4f2 (the just merged timestamp compare fix) so that didn't introduce any performance regressions (I also ran the full vbench suite on it and no changes). I ran a vbench on this as well and nothing changed.

cpcloud · 2013-09-27T19:34:39Z

bombs away

ENH: evaluate datetime ops in python with eval

jreback · 2014-03-27T15:20:39Z

@cpcloud do you recall if we fixed this for timedelta ops as well (which should be evaluated in python space and not by numexpr)?

cpcloud · 2014-03-27T23:10:26Z

Don't think there's a test for them in top level query/eval, but there are some tests in test_pytables.py.

ghost assigned cpcloud Sep 21, 2013

cpcloud reviewed Sep 21, 2013
jreback reviewed Sep 25, 2013
jreback reviewed Sep 27, 2013
cpcloud added 5 commits September 27, 2013 13:36

ENH: evaluate datetime ops in python space with eval

a9dfc50

TST: test nondate vs. date comparisons in query

2e6e261

TST: add test for multiindex resolver naming

ea7be22

ENH: convert all datetime-like objects to Timestamp

d12ae36

CLN: move resolvers getter to generic.py

a178546

PERF: add vbench for datetime comparisons

1375c51

cpcloud added a commit that referenced this pull request Sep 27, 2013

Merge pull request #4924 from cpcloud/eval-datetime-in-python

855d3d7

ENH: evaluate datetime ops in python with eval

cpcloud merged commit 855d3d7 into pandas-dev:master Sep 27, 2013

cpcloud deleted the eval-datetime-in-python branch September 27, 2013 19:34

jreback mentioned this pull request Mar 27, 2014

WARN: possible windows x64 issue with older numpy MKL #3434

Closed
