Nanoseconds being truncated in asof #3375

Closed
schwallie opened this issue Apr 16, 2013 · 9 comments

Labels
Bug · Datetime (Datetime data dtype) · Dtype Conversions (Unexpected or buggy dtype conversions) · Indexing (Related to indexing on series/frames, not to indexes themselves)

Comments

schwallie commented Apr 16, 2013

match_time = get_info.index.asof(trade_time)
match_price = get_info[match_time]['Bid Price']

The actual index value:
<Timestamp: 2013-04-10 10:22:01.696815975-0500 CDT, tz=US/Central>

The error returned:
KeyError: u'no item named 2013-04-10 10:22:01.696815-05:00'

It seems to me that the last digits of the timestamp are being cut off in this example (.696815975 becomes .696815).
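
For reference, the truncation can be reduced to dateutil's fractional-second handling, which later comments in this thread identify as the root cause. A minimal sketch (dateutil keeps only the first six fractional digits):

from dateutil import parser

dt = parser.parse('2013-04-10 10:22:01.696815975')
print(dt.microsecond)  # 696815 -- the trailing 975 nanoseconds are dropped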

jreback (Contributor) commented Apr 16, 2013

can you show pandas version, platform, and get_info?

schwallie (Author)

pandas version = 0.10
platform = Linux

get_info is a huge, huge DataFrame that is indexed by a time series. I think we should be fine without it?

FYI, if I use

match_price = get_info[get_info.index[0]]

I get the same problem and error

jreback (Contributor) commented Apr 16, 2013

of course, just want to see a reproducible sample (you can put random data)

jreback (Contributor) commented Apr 16, 2013

related to #3060

On 0.11rc1:

In [14]: s = Series([pd.Timestamp('20130101')]).values.view('i8')[0]

In [15]: r = pd.DatetimeIndex([ s + 50 + i for i in range(100) ])

In [16]: x = Series(randn(100),index=r)

In [17]: x.asof(x.index[0])
Out[17]: -1.7275857659530389

In [20]: x.index.asof(x.index[0])
Out[20]: <Timestamp: 2013-01-01 00:00:00.000000050>

The errors:

In [21]: x['2013-01-01 00:00:00.000000050']
KeyError: '2013-01-01 00:00:00.000000050'
In [23]: x[pd.Timestamp('2013-01-01 00:00:00.000000050')]
KeyError: 1356998400000000000
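
The session above can be turned into a self-contained script. A sketch, assuming the usual imports (Series and randn come from the IPython pylab namespace in the original); on pandas versions after the fix the final lookup no longer raises:

import numpy as np
import pandas as pd

base = pd.Timestamp('20130101').value  # integer nanoseconds since the epoch
idx = pd.to_datetime([base + 50 + i for i in range(100)], unit='ns')
x = pd.Series(np.random.randn(100), index=idx)

print(x.asof(x.index[0]))        # value lookup via asof works
print(x.index.asof(x.index[0]))  # Timestamp('2013-01-01 00:00:00.000000050')

# On affected versions, string lookup fails because parsing drops the 50 ns:
try:
    x['2013-01-01 00:00:00.000000050']
except KeyError as exc:
    print('KeyError:', exc)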

wuan (Contributor) commented May 11, 2013

when writing

x[Timestamp(np.datetime64('2013-01-01 00:00:00.000000050+0000', 'ns'))]

the index access should work with #3060 now. The major problem here is that time strings are parsed with Python's datetime when creating a Timestamp object, and datetime ignores nanosecond values.

It may break a lot of things, but wouldn't it be better to use np.datetime64 to parse timestamp strings?
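
To illustrate the asymmetry (a sketch; Timestamp string parsing in later pandas versions handles nanoseconds directly):

import numpy as np
import pandas as pd
from dateutil import parser

s = '2013-01-01T00:00:00.000000050'

# numpy's datetime64 keeps the nanoseconds...
print(pd.Timestamp(np.datetime64(s, 'ns')))  # 2013-01-01 00:00:00.000000050

# ...while a Python datetime (what dateutil produces) cannot represent
# anything finer than microseconds, so the 50 ns are silently dropped:
print(parser.parse(s))  # 2013-01-01 00:00:00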

jreback (Contributor) commented May 11, 2013

http://comments.gmane.org/gmane.comp.python.pydata/688

the issue is dateutil, which truncates

can't use the numpy parser as it's broken in < 1.7
it's possible to try it first and then fall back to dateutil
might be a perf hit
also possible to detect, via some regexes, common formats that we know numpy can parse
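
A rough sketch of the fallback strategy (a hypothetical helper, not what pandas actually implements): try the nanosecond-aware numpy parser first, and fall back to dateutil only for formats numpy (>= 1.7) cannot handle; a regex pre-check for known formats would serve the same purpose:

import numpy as np
from dateutil import parser as du_parser

def parse_timestamp(text):
    try:
        # numpy >= 1.7 parses ISO-8601 strings at nanosecond resolution
        return np.datetime64(text, 'ns')
    except ValueError:
        # dateutil handles looser formats, but only down to microseconds
        return np.datetime64(du_parser.parse(text))

print(parse_timestamp('2013-01-01T00:00:00.000000050'))  # nanoseconds kept
print(parse_timestamp('Jan 1, 2013 12:00 PM'))           # dateutil fallback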

wuan (Contributor) commented May 18, 2013

Maybe the only (performant) way to improve the situation is to reimplement the dateutil parser, which is pure Python at the moment.

jreback (Contributor) commented May 18, 2013

it's possible that we could parse a few common formats in Cython/C and fall back to the generic parser,
but this would be some work

jreback modified the milestones: 0.15.0, 0.14.0 (Feb 26, 2014)
jreback (Contributor) commented Jan 26, 2015

IIRC this is fixed by some recent work in 0.15.0. If it's still not working, please open a new issue.

jreback closed this as completed Jan 26, 2015