smaller than microsecond timedelta64 Series are not saved correctly with to_csv #6783

cpcloud · 2014-04-03T17:28:14Z

In [24]: x = Series(np.around(randn(10) * 100).astype('timedelta64[ns]'))

In [25]: x
Out[25]:
0   -00:00:00.000000
1    00:00:00.000000
2   -00:00:00.000000
3   -00:00:00.000000
4    00:00:00.000000
5   -00:00:00.000000
6   -00:00:00.000000
7   -00:00:00.000000
8   -00:00:00.000000
9   -00:00:00.000000
dtype: timedelta64[ns]

In [26]: x.values
Out[26]: array([-259,   31,  -49,  -41,  141,  -14, -111, -117,  -20,  -35], dtype='timedelta64[ns]')

In [27]: buf = StringIO()

In [28]: x.to_csv(buf)

In [29]: print(buf.getvalue())
0,-00:00:00.000000
1,00:00:00.000000
2,-00:00:00.000000
3,-00:00:00.000000
4,00:00:00.000000
5,-00:00:00.000000
6,-00:00:00.000000
7,-00:00:00.000000
8,-00:00:00.000000
9,-00:00:00.000000

The text was updated successfully, but these errors were encountered:

cpcloud · 2014-04-03T17:29:30Z

somewhat of an edge case ... not sure how many people need or are using this level of precision

jreback · 2014-04-03T17:38:30Z

yep....part of the whole, formatting for csv output.....welcome to figure this out in a general way! (the timedelta formatting pretty trivial (as it already takes a format kw, just need to interpret it), bigger issue is how to tell to_csv the formats that you want...

e.g. dict of column -> format, or prob better to have a Class that you can override, call to set state, etc.

mroeschke · 2019-10-11T04:17:20Z

Looks like this is fixed on master. Could use a test

In [8]: print(buf.getvalue())
0,-1 days +23:59:59.999999925
1,-1 days +23:59:59.999999964
2,-1 days +23:59:59.999999957
3,-1 days +23:59:59.999999912
4,-1 days +23:59:59.999999996
5,0 days 00:00:00.000000094
6,0 days 00:00:00.000000130
7,-1 days +23:59:59.999999893
8,0 days 00:00:00.000000078
9,0 days 00:00:00.000000016

baevpetr · 2019-12-18T22:37:26Z

Hi, can I take it ?

TomAugspurger · 2019-12-30T14:00:26Z

@baevpetr I think this is already being fixed in #30554.

jreback added CSV labels Apr 3, 2014

jreback added this to the 0.15.0 milestone Apr 3, 2014

cpcloud mentioned this issue Apr 3, 2014

custom formatters for to_csv #4668

Closed

5 tasks

jreback modified the milestones: 0.16.0, Next Major Release Mar 3, 2015

mroeschke added good first issue Needs Tests Unit test(s) needed to prevent regressions and removed IO CSV read_csv, to_csv Output-Formatting __repr__ of pandas objects, to_string Timedelta Timedelta data type labels Oct 11, 2019

jbrockmendel added the IO CSV read_csv, to_csv label Oct 16, 2019

mroeschke mentioned this issue Dec 30, 2019

TST: Regression testing for fixed issues #30554

Merged

9 tasks

simonjayhawkins modified the milestones: Contributions Welcome, 1.0 Dec 30, 2019

TomAugspurger closed this as completed in #30554 Dec 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

smaller than microsecond timedelta64 Series are not saved correctly with to_csv #6783

smaller than microsecond timedelta64 Series are not saved correctly with to_csv #6783

cpcloud commented Apr 3, 2014

cpcloud commented Apr 3, 2014

jreback commented Apr 3, 2014

mroeschke commented Oct 11, 2019

baevpetr commented Dec 18, 2019

TomAugspurger commented Dec 30, 2019

smaller than microsecond timedelta64 Series are not saved correctly with to_csv #6783

smaller than microsecond timedelta64 Series are not saved correctly with to_csv #6783

Comments

cpcloud commented Apr 3, 2014

cpcloud commented Apr 3, 2014

jreback commented Apr 3, 2014

mroeschke commented Oct 11, 2019

baevpetr commented Dec 18, 2019

TomAugspurger commented Dec 30, 2019