timeseries: add tip about using groupby() rather than resample

jd · jreback · commit f0c7c41831eb · 2015-10-08T07:24:29.000-04:00
As discussed in #11217, there's another way of doing resampling that is not yet covered by `resample' itself. Let's document that.
diff --git a/doc/source/timeseries.rst b/doc/source/timeseries.rst
@@ -1246,6 +1246,29 @@ previous versions, resampling had to be done using a combination of
 function on the grouped object. This was not nearly as convenient or performant
 as the new pandas timeseries API.
 
+Sparse timeseries
+~~~~~~~~~~~~~~~~~
+
+If your timeseries are sparse, be aware that upsampling will generate a lot of
+intermediate points filled with whatever passed as ``fill_method``. What
+``resample`` does is basically a group by and then applying an aggregation
+method on each of its groups, which can also be achieve with something like the
+following.
+
+.. ipython:: python
+
+    def round(t, freq):
+        # round a Timestamp to a specified freq
+        return Timestamp((t.value // freq.delta.value) * freq.delta.value)
+
+    from functools import partial
+
+    rng = date_range('1/1/2012', periods=100, freq='S')
+
+    ts = Series(randint(0, 500, len(rng)), index=rng)
+
+    ts.groupby(partial(round, freq=offsets.Minute(3))).sum()
+
 .. _timeseries.periods:
 
 Time Span Representation