Demonstrate time aggregation in vignette #24

jacobbien · 2021-11-09T20:11:54Z

In the first half of the "aggregation.Rmd" vignette, we should include some examples of how to perform time aggregation. In particular, the approach to demo is casting to tsibble on-the-fly and then leveraging the handy functions there.

A particularly useful example would involve adding aggregation to the epiweek level:

library(dplyr)   # provides summarize()
library(tsibble)
dat %>%
  as_tsibble(index = time_value, key = geo_value) %>%
  index_by(Epiweek = ~ epiweek(.)) %>%
  summarize(num_cases = sum(Cases))

Here the epiweek function would have to be defined (perhaps based on the MMWRweek R package).
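As a rough sketch of what such a helper could look like, the snippet below maps each date to the Sunday that starts its epiweek (MMWR weeks run Sunday through Saturday) using base R only. The name `epiweek_start`, the toy `dat`, and its column values are made up for illustration. Returning a Date rather than a week number keeps weeks from different years distinct. The example also adds `group_by_key()` so that the sums stay separate per `geo_value`; as far as I understand tsibble, omitting it aggregates across keys.

```r
library(dplyr)
library(tsibble)

# Hypothetical helper (not part of any package): the Sunday on or before each date.
epiweek_start <- function(x) {
  x <- as.Date(x)
  x - as.integer(format(x, "%w"))  # %w is the weekday number, with 0 = Sunday
}

# Toy data: two locations, 14 consecutive days starting on Sunday 2021-11-07.
dat <- tibble(
  geo_value  = rep(c("ca", "fl"), each = 14),
  time_value = rep(as.Date("2021-11-07") + 0:13, times = 2),
  Cases      = rep(1:14, times = 2)
)

weekly <- dat %>%
  as_tsibble(index = time_value, key = geo_value) %>%
  group_by_key() %>%                          # keep one series per geo_value
  index_by(Epiweek = ~ epiweek_start(.)) %>%  # new index: start of the epiweek
  summarize(num_cases = sum(Cases))

print(weekly)
```

A helper built on MMWRweek could replace `epiweek_start()` without changing the rest of the pipeline, as long as it returns something tsibble accepts as an index.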

(For more context on this issue, see #7)


ryantibs · 2021-11-13T17:52:01Z

Thanks @jacobbien. Wondering what @earowang thinks?

We're currently thinking that casting to tsibble on-the-fly (which, as Jacob mentions, we'll demo in the first half of this vignette) is a good strategy for accessing all the tsibble utilities without having to worry up front about compatibility in all cases. That is, it puts the onus on the user to define the index variable carefully when they want to use tsibble utilities, and not on us (the package designers). More generally, how the index variable gets updated across various sliding and pivoting operations seems like it could get complicated.

Do you see a downside to this approach, or have different perspectives on its pros & cons?

earowang · 2021-11-15T01:29:41Z

Coercing to tsibble as needed definitely works.

Do index = time_value and key = geo_value hold all the time? If so, defining as_tsibble.epi_tibble(x, ...) would save users from passing these parameters when coercing.

dat %>% 
  as_tsibble() %>% # less learning
  index_by()

{lubridate} also provides epiweek(), although not sure if they refer to the same epi weeks.
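One way to settle that question empirically is to compare the two packages over a range of dates. The sketch below assumes both lubridate and MMWRweek are installed; the date range is an arbitrary choice that spans several year boundaries, where week-numbering conventions are most likely to disagree. Note that `epiweek()` returns only a week number, so it would need to be paired with `epiyear()` to form an index that distinguishes years.

```r
library(lubridate)
library(MMWRweek)

# Arbitrary test range covering several year boundaries.
dates <- seq(as.Date("2015-01-01"), as.Date("2022-12-31"), by = "day")

# MMWRweek() returns a data frame with columns MMWRyear, MMWRweek, MMWRday.
mmwr <- MMWRweek(dates)

# Compare week numbers and the years those weeks are assigned to.
same_weeks <- all(epiweek(dates) == mmwr$MMWRweek)
same_years <- all(epiyear(dates) == mmwr$MMWRyear)

print(c(same_weeks = same_weeks, same_years = same_years))
```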

qpmnguyen · 2022-01-30T00:14:09Z

@ryantibs @jacobbien happy to take up writing the vignette for this issue if it's available.

ryantibs · 2022-01-30T00:21:30Z

@qpmnguyen Thanks! Please go for it.

I like Earo's idea of defining as_tsibble.epi_df() with the defaults being index = time_value and key = geo_value. But the user can override this and set a key based on multiple variables, if they want.
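A minimal sketch of such a method is shown below. It is an illustration only: the toy object is just a tibble with an "epi_df" class tag attached, the `age_group` column is invented, and a real epi_df may carry metadata that this does not handle.

```r
library(tsibble)

# Hypothetical S3 method: default to the conventional columns, but let the
# caller override key and index.
as_tsibble.epi_df <- function(x, key = geo_value, index = time_value, ...) {
  # Drop the "epi_df" class so the call below dispatches to the tibble method.
  class(x) <- setdiff(class(x), "epi_df")
  as_tsibble(tibble::as_tibble(x), key = {{ key }}, index = {{ index }}, ...)
}

# Toy stand-in for an epi_df (assumed structure, for illustration).
dat <- tibble::tibble(
  geo_value  = rep(c("ca", "fl"), each = 3),
  age_group  = "all",
  time_value = rep(as.Date("2022-01-01") + 0:2, times = 2),
  cases      = 1:6
)
class(dat) <- c("epi_df", class(dat))

ts_default <- as_tsibble(dat)                                # key = geo_value
ts_custom  <- as_tsibble(dat, key = c(geo_value, age_group)) # multi-variable key

print(key_vars(ts_default))
print(key_vars(ts_custom))
```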

I also think you could consider demonstrating tsibble's functionality for detecting and filling gaps in the time series (either with NAs or with LOCF). Thanks again for volunteering.
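For reference, a small sketch of what that gap handling could look like, using a made-up series with two missing days. It uses tsibble's `has_gaps()`, `scan_gaps()`, and `fill_gaps()`, plus `tidyr::fill()` for the LOCF step; the data and column names are invented for illustration.

```r
library(dplyr)
library(tsibble)

# Toy series for one location with 2022-01-03 and 2022-01-04 missing.
dat <- tibble(
  geo_value  = "ca",
  time_value = as.Date("2022-01-01") + c(0, 1, 4),
  cases      = c(10, 12, 20)
) %>%
  as_tsibble(index = time_value, key = geo_value)

has_gaps(dat)   # one row per key, flagging whether that series has gaps
scan_gaps(dat)  # the key/index combinations that are missing

# Make the gaps explicit: missing days become rows with NA cases.
filled_na <- fill_gaps(dat)

# LOCF: carry the last observed value forward within each key.
filled_locf <- filled_na %>%
  group_by_key() %>%
  tidyr::fill(cases, .direction = "down")

print(filled_locf)
```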

ryantibs · 2022-02-09T19:09:32Z

Closed by #37 #38.

jacobbien mentioned this issue Apr 15, 2022

Create geo-aggregation extension of tsibble and demonstrate its use in aggregation vignette cmu-delphi/gtsibble#1

Open

ryantibs assigned qpmnguyen Jan 30, 2022

qpmnguyen mentioned this issue Feb 4, 2022

Drafted the time component of the aggregation vignette #37

Merged

ryantibs closed this as completed Feb 9, 2022
