Refactor smoothers as a utility, create new filters. #171

jsharpna · 2020-07-30T01:29:45Z

No description provided.

krivard · 2020-07-30T18:47:35Z

Standardize input and output: numpy array with one cell per day
Start with JHU or another Jingjing & Addison production; that's the most common style amongst the python indicators.

dshemetov · 2020-07-30T20:02:48Z

I think it may be a good to collect all the possible sources of data missingness and come up with a standard approach:

missing from source (temporarily or permanently)
privacy censoring

One approach to missing data can be seen in geo_reindex here. The TL;DR is: we fill a regular grid of daily values with the signal and all the missing values are then filled with 0's.

I wonder if it would be better to mark them with NANs instead. This would make it clear on the smoothing end that the value is missing and not 0. There we can decide to impute the data based on the most recent historic data or geographic factors.

dshemetov · 2020-07-30T20:42:23Z

Also, we should probably think through smoother boundary effects issues. Currently some smoothers will report NANs for approximately the first few weeks of the available data (the NAN window is based on the averaging window of ~2 weeks). This data will be so far in the past, that it likely won't matter for practical real-time users, but is worth considering for historical use-cases. A natural way to fix this would be to dynamically change the smoothing window on the boundaries.

krivard · 2020-08-04T18:47:00Z

Will incorporate into JHU first.

Comparing results to the reference implementation can be done in two copies of the repository or between a feature branch and the main branch.

Data censoring occurs within each indicator to handle data that are permitted by the DUA (ie no stderr/sample size) vs minimum sample size. Missingness due to censorship should be NA. Some sources report 0 when it is not necessarily a true 0 (GHT, cases/deaths).

There are a lot of parameters to the smoothers -- at some point we'll want to evaluate different configurations for typical and edge case performance.

dshemetov · 2020-08-11T03:42:57Z

#177 JHU refactoring is almost done. Want to write a couple better smoother tests first.

krivard · 2020-08-11T18:37:29Z

Add to other indicators, order TBD

krivard · 2020-08-12T18:28:20Z

jsharpna assigned dshemetov Jul 30, 2020

dshemetov mentioned this issue Jul 31, 2020

Add the smoothing utility #176

Merged

SumitDELPHI added the Engineering Used to filter issues when synching with Asana label Dec 6, 2020

krivard closed this as completed Aug 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor smoothers as a utility, create new filters. #171

Refactor smoothers as a utility, create new filters. #171

jsharpna commented Jul 30, 2020

krivard commented Jul 30, 2020

Uh oh!

dshemetov commented Jul 30, 2020

Uh oh!

dshemetov commented Jul 30, 2020

Uh oh!

krivard commented Aug 4, 2020

Uh oh!

dshemetov commented Aug 11, 2020

Uh oh!

krivard commented Aug 11, 2020

Uh oh!

krivard commented Aug 12, 2020 •

edited by bweaver-work

Loading

Uh oh!

Refactor smoothers as a utility, create new filters. #171

Refactor smoothers as a utility, create new filters. #171

Comments

jsharpna commented Jul 30, 2020

krivard commented Jul 30, 2020

Uh oh!

dshemetov commented Jul 30, 2020

Uh oh!

dshemetov commented Jul 30, 2020

Uh oh!

krivard commented Aug 4, 2020

Uh oh!

dshemetov commented Aug 11, 2020

Uh oh!

krivard commented Aug 11, 2020

Uh oh!

krivard commented Aug 12, 2020 • edited by bweaver-work Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

krivard commented Aug 12, 2020 •

edited by bweaver-work

Loading