testing forecasters on simple datasets #32
I also added the flatline forecaster and made the necessary changes to get it working. Should be ready for review (pending the tests passing on the remote).
To be clear, the tests this adds are:
(Branch updated from 7a6e0d1 to b60b500.)
Comments are mostly for formatting/structure, still looking at this for content.
There are a lot of objects being printed out, I assume for debugging; test code like that should be removed. I marked some of them, probably not all.
Generally, it'd be good to have more descriptive `test_that` names and/or comments about what the goal of a particular test is.
Confused about the reasoning behind a couple of the checks.
Other tests that might be useful to include:
- Which dates predicted values are available for, i.e. making sure that predictions only appear on days where we have enough data
- Right now, it looks like predictions are only made for dates that are included in the input dataset. It might be useful to check behavior when making true forecasts (again, that output dates are what we expect, etc.)
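A minimal sketch of the kind of date check suggested above. The function and column names (`check_forecast_dates`, `forecast_date`, `ahead`, `target_end_date`) are hypothetical stand-ins for whatever the forecasters actually return, assuming target dates are `forecast_date + ahead`:

```r
# Hypothetical check: every target_end_date should equal forecast_date + ahead,
# and a "true forecast" is one whose target date falls past the input data.
check_forecast_dates <- function(pred, input_max_date) {
  stopifnot(all(pred$target_end_date == pred$forecast_date + pred$ahead))
  # TRUE if any predictions are true forecasts (beyond the input dataset)
  any(pred$target_end_date > input_max_date)
}

pred <- data.frame(
  forecast_date = as.Date("2021-06-01"),
  ahead = 1:3
)
pred$target_end_date <- pred$forecast_date + pred$ahead

check_forecast_dates(pred, input_max_date = as.Date("2021-06-02"))  # TRUE
```

Wrapping something like this in a `test_that()` with an assertion on the returned flag would cover both the in-sample and true-forecast cases.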
```r
mutate(
  diff_from_exp =
    (value - qnorm(quantile, mean = synth_mean, sd = synth_sd)) /
      as.integer(target_end_date - forecast_date)
)
```
question: Why are we normalizing by days ahead?
iirc the variance should be increasing linearly with the number of days into the future? It's probably a complication we don't need to include, since ahead=1
for all of these tests.
> the variance should be increasing linearly with the number of days into the future

Oh, I see what you mean. A prediction for `ahead_i` will have error within `synth_sd` around the prediction for `ahead_i - 1`, which also has error. So the error accumulates and is ~bounded by `synth_sd` * days ahead for each ahead. Sound approximately correct? 😄
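A quick simulation of the accumulation argument above. This assumes each step's error is an independent draw from N(0, `synth_sd`) stacked on the previous step's prediction (the `synth_sd = 2`, simulation sizes, and seed are arbitrary illustration values, not from the tests): the accumulated sd then grows like `synth_sd * sqrt(k)` at ahead `k`, which indeed stays under the linear bound `synth_sd * k`:

```r
# Error accumulation sketch: each ahead adds an independent N(0, synth_sd)
# error on top of the previous ahead's (already noisy) prediction.
set.seed(42)
synth_sd <- 2
n_sims <- 10000
max_ahead <- 5

# One row per simulation, one column per step; cumsum each row to get the
# accumulated error at each ahead. Result: max_ahead x n_sims matrix.
step_errors <- matrix(rnorm(n_sims * max_ahead, sd = synth_sd), nrow = n_sims)
errors <- apply(step_errors, 1, cumsum)

# Empirical sd of the accumulated error at each ahead k = 1..max_ahead.
sd_by_ahead <- apply(errors, 1, sd)

# Grows ~ synth_sd * sqrt(k), so it stays below the linear bound synth_sd * k.
all(sd_by_ahead <= synth_sd * seq_len(max_ahead) * 1.05)
```

So dividing by days ahead is a conservative normalization under this model; `synth_sd * sqrt(days ahead)` would be the tighter scaling if the step errors really are independent.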
Maybe leave this in, in case we want to add more aheads later, but add a comment about the normalization? No strong feelings on my part.
Output date correctness is checked in the "basics" section, rather than the "data" section (`exploration-tooling/tests/testthat/test-forecasters-basics.R`, lines 18 to 21 in b60b500), as well as somewhat indirectly in the latency-adjusting-specific functions.
Looks good for a first pass.
Looks good!
Addresses #23 and #20. I ended up not actually using targets for the simple-data tests because it ended up too awkward and distributed. I should probably do a little cleanup of the repo and check for edge cases before we actually merge this.