Djm/resids hotfix #294

dajmcdon · 2024-02-20T20:48:35Z

Checklist

Please:

Make sure this PR is against "dev", not "main".
Request a review from one of the current epipredict main reviewers:
dajmcdon.
Make sure to bump the version number in DESCRIPTION and NEWS.md.
Always increment the patch version number (the third number), unless you are
making a release PR from dev to main, in which case increment the minor
version number (the second number).
Describe changes made in NEWS.md, making sure breaking changes
(backwards-incompatible changes to the documented interface) are noted.
Collect the changes under the next release number (e.g. if you are on
0.7.2, then write your changes under the 0.8 heading).

Change explanations for reviewer

If grab_residuals() returns a data frame that includes some keys, then the by_key argument causes an error. This is because, for example, if geo_value is a key (it always is) and the r object has the columns c("geo_value", ".resid"), then bind_cols(key_cols, r) renames the two geo_value columns to make them distinct, and subsequent group_by operations fail.
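
The renaming described above can be reproduced in isolation. This is a minimal sketch with toy tibbles: the names key_cols and r mirror the description, but the data are made up, and the de-duplication at the end is only one plausible way to avoid the clash, not necessarily the change made in this PR.

```r
library(dplyr)

# Toy stand-ins: `key_cols` holds the keys, `r` is what a residual
# extractor might return when it also carries a key column.
key_cols <- tibble(geo_value = c("ca", "ny"))
r <- tibble(geo_value = c("ca", "ny"), .resid = c(0.1, -0.2))

# bind_cols() repairs the duplicated name, so a column called
# `geo_value` no longer exists and group_by(geo_value) would fail.
clashing <- suppressMessages(bind_cols(key_cols, r))
names(clashing)

# Assumed workaround (illustrative only): drop the columns of `r` that
# are already present in `key_cols` before binding.
deduped <- bind_cols(key_cols, r[setdiff(names(r), names(key_cols))])
grouped <- group_by(deduped, geo_value)
```

With the overlapping column dropped first, geo_value keeps its name and the grouping step has something to find.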

Magic GitHub syntax to mark associated Issue(s) as resolved when this is merged into the default branch

Resolves #{issue number}

dsweber2 · 2024-02-20T22:33:12Z

Interesting, a little surprised I haven't actually run into this problem. If we're not doing grouping by geo_value by default, when would we want to do this? Is this a problem that comes up naturally for multi-aheads, e.g. smooth_quantile_regression?

So I tried the test without the change and it seems to have passed; doing a browser(), it looks like grab_residuals only returns a .resid column. My guess is that this unit test came from a more complicated situation where grab_residuals actually returned extra columns?

dsweber2

Probably need to extract the test case slightly differently. Looking at grab_residuals, seems like it could only happen if ".resid" is already in the residuals of the_fit, which is returning a dataframe.

Were you maybe using a different engine than linear_reg?

dajmcdon · 2024-02-20T22:38:02Z

Here's the failure:

library(epipredict)
#> Loading required package: epiprocess
#> 
#> Attaching package: 'epiprocess'
#> The following object is masked from 'package:stats':
#> 
#>     filter
#> Loading required package: parsnip
flat1 <- flatline_forecaster(case_death_rate_subset, "death_rate")
flat2 <- flatline_forecaster(
  case_death_rate_subset, "death_rate",
  args_list = flatline_args_list(quantile_by_key = "geo_value")
)
#> New names:
#> • `geo_value` -> `geo_value...1`
#> • `geo_value` -> `geo_value...2`
#> Error in `dplyr::group_by()`:
#> ! Must group by variables found in `.data`.
#> ✖ Column `geo_value` is not found.

Created on 2024-02-20 with reprex v2.1.0

And on rebuilding with this fix:

library(epipredict)
#> Loading required package: epiprocess
#> 
#> Attaching package: 'epiprocess'
#> The following object is masked from 'package:stats':
#> 
#>     filter
#> Loading required package: parsnip
flat1 <- flatline_forecaster(case_death_rate_subset, "death_rate")
flat2 <- flatline_forecaster(
  case_death_rate_subset, "death_rate",
  args_list = flatline_args_list(quantile_by_key = "geo_value")
)

Created on 2024-02-20 with reprex v2.1.0

dajmcdon · 2024-02-20T22:39:55Z

It's because of the grab_residuals() function returning a tibble with 2 columns (geo_value and .resid) for the flatline engine. This doesn't (necessarily) happen with other engines.

dsweber2 · 2024-02-20T22:49:08Z

That seems simple enough to include as the unit test maybe?

I tried the straightforward wf <- epi_workflow(r, parsnip::linear_reg() %>% parsnip::set_engine("flatline")) %>% fit(jhu) to get the test to fail on the old version, but that gives the confusing error of

Error in `dplyr::arrange()`:
ℹ In argument: `..1 = time_value`.
Caused by error:
! object 'time_value' not found

* avoids bug when requesting quantile_values close to existing ones but off by a small tolerance
* never creates unsorted quantiles
* extrapolates outside the existing range by linearly interpolating on the logistic scale

dsweber2

Half of this is addressing a new issue right? Seems like you changed the extrapolation to use the same method regardless of region and refactored some functions?

I have lingering questions about things, but I wouldn't let those block.

dsweber2 · 2024-02-21T18:34:46Z

tests/testthat/test-dist_quantiles.R

+  l <- 1:9 / 10
+  v <- 1:9
+  distn <- dist_quantiles(list(v), list(l))
+  expect_equal(quantile(distn, c(.25, .75)), list(c(2.5, 7.5)))


Sanity check, this is 2.5 because that's halfway between 2 and 3, which are the .2 and .3 quantile values?
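
The arithmetic behind this check can be reproduced with base R's approx(), assuming plain linear interpolation between the tabulated quantile levels. This is only a sketch of the reasoning; the variable names are made up and the package's own implementation may differ in detail.

```r
# Quantile levels and values from the test above.
tau <- 1:9 / 10
values <- 1:9

# Linear interpolation: 0.25 sits halfway between the 0.2 and 0.3
# levels, and 0.75 halfway between the 0.7 and 0.8 levels.
interpolated <- approx(x = tau, y = values, xout = c(0.25, 0.75))$y
```

Under that assumption the two results are the midpoints of (2, 3) and (7, 8), which matches the expected list(c(2.5, 7.5)).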

dsweber2 · 2024-02-21T18:38:56Z

tests/testthat/test-dist_quantiles.R

+  expect_equal(
+    unlist(quantile(distn, c(.01, .05))),
+    tail_extrapolate(c(.01, .05), head(qv, 2))
+  )


Not sure I quite get how this is testing the tail behavior; is it just checking that quantile is using tail_extrapolate to calculate the values at those quantiles?
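
For context on what "linearly interpolating on the logistic scale" might look like, here is a rough sketch. The function logit_extrapolate() is hypothetical and is not the package's tail_extrapolate(), whose internals are not shown in this thread; it assumes a straight line through the two outermost (logit(level), value) pairs.

```r
# Hypothetical helper: extend a quantile function beyond its lowest
# two known levels by drawing a line in (qlogis(tau), q) space.
logit_extrapolate <- function(tau_out, tau, q) {
  stopifnot(length(tau) == 2, length(q) == 2)
  x <- qlogis(tau)               # known levels on the logit scale
  slope <- diff(q) / diff(x)     # slope of the line through both points
  q[1] + slope * (qlogis(tau_out) - x[1])
}

tau <- c(0.1, 0.2)  # two lowest known quantile levels (made-up data)
q <- c(1, 2)        # their quantile values
low <- logit_extrapolate(c(0.01, 0.05), tau, q)
```

Because qlogis() is increasing, a line with positive slope keeps the extrapolated values sorted and below the known ones, which is consistent with the "never creates unsorted quantiles" note in the commit message.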

dsweber2 · 2024-02-21T18:42:15Z

oh, I pushed the change for the styler, which seems to have been the only thing that was breaking the CI

dajmcdon · 2024-03-06T16:26:49Z

@dsweber2 Am I good to merge this?

dsweber2 · 2024-03-06T19:30:07Z

oh, I suppose I should've said that in addition to approving. Would you prefer I just merge if I approve when I'm tagged?

But yes, lgtm!

dajmcdon · 2024-03-06T19:33:12Z

Ah, no problem. I think better for the PR creator to merge once satisfied, but I lost track of whether we were both satisfied!

dajmcdon added 3 commits February 20, 2024 12:24

handle the quantile grouping correctly

1fc9813

existing tests pass, remove non-standard file

3094da9

add a basic test

63cd991

dajmcdon requested a review from dsweber2 February 20, 2024 20:48

dsweber2 requested changes Feb 20, 2024

dajmcdon added 5 commits February 20, 2024 18:06

add flatline test

aceeac4

fix: refactor quantile calculation

695aeb0

* avoids bug when requesting quantile_values close to existing ones but off by a small tolerance
* never creates unsorted quantiles
* extrapolates outside the existing range by linearly interpolating on the logistic scale

bump version

fb04f20

add to news

0165001

remove test for nonexistent functions

0d133df

dsweber2 approved these changes Feb 21, 2024

style only

3b8c889

dajmcdon merged commit 7a4ea55 into dev Mar 6, 2024

dajmcdon deleted the djm/resids-hotfix branch September 20, 2024 21:22
