Update quidel covidtest (Add Age Groups Signals, Add rest-of-state reports) #1467

Merged: 31 commits merged into main from Add_Age_Group_to_QuidelCovidtest on Feb 8, 2022

Conversation

@jingjtang (Contributor) commented Jan 14, 2022

Description

  • Add age group specific signals for Quidel Covid
    • Use the age group breakdown: 0-4, 5-17, 18-49, 50-64, 65+, 0-17 (supergroup)
    • Stick with the current availability threshold of 50, for 1) consistency with what we have already published, 2) statistical considerations, and 3) little difference if the threshold is lowered
  • Add megacounties to Quidel Covid
    • Megacounties are computed before the availability threshold check.
    • For raw signals, counties with counts <= 50 are merged into megacounties.
    • For smoothed signals, counties with counts <= 25 are merged into megacounties.
    • After merging, if the count for a megacounty does not pass the availability threshold, we report nothing for that megacounty.
  • Live docs here for availability checking and correlation analysis
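
The megacounty rule above can be sketched as follows. This is an illustrative sketch, not the actual pipeline code: the function name, the (state, county) keys, and the "000" megacounty suffix are assumptions for the example.

```python
def merge_megacounties(county_counts, smooth):
    """Sketch of the megacounty rule described above (hypothetical helper).

    county_counts maps (state, county_fips) -> sample count. Counties at or
    below the merging threshold (50 for raw signals, 25 for smoothed ones)
    are pooled into a state-level megacounty; the megacounty itself is only
    reported if it passes the 50-count availability threshold.
    """
    threshold = 25 if smooth else 50
    reported, mega = {}, {}
    for (state, fips), count in county_counts.items():
        if count <= threshold:
            mega[state] = mega.get(state, 0) + count  # pool into megacounty
        else:
            reported[(state, fips)] = count
    for state, count in mega.items():
        if count > 50:  # availability threshold check for the megacounty
            reported[(state, state + "000")] = count
    return reported
```

With smooth=False, a state whose small counties sum past 50 gains a megacounty row; with smooth=True, only counties at or below 25 are pooled, so the megacounty may fail the availability check and be dropped.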

Changelog

  • constants.py, pull.py, run.py, geo_map.py, generate_sensor.py
  • tests/test_constants.py, tests/test_pull.py, tests/test_run.py, tests/test_geo_map.py, tests/test_generate_sensor.py, tests/test_data

Fixes

@jingjtang jingjtang requested a review from krivard January 14, 2022 14:42
@jingjtang (Contributor, Author)

We need new signal names for the age-group specific signals.
The current ones:

  • covid_ag_raw_pct_positive_age_0to4
  • covid_ag_raw_pct_positive_age_5to17
  • covid_ag_raw_pct_positive_age_18to49
  • covid_ag_raw_pct_positive_age_50to64
  • covid_ag_raw_pct_positive_age_65toOlder
  • covid_ag_raw_pct_positive
  • covid_ag_smoothed_pct_positive_age_0to4
  • covid_ag_smoothed_pct_positive_age_5to17
  • covid_ag_smoothed_pct_positive_age_18to49
  • covid_ag_smoothed_pct_positive_age_50to64
  • covid_ag_smoothed_pct_positive_age_65toOlder
  • covid_ag_smoothed_pct_positive

Need @RoniRos @ryantibs @krivard 's approval for the signal names.

@krivard (Contributor) commented Jan 14, 2022

We need new signal names for the age-group specific signals.

Let me find what we're using in covid_hosp; we should be consistent where possible

@krivard (Contributor) commented Jan 14, 2022

Recommended set:

  • covid_ag_raw_pct_positive_age_0to4 -> covid_ag_raw_pct_positive_age_0_4
  • covid_ag_raw_pct_positive_age_5to17 -> covid_ag_raw_pct_positive_age_5_17
  • covid_ag_raw_pct_positive_age_18to49 -> covid_ag_raw_pct_positive_age_18_49
  • covid_ag_raw_pct_positive_age_50to64 -> covid_ag_raw_pct_positive_age_50_64
  • covid_ag_raw_pct_positive_age_65toOlder -> covid_ag_raw_pct_positive_age_65plus
  • covid_ag_raw_pct_positive
  • covid_ag_smoothed_pct_positive_age_0to4 -> covid_ag_smoothed_pct_positive_age_0_4
  • covid_ag_smoothed_pct_positive_age_5to17 -> covid_ag_smoothed_pct_positive_age_5_17
  • covid_ag_smoothed_pct_positive_age_18to49 -> covid_ag_smoothed_pct_positive_age_18_49
  • covid_ag_smoothed_pct_positive_age_50to64 -> covid_ag_smoothed_pct_positive_age_50_64
  • covid_ag_smoothed_pct_positive_age_65toOlder -> covid_ag_smoothed_pct_positive_age_65plus
  • covid_ag_smoothed_pct_positive

@ryantibs (Member)

Katie's last naming scheme: 👍 from me.

@jingjtang (Contributor, Author)

As suggested by @RoniRos, we can add two more super age groups, 0-17 and 18-64, so that we keep information from the small age groups (like 0-4) while not being affected too much by the availability threshold.

@RoniRos (Member) commented Jan 14, 2022 via email

@RoniRos (Member) commented Jan 14, 2022 via email

@jingjtang (Contributor, Author) commented Jan 14, 2022

@ryantibs @RoniRos
[Image: location_nums_over_time]

The new availability is shown here. I think those two super age groups are worth adding.

@RoniRos (Member) commented Jan 14, 2022 via email

@ryantibs (Member) commented Jan 14, 2022

It's quite hard for me to tell based on the current plots whether the tradeoff is actually worth it. I suspect it may not be. The easiest & most direct way to tell would be to add up the availability of the subgroups within each combined group.

E.g., just compare the availability of 0_4 + 5_17 to 0_17 (so just comparing two lines), and likewise for the other case.

@RoniRos (Member) commented Jan 14, 2022 via email

@ryantibs (Member)

Yes sorry, my "+" was supposed to be an "OR". So I think we are saying equivalent things.

@RoniRos (Member) commented Jan 15, 2022

Sorry to be a pest, but I still don't understand. Why would you want to do an OR, rather than add up the two differences I listed as an example? If there is a county where neither 0-4 nor 5-17 is available, arguably it should count twice as much as a county where only one of them is unavailable. No?

Anyway, what are the use cases you envision for these signals?

@RoniRos (Member) commented Jan 15, 2022

One could argue that when, say, 0-17 and 5-17 are available but 0-4 is not, one could derive an estimate of 0-4 from the other two, based on their proportion in the population (since we don't have their proportion among the tests). So the case when neither 0-4 nor 5-17 is available is by far the worst.

@ryantibs (Member)

Not a pest, I just wasn't reading/thinking carefully! Yes, you're right, your message is alluding to something different.

But we wouldn't be able to find out from your counts whether in a particular county we have what you call the "worst case" (neither 0-4 nor 5-17 is available), since these are just marginal counts, right? And yes, I buy your argument. So I guess the following four numbers would be useful:

How many counties that do not support a 0-4 signal, support a 0-17 signal.
How many counties that do not support a 5-17 signal, support a 0-17 signal.
How many counties that do not support a 0-4 signal OR do not support a 5-17 signal, support a 0-17 signal.
How many counties that do not support a 0-4 signal AND do not support a 5-17 signal, support a 0-17 signal.

@jingjtang Can you do this, and do the same for the other category 18-64, so that we can get a sense of the value add?
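
The four counts requested above can be computed from per-signal sets of counties that pass the availability threshold. A minimal sketch (the function and argument names are hypothetical):

```python
def supergroup_value_add(avail_0_4, avail_5_17, avail_0_17):
    """Each argument is a set of county FIPS codes where that age-group
    signal passes the availability threshold. Returns the four counts
    requested above (illustrative helper, not pipeline code)."""
    return {
        # counties supporting 0-17 but not 0-4
        "0_17 but not 0_4": len(avail_0_17 - avail_0_4),
        # counties supporting 0-17 but not 5-17
        "0_17 but not 5_17": len(avail_0_17 - avail_5_17),
        # counties supporting 0-17 but missing at least one subgroup
        "0_17 but not OR": len(avail_0_17 - (avail_0_4 & avail_5_17)),
        # counties supporting 0-17 but missing both subgroups
        "0_17 but not AND": len(avail_0_17 - (avail_0_4 | avail_5_17)),
    }
```

The same computation applies verbatim to 18-49, 50-64, and the 18-64 supergroup.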

@jingjtang (Contributor, Author) commented Jan 15, 2022

@RoniRos We initially added the availability checking for all-age groups here, but the county-only curve was removed later, solely to give a clearer view of the bottom area for the age-specific signals.

In case you are not able to jump to the Slack thread, the figure is attached below.
[Image]

(The first legend entry should be 0-4, not 0-5.)

@RoniRos (Member) commented Jan 15, 2022

Ok.
From the curves Jingjing published so far, it's clear there is a non-negligible number of counties "rescued" by the merging. But we still don't have a clear formula to help us decide whether or not to include these merged groups.

I just browsed some of CDC's data and found many places where they present age breakdowns as 0-17, 18-49, 50-64, 65+. E.g. https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/burden.html

So my suggestion is that we at least include 0-17.

@jingjtang (Contributor, Author)

@ryantibs
0_17 but not 0_4: Available for 0-17 but not available for 0-4
0_17 but not 5_17: Available for 0-17 but not available for 5-17
0_17 but not OR: Available for 0-17 but missing at least one of 0-4 and 5-17
0_17 but not AND: Available for 0-17 but missing both 0-4 and 5-17

Similar ones for 18-64

[Image: comparison_with_supergroups]

@ryantibs (Member)

@RoniRos @jingjtang Thanks!

I'm convinced we should add 0-17. We can just make it clear in our documentation that we also wanted to separate this out into two subcategories due to the fact that 0-4 is an unvaccinated population. This is a clean explanation as to why there is overlap/nesting in the lowest 3 categories (0-4, 5-17, 0-17).

I'm less convinced we should add 18-64. Rationale: if we rescue any counties with this big bucket, then the signal we publish is probably going to be very similar to the all-ages signal. (Because I'm guessing this big bucket includes the most common ages from a marginal perspective, and if the signals at 18-49 and 50-64 are each missing, then it must mean we're just barely below the availability threshold we set, and 18-64 puts us just above.)

So I suggest we just publish 0-17, along with the existing breakdowns, and finish this off ASAP. However, not strongly opposed to also including 18-64, if Roni or somebody else feels differently. Thanks!

@jingjtang (Contributor, Author) commented Jan 25, 2022

@ryantibs

  1. The plot titles say "< 25 counts for every consecutive 7 days". I just want to check to understand what you're plotting. You're plotting the sum of counts in 7-day rolling windows, and then reporting how many times that's < 25, right?

Yes, you are correct. To be more clear, the smoothed signals use the sum of counts in 7-day rolling windows as the sample size in our API, which is different from the raw signals. And for the time series plot (the leftmost one), I checked how many counties have < 25 counts on each date (if we switch to option 2, those counties will become entirely unavailable).

  2. In the above plots with Counties 21079 and 8107, the straight blue line is just a plotting artifact right? I assume the real signal value is missing there, and the plotting behavior is just to connect it by a straight line?

Yes, it is just a plotting artifact. In fact, we only have values reported at the two endpoints; all the dates in between are unavailable for that county. Remember that we also have counties with fewer than 10 days available due to geographical pooling (they have < 10 days with counts in [25, 50] and the rest of the days with counts < 25). I didn't show those examples since the figures are dominated by the option 1 line, but you can imagine how they look. We do have such counties; the numbers are shown in the middle panel of the time series plot.

@ryantibs (Member)

Thank you. I think option 2 is the winner in my mind, but I will have a decision by 5pm (just to be sure, I will check whether Roni agrees with me at the team-leads meeting).

@ryantibs (Member)

Hi @jingjtang, confirming that we should go with option 2. Thank you for working hard to compare the options.

@jingjtang (Contributor, Author)

Thanks @ryantibs. @dshemetov The pipeline is ready for review with the new strategy, described in detail below for your reference:

  • As for the smoothed signals:

    • If counts are smaller than 50, we do geo shrinkage. We borrow no more than the counts we currently have; for example, if county c has 2 counts, we can borrow at most 2 pseudo counts from its home state.

    • If counts are larger than 50, we don't borrow anything.

    • The counts for smoothed signals are the sum of counts in 7-day moving windows.

    • After the temporal and geographical pooling, we still run an availability check: if the number of counts is larger than 50, we report the value; otherwise, we report nothing.

  • As for the raw signals:

    • For counts smaller than 50, we report nothing.
    • For counts larger than 50, we report what we have.
    • The counts for raw signals are just the counts for the corresponding date (unlike the smoothed signals).
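
Because a county borrows at most its own count, it can at most double its pooled sample size, which is why the [25, 50) band is exactly the one rescued by geographic pooling. A sketch under these assumptions (names are hypothetical; the exact > vs >= boundary handling is illustrative):

```python
MIN_OBS = 50  # availability threshold used throughout this discussion

def pooled_sample_size(own_count):
    """Sketch of the geo-shrinkage rule for smoothed signals described
    above (hypothetical helper, not the actual pipeline code).
    own_count is the county's 7-day rolling sum of test counts. A county
    may borrow pseudo counts from its home state, but never more than it
    already has."""
    if own_count >= MIN_OBS:
        return own_count  # no borrowing needed
    borrowed = min(own_count, MIN_OBS - own_count)
    return own_count + borrowed

def report(own_count):
    """Apply the availability check after temporal + geographic pooling:
    return the pooled sample size if it passes, else None."""
    total = pooled_sample_size(own_count)
    return total if total >= MIN_OBS else None
```

Counts below 25 can never reach 50 even after borrowing, counts in [25, 50) reach exactly 50, and counts at or above 50 pass untouched.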

@jingjtang jingjtang requested a review from dshemetov January 26, 2022 00:47
@dshemetov (Contributor) left a comment

I think I found a small bug, see the comments. Otherwise looks good!

# For raw signals, the threshold is MIN_OBS
# For smoothed signals, the threshold is MIN_OBS / 2
if smooth:
    threshold_visits = MIN_OBS / 2

What's the reason for this change in thresholding here?

@jingjtang jingjtang requested a review from dshemetov January 27, 2022 19:27
@dshemetov (Contributor) left a comment

LGTM!

[(14+parent_pos*22/parent_test+0.5)/(50+1)*100,
(26+parent_pos*1/parent_test+0.5)/(50+1)*100,
(24+0.5)/(60+1)*100,
(32+0.5)/(64+1)*100], equal_nan=True)

This is excellent, thank you!

@jingjtang jingjtang requested a review from krivard January 28, 2022 18:16
@krivard (Contributor) commented Jan 28, 2022

Three questions:

  • Will this change require us to reissue any past data?
  • Does this change require any updates to the API documentation?
  • Are there any special instructions for the first run of the pipeline once this change is released?

@jingjtang (Contributor, Author)

@krivard

  • Will this change require us to reissue any past data?
    Yes, but it only needs deletions. Some counties for the smoothed signals will be deleted and no longer reported. For the counties that still pass the availability threshold, the values are unchanged.
  • Does this change require any updates to the API documentation?
    We can update it to explain the current strategy more clearly, but the current documentation matches the current code.
  • Are there any special instructions for the first run of the pipeline once this change is released?
    Yes. If we run all the data back to the very first date for Quidel (2020-05-26), the cache should be deleted before the first run.

By the way, since it's a good time to re-run the past data, I want to ask whether we can extend the export range for Quidel from 45 days to 175 days (if storage allows). This means that for the daily run, we would generate reports for dates from -180 days to -5 days instead of -50 days to -5 days. The backfill-related pipeline would benefit from this.

@ryantibs (Member)

Sorry if this is contributing noise, but I just want to double check --- apart from the new stricter way of pooling, and the new signals --- 1. that we'll get megacounties, and 2. that the national signal (and all signals) will take values for reference dates all the way back to the start of the time period. These are things I've asked for on other issues or messages.

@jingjtang (Contributor, Author) commented Jan 29, 2022

Sorry if this is contributing noise, but I just want to double check --- apart from the new stricter way of pooling, and the new signals --- 1. that we'll get megacounties, and 2. that the national signal (and all signals) will take values for reference dates all the way back to the start of the time period. These are things I've asked for on other issues or messages.

@ryantibs Yes, we will have

  1. Megacounties: the same smoothing method is used for the smoothed signals. For the raw signals, counties with counts less than 50 are merged into a megacounty; for the smoothed signals, counties with counts less than 25 are merged into a megacounty.
  2. Age-specific signals: 0-4, 5-17, 0-17, 18-49, 50-64, 65+
  3. If the engineering side allows, we will have at least one run to generate all the reports back to the start (2020-05-26), though maybe not all the issues (@krivard @korlaxxalrok)
  4. New availability threshold for smoothed signals, considering the sample size after temporal pooling:
  • < 25: report nothing (different from the previous strategy)
  • [25, 50): geographical pooling
  • [50, +infinity): report as-is

for this PR.

@krivard (Contributor) commented Jan 31, 2022

Will this change require us to reissue any past data? -> Yes, but only needs deletion.

Cool. To do this we will need the full list of (issue, reference date, geo type, geo value) to delete.

Does this change require any updates to the API documentation? -> No

Acknowledged

Are there any special instructions for the first run of the pipeline once this change is released? -> Yes: delete the cache

Acknowledged

extend the export range for Quidel from 45 days to 175 days?

That's just shy of 4x, on top of the 7x we're adding with this PR -- I don't recommend it. Quidel currently adds ~110k rows to the database per day, out of a total ~2.8M rows added daily across all indicators. Here are some rough figures imagining the impact on the database if we did that (an upper bound; I imagine the age-group signals have lower coverage than all-ages, and so are somewhat less than 7x):

| scenario | quidel rows added per day | total rows added per day | quidel pct of total |
| --- | --- | --- | --- |
| current | 110k | 2.8M | 4% |
| add age groups | 770k | 3.6M | 22% |
| extend export range | 440k | 3.1M | 14% |
| age groups + export range | 3.1M | 5.8M | 54% |

I do notice that quidel isn't currently using the archive-differ, so some of those 110k rows might be duplicates. Do you happen to know the actual rate of new/updated figures vs duplicates in each submission?

@krivard (Contributor) commented Jan 31, 2022

Releasing this change has some subtle details -- here's a draft of our options. Pick:

  • (A or B) plus (C or D)
  • E

A: If we think the initial run will be fast (<40 minutes)

  1. Merge this PR
  2. Release covidcast-indicators
  3. On prod, clear the pulled_until cache. Swap out the production params file with one that will export from 2020-02 to present. Run the indicator. Swap the params file back to the production version.

B: If we think the initial run will be slow

  1. Merge this PR
  2. On staging, clear the pulled_until cache. Set the params file to export from 2020-02 to present. Run the indicator. Drop the resulting files off in production receiving.
  3. Release covidcast-indicators
  4. On prod, clear the pulled_until cache.

C: If we think the deletions will be fast (<4 hours)

Delete any time after A2 or B3

D: If we think the deletions will be slow

Do the deletions need to be in place relatively simultaneously with release? Is it bad for the database to spend multiple days with deletions only partially applied?

Di: if we need to be rigorous

Schedule a 3-day patch period the same way we did with the JHU patch last fall. Start A1 or B1 on day one of the patch. Complete A/B before concluding the patch period.

Dii: if we can be more flexible

Delete in batches daily until done. Could take place before, during, or after A/B, depending on if we want deletions finished before the new code runs, or if we want the new code to run before we finalize the set of deletions needed.

E: If we want archive-differ to handle deletions

If we don't care about deleting from past issues, archive-differ will automatically mark as deleted anything that switches from showing up in an output file to not-showing-up in an output file. "Mark as deleted" means the data is still in the database, and will appear in as-of queries for as-ofs before the deletion date, but will not appear in most-recent queries or in as-of queries for as-ofs after the deletion date.

  1. Set up archive-differ bucket in S3
  2. Add archiver section to quidel params
  3. Initialize the archive bucket with the current snapshot: On staging, set the params file to export from 2020-02 to present. Run the indicator using indicator-runner. Drop the resulting files off in production receiving.
  4. Merge this PR
  5. Release covidcast-indicators
  6. On prod, clear the pulled_until cache. Swap out the production params file with one that will export from 2020-02 to present. Run the indicator. Swap the params file back to the production version.
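
The mark-as-deleted behavior described in option E can be illustrated with a toy snapshot diff (illustrative only, not the actual archive-differ implementation):

```python
def diff_snapshots(previous, current):
    """Sketch of the archive-differ behavior described above (hypothetical
    helper). Each snapshot maps row keys to values. Rows present in the
    previous snapshot but absent now are marked deleted; they remain
    queryable for as-ofs before the deletion date, but disappear from
    most-recent queries."""
    added_or_changed = {
        k: v for k, v in current.items()
        if k not in previous or previous[k] != v  # only new/updated rows
    }
    deleted = sorted(k for k in previous if k not in current)
    return added_or_changed, deleted
```

Only the added_or_changed rows would be written to the database each day, which is what makes the differ cut row volume so sharply for mostly-unchanged submissions.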

@jingjtang (Contributor, Author) commented Jan 31, 2022

@krivard

Do you happen to know the actual rate of new/updated figures vs duplicates in each submission?

I do not have that kind of data in hand. Do you want me to check that?

Choice between A and B: the runtime will be longer than 40 minutes, around 60 minutes or so, and no more than 120 minutes in my experience.

Choice for deletion: I think Dii is better. We can have the new signals produced as soon as possible and apply the corrections to the historical releases afterward.

We do need corrections (deletions) for past issues, so I wouldn't choose E.

@krivard (Contributor) commented Feb 1, 2022

Do you happen to know the actual rate of new/updated figures vs duplicates in each submission?

I do not have that kind of data in hand. Do you want me to check that?

Yes please; just for a week or two worth of data should be fine.

@jingjtang (Contributor, Author)

@krivard Here is a table showing the percentage of zero diffs:

  • We compare the val reported today with the val reported yesterday.
  • zero_diff_pct = 90 means that on issue date D for a specific signal, 90% of the reported vals are unchanged across all counties considered and across all dates reported (-50 days to -5 days).
  • If yesterday's report for location i and date d is NA, but today's is not NA, then there is a non-zero diff.
  • The issue dates range from 2021-12-21 to 2021-12-28.

We can see that for the smallest geographical level (county), we have zero_diff_pct >= 80 for most of the signals. It looks good to apply the archive-differ to quidel_covidtest too.
quidel_zero_diff_pct.csv
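
The zero_diff_pct computation described above can be sketched as follows (hypothetical helper; the real comparison runs over the exported CSV rows):

```python
def zero_diff_pct(yesterday, today):
    """yesterday/today: dicts mapping (geo, date) -> reported value, with
    missing entries treated as NA. Returns the percentage of today's
    reported values that are unchanged from yesterday's report. A value
    reported today that was NA yesterday counts as a non-zero diff."""
    if not today:
        return 0.0
    unchanged = sum(
        1 for key, val in today.items()
        if key in yesterday and yesterday[key] == val
    )
    return 100 * unchanged / len(today)
```

A high zero_diff_pct means most rows re-exported each day are exact duplicates, which is the case where the archive-differ pays off.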

@krivard (Contributor) commented Feb 2, 2022

Excellent news!

So if the update to the smoothing means we expect to drop 15-50% of new rows we currently report each day for quidel,
and adding archive-differ means we expect to drop 80-90% of new rows we currently report each day for quidel,
that gives us a 5x-20x total reduction depending on how those two sets overlap.

Let's install archive-differ, release this, see where we are, and then revisit expanding the export range once we have a better idea of where we wind up. That'll look like E without the initialization step, or:

  1. Set up archive-differ bucket in S3 -- Done: quidel
  2. Add archiver section to quidel params
  "archive": {
    "aws_credentials": {
      "aws_access_key_id": "{{ delphi_aws_access_key_id }}",
      "aws_secret_access_key": "{{ delphi_aws_secret_access_key }}"
    },
    "bucket_name": "delphi-covidcast-indicator-output",
    "cache_dir": "./archivediffer_cache",
    "indicator_prefix": "quidel"
  }
  3. Merge this PR
  4. Release covidcast-indicators
  5. On prod, add the archivediffer_cache directory. Clear the pulled_until cache. Swap out the production params file with one that will export from 2020-02 to present. Run the indicator. Swap the params file back to the production version.

@jingjtang (Contributor, Author) commented Feb 6, 2022

@krivard File created to provide deletion info: https://cmu.box.com/s/y41vjkj8z8cjvaqgtz8g5phagms1i3t0
Support code here: https://github.com/cmu-delphi/covidcast-indicators/blob/quidel_deletion/quidel_covidtest/delphi_quidel_covidtest/quidel_deletion_info.py

I did the sanity check by:

  • randomly checking the values and stderr to see whether they match what we have in our API
  • verifying that the sample sizes are all 50
  • verifying that the number of reports for each date at the county level seems reasonable

But it would still be good to have someone else run another sanity check on this.

@krivard krivard merged commit ea31835 into main Feb 8, 2022
@krivard krivard deleted the Add_Age_Group_to_QuidelCovidtest branch February 8, 2022 16:02
Successfully merging this pull request may close these issues: Update Quidel Covid pipeline