-
Notifications
You must be signed in to change notification settings - Fork 16
Ingest Quidel COVID test data #40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Have we received any of this data yet, so we know what format to expect? |
I talked to Roni this morning, it seems we have not got it yet.
Best,
Jingjing
… On May 27, 2020, at 2:24 PM, Alex Reinhart ***@***.***> wrote:
Have we received any of this data yet, so we know what format to expect?
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub <#40 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHP44VORLWCSRZM74CLW24TRTVLGHANCNFSM4NH3626Q>.
|
@krivard to find Jeremy Weiss and get an update on what the current volume is |
Volume is now 18-19k tests/day Substantial backfill, like ~1 month Jeremy's analyses are based on test date, not storage date, falling back to storage date if the test date is obviously wrong. Test date is the day of the assay not the day the patient's sample was taken. Data is already being sent to the drop!
@jingjtang and maybe @eujing to take on implementation, and ping @nloliveira for help if needed |
The estimator we want is the test positivity rate; tests per device is fine but lower priority |
18-19k cases per day includes backfill. We get <100 tests per state per day if you just see yesterday.
We also don't receive data every day; e.g. today the 14th, the most recent data is for the 12th. May be able to absorb this into our lag policy, revisit if not. |
@RoniRos would like to show any correlations plots we generate for this when he presents at the CDC meeting on |
Actually, my bad, they want it for a community-wide presentation on Tuesday (the weekly meetings that James and I usually attend). Maybe it's better if we schedule it for next Tuesday, and have someone from the Quidel team present it and answer questions. |
Some problems loading csv files into the correlations app (probably due to the client update from this weekend); uploaded |
Is there a document that explains our current naming convention, and lists all current sources and signals (in a more convenient format than this) ? |
Here they are. Quidel has not been included yet. https://cmu-delphi.github.io/delphi-epidata/api/covidcast_signals.html |
No — we switched to a composite signal a while ago. We get Puerto Rico from
JHU and everything else comes from USA Facts.
…On Mon, Jul 27, 2020 at 9:07 PM RoniRos ***@***.***> wrote:
In the API documentation in the link above, shouldn't the source for the
JHU Cases & Deaths be "JHU-CSSE" ?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#40 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAI24CQRAIG3S3RK2P3NYF3R5YQEFANCNFSM4NH3626Q>
.
|
|
In Quidel, there will be both Covid and flu raw data streams to ingest, and they might even at some point be integrated (once a joint test panel is approved and distributed). Then there will likely be multiple signals made for each, some of which will make it to the map. So I think it might make sense to declare QUIDEL to be the source, and keep the flu/covid designation to be part of the signal. However, note that none of our signals (or sources!) to date explicitly indicate Covid! We need to think how to handle the transition to multi-disease tracking. One solution is to create new signal names that explicitly include a disease name (Covid and flu for now), and map requests with old signal names to the Covid ones: initially silently, then with a gentle suggestion, and maybe at some point deprecate them. |
Source: |
@jingjtang will be giving a talk on this signal at Tuesday's meeting with the CDC community. |
Backfill problem in Quidel COVID (record here): This problem has been fixed after switching to upload files for -45 days to -5 days every day. (Minor issue but record here)
Until 07-30-2020, only 2,324 tests out of 833,010 tests for those zip codes. |
|
The number of tests reported for (day D to day D+5) is about 85% of the number total tests in the data received on a certain day. So for the data received on Aug 3rd, since we only upload the report for July 29th. The data for July 29th to Aug 3rd (~85%) will be ingested today or several days later. But there are 15% tests will never be reported since those reports have already been uploaded. (This will be fixed if we upload the report for data going back to at least one month ago every day). For your convenience, you can take a look at the same problem for Flu Test data here which is much more severe. |
@RoniRos This is the figure generated following your logic. For a certain x, we calculate the proportion of tests report for data D after x days among all of the tests that will be reported for date D eventually. Then we will get a time series vector ingested_prop. The corresponding y = 1 - median(ingested_prop). |
Thanks @jingjtang . I think it may be easier for people to understand if you invert the Y axis, namely, report the average fraction of the eventual number of tests (after, say, at least a month) that is reported D days after the date in question. You can simple title it "Fraction of tests reported", and call the X axis "Number of days after the test date". Y axis could be "Fraction of tests reported". This is the only slide you need on this topic. |
Released in 1.7a on 4 August. |
Antigen test results will start coming to us Monday. Not as high-quality as PCR, but much faster, and still good enough for clinical use.
The text was updated successfully, but these errors were encountered: