-
Notifications
You must be signed in to change notification settings - Fork 16
Update Quidel Covid pipeline #1452
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
|
Conclusions:
|
Remaining work on this:
|
Deletions are complete, but checks are showing suspicious counts. Here's what was done:
The problem:
Current plan:
edit 20220317: Sample of 1000 rows found none missing. Trying a larger sample and preparing to check exhaustively. edit 20220328: Exhaustive check complete. No missing rows identified. All rows from the 21,635,443 were matched to an equivalent row from the 21,616,771 based on the following query:
The 21,635,443 were passed through Mystery solved, we're good to release. |
@krivard what does this mean? Why do we have issues< 20200208?
No worries, I think it is a typo. So the deletion work actually deleted 1,243,315 rows more than expected. I read the current process, it seems that the deletion work is not done by deleting the rows in the deletion CSV that I provided but just using that for checking, is that correct? |
If the answer to the previous question is yes. It could because that there is some gaps in the deletion file and our previous released data. I remembered that we have several times of update in this pipeline and that could generate different outputs. |
apologies -- I could not get the file you generated to match against the database, but never followed up. (also a lot of this is just notekeeping for me as I refine my analysis; I will continue to edit as I learn more) |
Details above -- deletions confirmed correct. |
We should have several updates to Quidel Covid Test data at the current stage of COVID.
The text was updated successfully, but these errors were encountered: