-
Notifications
You must be signed in to change notification settings - Fork 67
NAN coding database and acquisition changes #417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
could you add some integration tests for the acquisition step? |
what do you mean by integration tests? I'm just vaguely familiar with the term. like test communication with a database? |
Yea, see the integrations/ folder for some examples. They rely on a local web server/db and can run the acquisition end to end |
OBSOLETE: Didn't need an expected geomap.Recording some decision logs with @krivard. So there’s a bit of variation in county coverage across the indicators. A majority of the indicators have 3k+ counties, three have ~1500, ~2000, ~2500, and two outliers have ~100 and ~900.
To construct an expected geo map for an indicator, we need to consider the time window to use for it. If we use the full time window, we’ll get everything, but we might get geos that have stopped being reported. While this would add some nans to the database, this doesn’t seem to be a big issue though. As the table below shows, most counties that have reported at least once in the full time window reported at least once in the last month, which suggests that we won't have many empty counties reporting nothing but nans.
Our current plan is to make a utility that will build custom expected geo maps for some of the indicators. Given the slow rate of change in the geos, we will schedule an update to this list once a month. |
* add 3 missing columns to covidcast table schema * update acquisition with validation and of new missing columns in CSVs * update api server to serve the missing columns * add integration tests for the new columns * Ben Smith and Andrew Chin's review suggestions integrated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i found a couple nits, but otherwise LGTM
Co-authored-by: melange396 <[email protected]>
Co-authored-by: melange396 <[email protected]>
Ooof, let's wait to merge this until next week, I found a couple things I'd like to fix. |
Co-authored-by: Katie Mazaitis <[email protected]>
Co-authored-by: Katie Mazaitis <[email protected]>
Co-authored-by: Katie Mazaitis <[email protected]>
Co-authored-by: Katie Mazaitis <[email protected]>
Co-authored-by: Katie Mazaitis <[email protected]>
If our infra uses Docker, then a requirements file needs to be updated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
xlnt
ci: require build to be successful before creating image
A few things coming up while merging Flask server and working with Docker:
I'm getting a few failing tests due to databases not existing at the moment. |
Can you paste the exact errors in thread? CI doesn't run on forks so we can't see what you're seeing. |
Once I installed the right dependencies and added the flag |
Once I updated the |
For future ref, this link to the CI recipe from @sgratzl was really helpful. Going to add that to the ops docs. |
🙏 thank you for both going on that goose chase and ensuring future-us won't have to chase that particular goose again! If you change this to merging into |
We should incorporate these naming changes to |
https://pypi.org/project/delphi-utils/0.1.1/ is up; merging |
This PR begins to address issue #195. Indicator and epidata client PRs to follow. See the design doc for more info. The following changes are implemented:
TODO: