Skip to content

Docs: update nans codenames #536

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
May 10, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions docs/api/covidcast-signals/ght.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,8 +23,8 @@ arbitrary units (its scale is meaningless); larger numbers represent higher
numbers of COVID-related searches.

These signals were updated daily until March 8, 2021. After that date, Google
dropped support for Google Health Trends access. We recommend the [Google
Symptoms source](google-symptoms.md) as an alternative, which provides
dropped support for Google Health Trends access. We recommend the
[Google Symptoms source](google-symptoms.md) as an alternative, which provides
finer-grained measures of search volume at the symptom level.

| Signal | Description |
Expand Down
7 changes: 3 additions & 4 deletions docs/api/covidcast-signals/hhs.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,8 +27,7 @@ resource usage in different ways.
This indicator makes available several commonly-used combinations of
those columns, aggregated geographically. In particular, we include
the sum of all adult and pediatric COVID-19 hospital admissions. This
sum is used as the "ground truth" for hospitalizations by the [COVID-19
Forecast Hub](https://github.com/reichlab/covid19-forecast-hub/blob/master/data-processed/README.md#hospitalizations).
sum is used as the "ground truth" for hospitalizations by the [COVID-19 Forecast Hub](https://github.com/reichlab/covid19-forecast-hub/blob/master/data-processed/README.md#hospitalizations).


| Signal | Geography | Resolution | Description |
Expand All @@ -51,7 +50,7 @@ the HHS state timeseries dataset:

* `previous_day_admission_[adult|pediatric]_covid_[confirmed|suspected]`

The `confirmed` signal is the sum of the two `confirmed` columns:
The `confirmed` signal is the sum of the two `confirmed` columns:

* adult
* pediatric
Expand Down Expand Up @@ -95,6 +94,6 @@ published by the U.S. Department of Health & Human Services under an
[Open Data Commons Open Database License
(ODbL)](https://opendatacommons.org/licenses/odbl/1-0/). The ODbL
permits sharing, transformation, and redistribution of data or derived
works so long as all public uses are distributed under the ODbL and
works so long as all public uses are distributed under the ODbL and
attributed to the source. For more details, consult the official
license text.
4 changes: 2 additions & 2 deletions docs/api/covidcast-signals/nchs-mortality.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,8 +18,8 @@ grand_parent: COVIDcast Epidata API
This data source of national provisional death counts is based on death
certificate data received and coded by the National Center for Health Statistics
[(NCHS)](https://www.cdc.gov/nchs/nvss/vsrr/COVID19/index.htm). This data is
different from the death data available from [USAFacts](usa-facts.md) and [JHU
CSSE](jhu-csse.md): deaths are reported by the date they occur, not the date
different from the death data available from [USAFacts](usa-facts.md) and
[JHU CSSE](jhu-csse.md): deaths are reported by the date they occur, not the date
they are reported by local health departments, and data is frequently reissued
as additional death certificates from recent weeks are received and tabulated.

Expand Down
5 changes: 2 additions & 3 deletions docs/api/covidcast_geography.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,12 +48,11 @@ by their original sources at geographic resolutions which are too coarse.
## Small Sample Sizes and "Megacounties"

Most sources do not report the same amount of data for every county; for
example, since the survey sources rely on survey responses submitted each day, counties with small populations
example, since the survey sources rely on survey responses submitted each day, counties with small populations
may have comparatively few survey responses. We do not report individual county
estimates when small sample sizes would make estimates unreliable or would allow
identification of respondents, violating privacy and confidentiality agreements.
Additional considerations for specific signals are discussed in the [source and
signal documentation](covidcast_signals.md).
Additional considerations for specific signals are discussed in the [source and signal documentation](covidcast_signals.md).

On each day, in each state, we collect the data from all counties with
insufficient data to be individually reported. These counties are combined into
Expand Down
6 changes: 3 additions & 3 deletions docs/api/missing_codes.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,11 +13,11 @@ corresponds to the `value` column). The integer codes are as follows

| Code | Name | Description |
| --- | --- | --- |
| 0 | DEFAULT | This is the default value for when the field is not missing. |
| 0 | NOT MISSING | This is the default value for when the field is not missing. |
| 1 | NOT APPLICABLE | This value is used when the field is not expected to have a value (e.g. stderr for a signal that is not estimated from a sample). |
| 2 | REGION EXCEPTION | This value is used when the field is not reported because the particular indicator does not serve the geographical region requested. |
| 3 | PRIVACY | This value is used when the field has been censored for data privacy reasons. This could be due to reasons such as low sample sizes or simply a requirement from our data partners. |
| 3 | CENSORED | This value is used when the field has been censored for data privacy reasons. This could be due to reasons such as low sample sizes or simply a requirement from our data partners. |
| 4 | DELETED | This value is used when the field was present in previous issues, but is no longer reported. Deletions can arise due to bug fixes, changing censorship requirements, or data corrections from the source. |
| 5 | UNKNOWN | This value is used when the field is missing, but does not fall into any of the categories above. |
| 5 | OTHER | This value is used when the field is missing, but does not fall into any of the categories above. |

These codes are supplied as part of the `delphi_utils` Python library (see [here](https://pypi.org/project/delphi-utils/)).
8 changes: 3 additions & 5 deletions docs/symptom-survey/contingency-tables.md
Original file line number Diff line number Diff line change
Expand Up @@ -59,8 +59,7 @@ derived indicators will be included in aggregations for that period but will
only use a partial week or month of data.

CSVs for the month of January 2021 only use data from January 6-31 due to a
[definitional change in a major vaccine item on January
6](./coding.md#new-items-2).
[definitional change in a major vaccine item on January 6](./coding.md#new-items-2).

### Regions

Expand Down Expand Up @@ -150,9 +149,8 @@ indicator columns will be marked as missing (`NA`) for that group.

## Indicators

The files contain [weighted
estimates](../api/covidcast-signals/fb-survey.md#survey-weighting) of the
percent of respondents who fulfill one or several criteria. Estimates are
The files contain [weighted estimates](../api/covidcast-signals/fb-survey.md#survey-weighting)
of the percent of respondents who fulfill one or several criteria. Estimates are
broken out by state, age, gender, race, ethnicity, occupation, and health
conditions.

Expand Down