Skip to content

Commit 184e1f8

Browse files
authored
Merge branch 'docs/1.11' into safegraph_pattern_api_docs
2 parents 213616a + 2448628 commit 184e1f8

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

48 files changed

+146
-118
lines changed

README.md

Lines changed: 15 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -8,26 +8,22 @@ This is the home of [Delphi](https://delphi.cmu.edu/)'s epidemiological data
88
API. See our [API documentation](https://cmu-delphi.github.io/delphi-epidata/)
99
for details on the available data sets, APIs, and clients.
1010

11-
# COVID-19 Notice
11+
## COVIDcast
1212

13-
We are working on collecting several new data sources that may be useful for
14-
nowcasting and forecasting ILI during the COVID-19 pandemic. Each of these will
15-
make these available as soon as individually possible, through our [covidcast
16-
endpoint](https://cmu-delphi.github.io/delphi-epidata/api/covidcast.html).
13+
At the present, our primary focus is developing and expanding the
14+
[COVIDcast API](https://cmu-delphi.github.io/delphi-epidata/api/covidcast.html),
15+
which offers a number of data streams relating to the ongoing COVID-19
16+
pandemic and powers the [COVIDcast project](https://covidcast.cmu.edu/).
1717

18-
For a list of many other data sources relevant to COVID-19, publicly available
19-
through external sites, we have compiled a simple
20-
[spreadsheet](https://docs.google.com/spreadsheets/d/16Nn_3ZvSLnpxRyA2DkoMMzyrd11-AlGJXasS0owln88/edit#gid=0).
18+
Our other, non-COVID data sources remain generally available on a best-effort
19+
basis. However, non-COVID sources should be held to scrutiny at this time since
20+
they were designed to serve as indicators of typical seasonal ILI
21+
(influenza-like illness), not pandemic ILI or CLI (COVID-like illness).
2122

22-
**Note:** apart from the COVID-19-specific data sources described above, the
23-
rest of this repository was built to support modeling and forecasting efforts
24-
surrounding seasonal influenza (and dengue). All other data sources than those
25-
in our covidcast endpoint should be **held to great scrutiny** (if they are even
26-
to be considered at all in this pandemic period), since they were designed to
27-
serve as indicators of typical seasonal ILI (influenza-like illness), and
28-
certainly not pandemic ILI or CLI (covid-like illness).
23+
For a high-level introduction to the COVIDcast API, see our recent
24+
[blog post](https://delphi.cmu.edu/blog/2020/10/07/accessing-open-covid-19-data-via-the-covidcast-epidata-api/).
2925

30-
## Contributing
26+
# Contributing
3127

3228
If you are interested in contributing:
3329

@@ -37,36 +33,19 @@ If you are interested in contributing:
3733
[open an issue](https://github.com/cmu-delphi/delphi-epidata/issues/new)
3834
describing your idea.
3935

40-
## Citing
36+
# Citing
4137

4238
We hope that this API is useful to others outside of our group, especially for
4339
epidemiological and other scientific research. If you use this API and would
4440
like to cite it, we would gratefully recommend the following copy:
4541

4642
> David C. Farrow,
4743
> Logan C. Brooks,
48-
> Aaron Rumack,
4944
> Ryan J. Tibshirani,
5045
> Roni Rosenfeld
5146
> (2015).
5247
> _Delphi Epidata API_.
5348
> https://github.com/cmu-delphi/delphi-epidata
5449
55-
# Related work
56-
57-
- Cook, Samantha, et al. "Assessing Google flu trends performance in the United
58-
States during the 2009 influenza virus A (H1N1) pandemic." PloS one 6.8
59-
(2011): e23610.
60-
- Broniatowski, David A., Michael J. Paul, and Mark Dredze. "National and local
61-
influenza surveillance through Twitter: an analysis of the 2012-2013
62-
influenza epidemic." (2013): e83672.
63-
- Dredze, Mark, et al. "HealthTweets. org: A Platform for Public Health
64-
Surveillance using Twitter." AAAI Conference on Artificial
65-
Intelligence. 2014.
66-
- Generous, Nicholas, et al. "Global disease monitoring and forecasting with
67-
Wikipedia." (2014): e1003892.
68-
- Hickmann, Kyle S., et al. "Forecasting the 2013–2014 Influenza Season Using
69-
Wikipedia." (2015): e1004239.
70-
- McIver, David J., and John S. Brownstein. "Wikipedia usage estimates
71-
prevalence of influenza-like illness in the United States in near real-time."
72-
PLoS Comput Biol 10.4 (2014): e1003581.
50+
See [related work](docs/related_work.md) for a sample of additional work in
51+
this area.

docs/api/README.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,14 @@
11
---
2-
title: Delphi Epidata API
2+
title: Epidata API (Other Epidemics)
33
nav_order: 3
44
has_children: true
55
---
66

77
# Delphi Epidata API
88

99
This is the home of [Delphi](https://delphi.cmu.edu/)'s epidemiological data
10-
API.
10+
API for tracking epidemics such as influenza, dengue, and norovirus. Note that
11+
our work on COVID-19 is available in the [COVIDcast Epidata API documentation](covidcast.md).
1112

1213
## Contributing
1314

docs/api/afhsb.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: AFHSB
3-
parent: Delphi Epidata API
3+
parent: Epidata API (Other Epidemics)
44
---
55

66
# AFHSB

docs/api/cdc.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: CDC
3-
parent: Delphi Epidata API
3+
parent: Epidata API (Other Epidemics)
44
---
55

66
# CDC

docs/api/covidcast-signals/_source-template.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: SOURCE NAME
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# SOURCE NAME

docs/api/covidcast-signals/doctor-visits.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,15 @@
11
---
22
title: Doctor Visits
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Doctor Visits
88
{: .no_toc}
99

1010
* **Source name:** `doctor-visits`
11-
* **Number of data revisions since 19 May 2020:** 0
12-
* **Date of last change:** Never
11+
* **Number of data revisions since 19 May 2020:** 1
12+
* **Date of last change:** 9 November 2020
1313
* **Available for:** county, hrr, msa, state (see [geography coding docs](../covidcast_geography.md))
1414

1515
This data source is based on information about outpatient visits, provided to us

docs/api/covidcast-signals/fb-survey.md

Lines changed: 17 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Symptom Surveys
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Symptom Surveys
@@ -30,13 +30,6 @@ described in the sections below:
3030
3. [Testing indicators](#testing-indicators) based on respondent reporting of
3131
their COVID test results
3232

33-
For most `raw_` signals below, there are additional signals with names beginning
34-
with `smoothed_`. These estimate the same quantities as the above signals, but
35-
are smoothed in time to reduce day-to-day sampling noise; see [details
36-
below](#smoothing). Crucially, because the smoothed signals combine information
37-
across multiple days, they have larger sample sizes and hence are available for
38-
more counties and MSAs than the raw signals.
39-
4033
## Table of contents
4134
{: .no_toc .text-delta}
4235

@@ -84,14 +77,14 @@ influenza-like illness (fever, along with cough or sore throat). Using this
8477
survey data, we estimate the percentage of people who have a COVID-like illness,
8578
or influenza-like illness, in a given location, on a given day.
8679

87-
| Signal | Description |
80+
| Signals | Description |
8881
| --- | --- |
89-
| `raw_cli` | Estimated percentage of people with COVID-like illness based on the [criteria below](#ili-and-cli-indicators), with no smoothing or survey weighting |
90-
| `raw_ili` | Estimated percentage of people with influenza-like illness based on the [criteria below](#ili-and-cli-indicators), with no smoothing or survey weighting |
91-
| `raw_wcli` | Estimated percentage of people with COVID-like illness; adjusted using survey weights [as described below](#survey-weighting) |
92-
| `raw_wili` | Estimated percentage of people with influenza-like illness; adjusted using survey weights [as described below](#survey-weighting) |
93-
| `raw_hh_cmnty_cli` | Estimated percentage of people reporting illness in their local community, as [described below](#estimating-community-cli), including their household, with no smoothing or survey weighting |
94-
| `raw_nohh_cmnty_cli` | Estimated percentage of people reporting illness in their local community, as [described below](#estimating-community-cli), not including their household, with no smoothing or survey weighting |
82+
| `raw_cli` and `smoothed_cli` | Estimated percentage of people with COVID-like illness based on the [criteria below](#ili-and-cli-indicators), with no survey weighting |
83+
| `raw_ili` and `smoothed_ili` | Estimated percentage of people with influenza-like illness based on the [criteria below](#ili-and-cli-indicators), with no survey weighting |
84+
| `raw_wcli` and `smoothed_wcli` | Estimated percentage of people with COVID-like illness; adjusted using survey weights [as described below](#survey-weighting) |
85+
| `raw_wili` and `smoothed_wili` | Estimated percentage of people with influenza-like illness; adjusted using survey weights [as described below](#survey-weighting) |
86+
| `raw_hh_cmnty_cli` and `smoothed_hh_cmnty_cli` | Estimated percentage of people reporting illness in their local community, as [described below](#estimating-community-cli), including their household, with no survey weighting |
87+
| `raw_nohh_cmnty_cli` and `smoothed_nohh_cmnty_cli` | Estimated percentage of people reporting illness in their local community, as [described below](#estimating-community-cli), not including their household, with no survey weighting |
9588

9689
Note that for `raw_hh_cmnty_cli` and `raw_nohh_cmnty_cli`, the illnesses
9790
included are broader: a respondent is included if they know someone in their
@@ -115,6 +108,12 @@ can be asymptomatic. Instead, we expect these indicators to be useful for
115108
comparison across the United States and across time, to determine where symptoms
116109
appear to be increasing.
117110

111+
**Smoothing.** The signals beginning with `smoothed_` estimate the same quantities as their
112+
`raw` partners, but are smoothed in time to reduce day-to-day sampling noise;
113+
see [details below](#smoothing). Crucially, because the smoothed signals combine
114+
information across multiple days, they have larger sample sizes and hence are
115+
available for more counties and MSAs than the raw signals.
116+
118117
### Defining Household ILI and CLI
119118

120119
For a single survey, we are interested in the quantities:
@@ -268,9 +267,9 @@ problematic.
268267

269268
### Smoothing
270269

271-
The smoothed versions of the signals described above (with `smoothed_` prefix)
272-
are calculated using seven day pooling. For example, the estimate reported for
273-
June 7 in a specific geographical area (such as county or MSA) is formed by
270+
The smoothed versions of all `fb-survey` signals (with `smoothed_` prefix) are
271+
calculated using seven day pooling. For example, the estimate reported for June
272+
7 in a specific geographical area (such as county or MSA) is formed by
274273
collecting all surveys completed between June 1 and 7 (inclusive) and using that
275274
data in the estimation procedures described above.
276275

docs/api/covidcast-signals/ght.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Google Health Trends
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Google Health Trends

docs/api/covidcast-signals/google-survey.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Google Symptom Surveys
33
parent: Inactive Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Google Symptom Surveys

docs/api/covidcast-signals/hospital-admissions.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,16 @@
11
---
22
title: Hospital Admissions
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Hospital Admissions
88
{: .no_toc}
99

1010
* **Source name:** `hospital-admissions`
1111
* **First issued:** June 30, 2020
12-
* **Number of data revisions since 19 May 2020:** 0
13-
* **Date of last change:** Never
12+
* **Number of data revisions since 19 May 2020:** 1
13+
* **Date of last change:** 20 October 2020
1414
* **Available for:** county, hrr, msa, state (see [geography coding docs](../covidcast_geography.md))
1515

1616
This data source is based on electronic medical records and claims data about

docs/api/covidcast-signals/indicator-combination.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Indicator Combination
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Indicator Combination

docs/api/covidcast-signals/jhu-csse.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: JHU Cases and Deaths
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# JHU Cases and Deaths

docs/api/covidcast-signals/quidel.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Quidel
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# Quidel
@@ -18,8 +18,8 @@ grand_parent: COVIDcast API
1818
## COVID-19 Tests
1919

2020
* **First issued:** 27 July 2020
21-
* **Number of data revisions since 19 May 2020:** 0
22-
* **Date of last change:** Never
21+
* **Number of data revisions since 19 May 2020:** 1
22+
* **Date of last change:** 22 October 2020
2323
* **Available for:** hrr, msa, state (see [geography coding docs](../covidcast_geography.md))
2424

2525
Data source based on COVID-19 Antigen tests, provided to us by Quidel, Inc. When

docs/api/covidcast-signals/safegraph.md

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: SafeGraph Mobility
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# SafeGraph
@@ -20,8 +20,8 @@ using anonymized location data from mobile phones. SafeGraph provides several di
2020
## SafeGraph Social Distancing Metrics
2121

2222
* **First issued:** 23 June 2020
23-
* **Number of data revisions since 23 June 2020:** 0
24-
* **Date of last change:** never
23+
* **Number of data revisions since 23 June 2020:** 1
24+
* **Date of last change:** 3 November 2020
2525
* **Available for:** county, state (see [geography coding docs](../covidcast_geography.md))
2626

2727
Data source based on [social
@@ -43,6 +43,10 @@ documentation](https://docs.safegraph.com/docs/social-distancing-metrics).
4343
| `full_time_work_prop` | The fraction of mobile devices that spent more than 6 hours at a location other than their home during the daytime (SafeGraph's `full_time_work_behavior_devices / device_count`) |
4444
| `part_time_work_prop` | The fraction of devices that spent between 3 and 6 hours at a location other than their home during the daytime (SafeGraph's `part_time_work_behavior_devices / device_count`) |
4545
| `median_home_dwell_time` | The median time spent at home for all devices at this location for this time period, in minutes |
46+
| `completely_home_prop_7dav` | Offers a 7-day trailing window average of the `completely_home_prop`. |
47+
| `full_time_work_prop_7dav` | Offers a 7-day trailing window average of the`full_time_work_prop`. |
48+
| `part_time_work_prop_7dav` | Offers a 7-day trailing window average of the`part_time_work_prop`.|
49+
| `median_home_dwell_time_7dav` | Offers a 7-day trailing window average of the `median_home_dwell_time`.|
4650

4751
After computing each metric on the census block group (CBG) level, we aggregate
4852
to the county-level by taking the mean over CBGs in a county to obtain the value
@@ -52,6 +56,11 @@ doing so, we make the simplifying assumption that each CBG contributes an iid
5256
observation to the county-level distribution. `n` also serves as the sample
5357
size. The same method is used for aggregation to states.
5458

59+
SafeGraph's signals measure mobility each day, which causes strong day-of-week effects:
60+
weekends have substantially different values than weekdays. Users interested in long-term
61+
trends, rather than mobility on one specific day, may prefer the `7dav` signals since
62+
averaging over the preceding 7 days removes these day-of-week effects.
63+
5564
### Lag
5665

5766
SafeGraph provides this data with a three-day lag, meaning estimates for a

docs/api/covidcast-signals/usa-facts.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,15 @@
11
---
22
title: USAFacts Cases and Deaths
33
parent: Data Sources and Signals
4-
grand_parent: COVIDcast API
4+
grand_parent: COVIDcast Epidata API
55
---
66

77
# USAFacts Cases and Deaths
88
{: .no_toc}
99

1010
* **Source name:** `usa-facts`
11-
* **Number of data revisions since 19 May 2020:** 1
12-
* **Date of last change:** [12 October 2020](../covidcast_changelog.md#usa-facts)
11+
* **Number of data revisions since 19 May 2020:** 2
12+
* **Date of last change:** [3 November 2020](../covidcast_changelog.md#usa-facts)
1313
* **Available for:** county, hrr, msa, state (see [geography coding docs](../covidcast_geography.md))
1414

1515
This data source of confirmed COVID-19 cases and deaths is based on reports made

docs/api/covidcast.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,10 @@
11
---
2-
title: COVIDcast API
2+
title: COVIDcast Epidata API
33
has_children: true
44
nav_order: 1
55
---
66

7-
# Delphi's COVIDcast API
7+
# COVIDcast Epidata API
88

99
This is the documentation for accessing the Delphi's COVID-19 Surveillance
1010
Streams (`covidcast`) endpoint of [Delphi](https://delphi.cmu.edu/)'s

0 commit comments

Comments
 (0)