@@ -14,8 +14,10 @@ whose estimate is being reported. Estimates are available for several possible
  * `county`: County-level estimates are reported by the county's five-digit [FIPS
  code](https://en.wikipedia.org/wiki/FIPS_county_code). All FIPS codes are
  reported using pre-2015 FIPS code assignments, *except* for FIPS codes used by
- the `jhu-csse` source. These are reported exactly as JHU reports their data;
- [see below](#fips-exceptions-in-jhu-data).
+ the `jhu-csse` and `usa-facts` sources. These are reported exactly as the
+ sources report their data; [see below](#coding-exceptions). FIPS codes ending
+ in `000` are not valid counties, and instead represent "megacounties" we
+ construct; [see below](#small-sample-sizes-and-megacounties).
  * `hrr`: Hospital Referral Region, units designed to represent regional health
  care markets. There are roughly 300 HRRs in the United States. A map is
  available
@@ -29,10 +31,11 @@ whose estimate is being reported. Estimates are available for several possible
  media markets, as [defined by
  Nielsen](https://www.nielsen.com/us/en/intl-campaigns/dma-maps/).
  * `state`: The 50 states, identified by their two-digit postal abbreviation (in
- lower case). Estimates for Puerto Rico are available as state `pr`; Washington, D.C. is available as state `dc`.
+ lower case). Estimates for Puerto Rico are available as state `pr`;
+ Washington, D.C. is available as state `dc`.

  Some signals are not available for all `geo_type`s, since they may be reported
- from their original sources with different levels of aggregation.
+ by their original sources with different levels of aggregation.

  ## Table of contents
  {: .no_toc .text-delta}
@@ -50,18 +53,27 @@ identification of respondents, violating privacy and confidentiality agreements.
  Additional considerations for specific signals are discussed in the [source and
  signal documentation](covidcast_signals.md).

- In each state, we collect together the data from all counties with insufficient
- data to be individually reported. These counties are combined into a single
- "megacounty". For example, if only five counties in a state have sufficient data
- to be reported, the remaining counties will form one megacounty representing the
- rest of that state. As sample sizes vary from day to day, the counties composing
- the megacounty can vary daily; the geographic area covered by the megacounty is
- simply the state minus the counties reported for that day.
+ On each day, in each state, we collect together the data from all counties with
+ insufficient data to be individually reported. These counties are combined into
+ a single "megacounty". For example, if only five counties in a state have
+ sufficient data to be reported, the remaining counties will form one megacounty
+ representing the rest of that state. Megacounty estimates are reported with a
+ FIPS code ending with `000`, which is never a FIPS code for a real county. For
+ example, megacounty estimates for the state of New York are reported with FIPS
+ code `36000`, since `36` is the FIPS code prefix for New York.

- Megacounty estimates are reported with a FIPS code ending with 000, which is
- never a FIPS code for a real county. For example, megacounty estimates for the
- state of New York are reported with FIPS code 36000, since 36 is the FIPS code
- prefix for New York.
+ These megacounty estimates are used on our COVIDcast map and in the county maps
+ produced by our [API clients](covidcast_clients.md), to color in the background
+ of states and graphically represent the "rest of" states whose counties are not
+ all individually reported.
+
+ **Warning:** As sample sizes vary from day to day, the counties composing the
+ megacounty can vary daily; the geographic area covered by the megacounty is
+ simply the state minus the counties reported for that day. The megacounty
+ construction also depends on the specific source and signal, so on one day,
+ megacounty `36000` can cover a different geographic area for the `doctor-visits`
+ source than it does for the `fb-survey` source. Do not try to compare megacounty
+ estimates across time or between signals.

  ## Coding Exceptions
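The megacounty convention documented in this change (a five-digit FIPS code ending in `000`, with the first two digits giving the state prefix) can be checked on the consumer side. The following is a minimal sketch in Python, assuming FIPS codes arrive as five-character strings. The function names are hypothetical illustrations and are not part of the COVIDcast API or its clients.

```python
def is_megacounty(fips: str) -> bool:
    """Return True if a five-digit FIPS code ends in 000 (the megacounty pattern)."""
    return len(fips) == 5 and fips.isdigit() and fips.endswith("000")


def state_prefix(fips: str) -> str:
    """Return the two-digit state prefix of a five-digit FIPS code."""
    return fips[:2]


def split_megacounties(fips_codes):
    """Split FIPS codes into (real counties, megacounties), preserving order."""
    counties = [code for code in fips_codes if not is_megacounty(code)]
    megacounties = [code for code in fips_codes if is_megacounty(code)]
    return counties, megacounties


# Hypothetical usage: 36000 is the New York megacounty from the text above.
counties, megacounties = split_megacounties(["36061", "36000", "42003"])
```

Separating the two groups up front makes it harder to accidentally compare megacounty estimates across days or signals, which the warning in the diff advises against.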