Skip to content

Commit 3ab1871

Browse files
committed
Small update to README
1 parent 8c9a143 commit 3ab1871

File tree

1 file changed

+10
-7
lines changed
  • _delphi_utils_python/data_proc/geomap

1 file changed

+10
-7
lines changed

_delphi_utils_python/data_proc/geomap/README.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -24,13 +24,7 @@ We support the following geocodes.
2424
- We are reserving 10001-10099 for states codes of the form 100XX where XX is the FIPS code for the state (the current smallest CBSA is 10100). In the case that the CBSA codes change then it should be verified that these are not used.
2525
- State codes are a series of equivalent identifiers for US state. They include the state name, the state number (state_id), and the state two-letter abbreviation (state_code). The state number is the state FIPS code. See [here](https://en.wikipedia.org/wiki/List_of_U.S._state_and_territory_abbreviations) for more.
2626
- The Hospital Referral Region (HRR) and the Hospital Service Area (HSA). More information [here](https://www.dartmouthatlas.org/covid-19/hrr-mapping/).
27-
- The JHU signal contains its own geographic identifier, labeled the UID. Documentation is provided at [their repo](https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data#uid-lookup-table-logic). Its FIPS codes depart in some special cases, so add some hand additions.
28-
- Dukes and Nantucket counties in Massachusets are aggregated, so we split them with population-proportional weights (approximately 2/3 Dukes and 1/3 Nantucket).
29-
- The same procedure is followed by Kansas City and four of its counties.
30-
- Kusilvak, Alaska is mapped to the FIPS code 02270.
31-
- Ogalala Lakota is mapped to the FIPS code 46113.
32-
- Utah reports at a territory level, so we only report it at in a state level megaFIPS 49000.
33-
- JHU places cases and deaths that cannot be localized to a single county into "Out of State" and "Unassigned" categories. We map these to the "megaFIPS" code XX000, where XX is the state FIPS code. This way, the data is recovered when aggregating up to the state level, but does not interfere with other counties.
27+
- The JHU signal contains its own geographic identifier, labeled the UID. Documentation is provided at [their repo](https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data#uid-lookup-table-logic). Its FIPS codes depart in some special cases, so we produce manual changes listed below.
3428

3529
## Source files
3630

@@ -46,6 +40,15 @@ The source files are requested from a government URL when `geo_data_proc.py` is
4640

4741
The rest of the crosswalk tables are derived from the mappings above. We provide crosswalk functions from granular to coarser codes, but not the other way around. This is because there is no information gained when crosswalking from coarse to granular.
4842

43+
## JHU UID mapping changes
44+
45+
- Dukes and Nantucket counties in Massachusets are aggregated, so we split them with population-proportional weights (approximately 2/3 Dukes and 1/3 Nantucket).
46+
- The same procedure is followed by Kansas City and four of its counties.
47+
- Kusilvak, Alaska is mapped to the FIPS code 02270.
48+
- Ogalala Lakota, South Dakota is mapped to the FIPS code 46113.
49+
- Utah reports at a territory level, so we only report it at in a state level megaFIPS 49000.
50+
- JHU places cases and deaths that cannot be localized to a single county into "Out of State" and "Unassigned" categories. We map these to the "megaFIPS" code XX000, where XX is the state FIPS code. This way, the data is recovered when aggregating up to the state level, but does not interfere with other counties.
51+
4952
## Deprecated source files
5053

5154
- ZIP to FIPS to HRR to states: `02_20_uszips.csv` comes from a version of the table [here](https://simplemaps.com/data/us-zips) modified by Jingjing to include population weights.

0 commit comments

Comments
 (0)