Google symptoms dap #28

jingjtang · 2021-02-18T14:02:05Z

Add GS DAP

Scripts for sensorization/correlation analysis etc. are included in folder ./scripts.
Google_Symptoms_DAP_Final_Report.html is the final report.
More details for correlation analysis in Appendix1_Correlation_Results.html
More details for coefficients and intercepts returned by sensorization in Appendix2_Coefficients_and_Intercepts.html

…ovided

…A+A + GHT

…sons

nmdefries

Unfortunately, a lot of these files are too big for me to leave comments directly, so I'll put them here.

Final Report:

In the introduction, it would be good to link to the previous work establishing anosmia/ageusia as useful for a limited geographic range.

electrical heath records

electronic health records

The violin plot shown above and the barplot shown in Appendix II indicate the difference in the spatial distribution of symptoms in the same symptom set. This leads to the fact that when training linear regression models for different counties, the number of features (symptoms) that are "actually" taken into account is different. If we draw the median of coefficients for symptoms across all the counties available for the symptom set, we simply get 0s for symptoms with low geographical coverage. It should be noted that getting median to zero is not an error message, because the default missingness of this data set is caused by the extremely low search volume.

This paragraph is hard to parse, especially if the reader hasn't gotten to the appendix yet. Essentially you're saying this, right:

"Within a symptom set, there are large differences in geographical availability by symptom. Because of this and the zero-fill procedure when creating a symptom set, some symptoms are implicitly excluded from modeling for a given county. This leads to the coefficient for that symptom to be zero in the model for a given county. Symptoms with high missingness will, as a result, have a median coefficient of zero; this isn't an error."

From an organizational perspective, results in the appendices should be auxiliary -- not required to understand the main report, but providing additional tangential results that someone might be interested in. If you're using results from the appendix in the main report, those results should also be in the main report.

Appendix I is particularly interesting -- it makes an even stronger case for using regression over rawsum based on comparisons for non-sensory symptom sets.

Over all, looks good! Thanks for your hard work!

nmdefries and others added 30 commits September 30, 2020 10:53

Initial comparison of GHT indicator and GS anosmia + ageusia

27302bd

Dig into GS missingness

9a13562

bug fix

eb1362f

Adding tests from symptoms survey blog post

b863b62

Adding tests from symptoms survey blog post

7dae33a

Added additional alternative models to compare usefulness of GS vs GHT

c2147f4

Alternative models run with gurobi or read saved data

3571c40

Comparison of 4 models, with GS A+A and GHT. Supporting dataframes pr…

051a1bc

…ovided

Knit output for only cases vs GS A+A

a8dbfd2

Knit output for cases vs cases + GS A+A vs cases + GHT vs cases + GS …

3cf1eaf

…A+A + GHT

Separated forecasting error by days ahead into pairwise model compari…

ea7434e

…sons

Fixed days-ahead bug

37bacdb

More commentary on days-ahead graphs

77f36d3

add file looking at cases vs cases + symptoms models at the county level

43fc7ee

remove extraneous files

4013181

Add correlation comparison

15da6cc

add predictive power comparison

8e7a794

rm the old combined predictive power comparison script

51c264e

add county level predictive power comparison

77cd4aa

add county level comparison html

c319115

add msa level predictive power comparison

6e6595c

add html version for msa level predictive power comparison

e842b88

remove geo-wise correlation, re-run time-series correlation analysis

aca2ce9

add raw error plot for days ahead

4c8a18a

add exploration for other symptoms

8b94fd1

fixed typos

820a256

fixed an error in the title of figures

5156788

replace states with MSAs

44253b9

add predictive power analysis considering larger geographic scope

1b47981

add scripts for sensorization

5701f8d

Jingjing Tang added 11 commits January 20, 2021 18:07

add script for calculating rank correlations

6c73020

add v0 report

8f828ad

upload appendix for correlation analysis

642ab95

update data source

ce9c391

update scripts

494a2bf

add appendix2

7621982

upload final report

705936b

fixed an error

540ff90

update docs

76e9b01

relocate scripts

f8bb9bb

delete scripts outside the scripts folder

63960e1

jingjtang requested a review from nmdefries February 18, 2021 14:02

nmdefries reviewed Feb 18, 2021

View reviewed changes

Jingjing Tang added 2 commits March 4, 2021 12:33

added explainations and fixed errors

44c9d77

Add details for the issue of as_of date

ef0da2d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Google symptoms dap #28

Google symptoms dap #28

Uh oh!

jingjtang commented Feb 18, 2021 •

edited

Loading

Uh oh!

nmdefries left a comment •

edited

Loading

Uh oh!

Uh oh!

Google symptoms dap #28

Are you sure you want to change the base?

Google symptoms dap #28

Uh oh!

Conversation

jingjtang commented Feb 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nmdefries left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Final Report:

Uh oh!

Uh oh!

jingjtang commented Feb 18, 2021 •

edited

Loading

nmdefries left a comment •

edited

Loading