Skip to content

Release 7-day average signals for Safegraph #332

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
7 tasks done
krivard opened this issue Oct 20, 2020 · 26 comments
Closed
7 tasks done

Release 7-day average signals for Safegraph #332

krivard opened this issue Oct 20, 2020 · 26 comments
Assignees
Labels
release Track the finishing work for features ready for release
Milestone

Comments

@krivard
Copy link
Contributor

krivard commented Oct 20, 2020

This work creates 7-day average signals for each of the following:

  • median_home_dwell_time
  • completely_home_prop
  • full_time_work_prop
  • part_time_work_prop
  • Statistical review (usually correlations)
  • Signal / source name review (usually Roni)
@krivard krivard added the release Track the finishing work for features ready for release label Oct 20, 2020
@krivard krivard self-assigned this Oct 20, 2020
@benjaminysmith
Copy link
Contributor

Hi @sgsmob, take a look at #367 for an example of what we need for the initial steps.

@sgsmob
Copy link
Contributor

sgsmob commented Nov 10, 2020

Documentation update PR here: cmu-delphi/delphi-epidata#275

@sgsmob
Copy link
Contributor

sgsmob commented Nov 10, 2020

Draft mailing list notification:

Hi COVIDcast API users,

We are pleased to announce the availability of smoothed versions of existing signals
based on SafeGraph mobility data.

Each of the following signals previously available now has a corresponding additional 7-day trailing average signal, i.e., the reported value for May 10 is the average of values from May 4 - May 10:

  1. Proportion of people at home for the whole day
  2. Proportion of people working full time outside the home
  3. Proportion of people working part time outside the home
  4. Median number of hours spent at home

The signal names and technical details for each of these signals
are provided in our API documentation:
https://cmu-delphi.github.io/delphi-epidata/api/covidcast-signals/safegraph.html

@sgsmob
Copy link
Contributor

sgsmob commented Nov 10, 2020

Correlation plots for the smoothed signals look alright to me, although somewhat concerning that they are becoming less indicative recently.

smoothed_correlations

@sgsmob
Copy link
Contributor

sgsmob commented Nov 10, 2020

Compared to the existing raw signals, the averaged signals look more useful/interpretable:

raw_correlations

@krivard
Copy link
Contributor Author

krivard commented Nov 10, 2020

@sgsmob the additional unsmoothed correlations plot is helpful here, since it shows the same degradation in correlations over the past four weeks -- thanks for including it.

The decline is worrisome, and it's possible we might want to pause or even retire these signals (both raw and 7dav). I've put it on the agenda for Thursday's team leads.

@capnrefsmmat
Copy link
Contributor

If I had to make a wild guess, I'd say the decline in correlation is because cases are now exponentially growing everywhere, and slightly lower mobility is no longer enough to prevent spread. But that's a wild guess.

I think the signals still are useful for research even if they don't correlate with case rates; for example, researchers studying the effects of social distancing orders, compliance with them, the economy, or mental health could all find these signals useful for their work.

@benjaminysmith
Copy link
Contributor

Result of team leads was that we are going to keep the signals around, so this is unblocked.

@benjaminysmith
Copy link
Contributor

@krivard for visual review it looks like the averages match the raw data. Seems reasonable to me. What do you think?

@sgsmob are the signal names in https://docs.google.com/document/d/10hGd4Evce4lJ4VkWaQEKFQxvmw2P4xyYGtIAWF52Sf8/edit#heading=h.vu3vsc11yl09 correct? If so, lets ask for Roni approval.

@sgsmob
Copy link
Contributor

sgsmob commented Nov 13, 2020

Yes, they are correct

@benjaminysmith
Copy link
Contributor

@RoniRos can you please review the names here?

@krivard
Copy link
Contributor Author

krivard commented Nov 13, 2020

For visual review @sgsmob can you drop a map display in thread? Whatever default settings the clients have built in is fine.

@sgsmob
Copy link
Contributor

sgsmob commented Nov 18, 2020

@krivard I have a notebook with slider maps in it but I don't know if you are looking for something different here.

@krivard
Copy link
Contributor Author

krivard commented Nov 18, 2020

That sounds fine?

@sgsmob
Copy link
Contributor

sgsmob commented Nov 18, 2020

What is the best way to share a notebook? Should I commit it to a repo and link it from here? Save the html and upload it? Drive link?

@krivard
Copy link
Contributor Author

krivard commented Nov 18, 2020

Upload the HTML or drive link

@sgsmob
Copy link
Contributor

sgsmob commented Nov 18, 2020

@sgsmob
Copy link
Contributor

sgsmob commented Nov 18, 2020

Something seems wrong with the median computation. I am going to take a look at that some more.

@sgsmob
Copy link
Contributor

sgsmob commented Nov 18, 2020

Something seems wrong with the median computation. I am going to take a look at that some more.

Thought this was in hours, but it is actually in minutes; no problem here.

@RoniRos
Copy link
Member

RoniRos commented Nov 18, 2020

[ ] write Signal description pop-up text review

@krivard, I think this linked to the deprecated version. I think this is the correct link.

@sgsmob
Copy link
Contributor

sgsmob commented Nov 19, 2020

Sample pop-up text:

Name: Away from Home 6hr+ a Day (smoothed)
Id: safegraph
Signal: full_time_work_prop_7dav
Type: public
Levels: [county, state]
YAxis: Proportion
Format: raw
HasStdErr: true

MapTitleText: 7-day trailing average of fraction of people spending 6 hours or more between 8am-6pm, in one location away from their home, based on SafeGraph mobility data

Description: Delphi receives data from SafeGraph, which collects anonymized location data from mobile phones. Using this data, we calculate the fraction of mobile devices that spent more than 6 hours at a location other than their home during the daytime and average it over a 7 day trailing window. This indicator measures how mobile people are, and ought to reflect whether people are traveling to work or school outside their homes. See also our Away from Home 3-6hr indicator.

Links:
Technical description


Name: Away from Home 3-6hr a Day (smoothed)
Id: safegraph
Signal: part_time_work_prop_7dav
Type: public
Levels: [county, state]
YAxis: Proportion
Format: raw
HasStdErr: true

MapTitleText: 7-day trailing average of fraction of people spending 3-6 hours between 8am-6pm, in one location away from their home, based on SafeGraph mobility data

Description: Delphi receives data from SafeGraph, which collects anonymized location data from mobile phones. Using this data, we calculate the fraction of mobile devices that spent between 3 and 6 hours at a location other than their home during the daytime and average it over a 7 day trailing window. This indicator measures how mobile people are. See also our Away from Home 6hr+ indicator.

Credits: We are happy for you to use this data in products and publications under the terms of the Creative Commons Attribution license.

Links:
Technical description


@krivard
Copy link
Contributor Author

krivard commented Nov 19, 2020

I've approved the API docs and transferred the proposed signal description text (with my modifications) into the auto doc. They'll get their final review during the map release candidate comment period; not sure whether to tick the box in this issue or not

@sgsmob
Copy link
Contributor

sgsmob commented Nov 19, 2020

Release notes:

Add 7-day averages of SafeGraph mobility signals

EDIT: Just saw that this might no longer be necessary.

@RoniRos
Copy link
Member

RoniRos commented Nov 19, 2020

@krivard I did not see any comments on, or acceptance/rejection of, my proposed edit from "Away from Home 3-6hr a Day" to "At Away Location 3-6hr", and similarly for 6hr+. Did you mean to reject it? To me, "Away from home X hours" is a mischaracterization, but I can understand if some might prefer to keep a simpler description even if not completely accurate.

Similarly for my suggestion for the Y Axis label to be "Fraction of population" -- I think it's more informative.

@krivard
Copy link
Contributor Author

krivard commented Nov 20, 2020

@RoniRos I think you're looking for the frontend thread about misleading labeling (I approve of your changes; we're just waiting on Alex) -- this thread is instead about adding language for the switch to the smoothed signals.

@krivard
Copy link
Contributor Author

krivard commented Nov 20, 2020

Calling this good

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
release Track the finishing work for features ready for release
Projects
None yet
Development

No branches or pull requests

5 participants