You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -155,14 +155,17 @@ ggplot(fc_cafl, aes(x = fc_target_date, group = time_value, fill = engine_type))
155
155
```
156
156
157
157
For the two states of interest, simple linear regression clearly performs better
158
-
than random forest in terms of accuracy of the predictions and not does not
159
-
result in such in overconfident predictions (too narrow confidence bands).
160
-
Though, in general, both approaches do not perform great. This could be because
158
+
than random forest in terms of accuracy of the predictions and does not
159
+
result in such in overconfident predictions (overly narrow confidence bands).
160
+
Though, in general, neither approach produces amazingly accurate forecasts.
161
+
This could be because
161
162
the behaviour is rather different across states and the effects of other notable
162
163
factors such as age and public health measures may be important to account for
163
164
in such forecasting. Including such factors as well as making enhancements such
164
-
as correcting for outliers are some improvements one could make to our simple
165
-
model in the future.
165
+
as correcting for outliers are some improvements one could make to this simple
166
+
model.[^1]
167
+
168
+
[^1]: Note that, despite the above caveats, simple models like this tend to out-perform many far more complicated models in the online Covid forecasting due to those models high variance predictions.
166
169
167
170
### Example using case data from Canada
168
171
@@ -175,7 +178,7 @@ daily time series data on COVID-19 cases, deaths, recoveries, testing and
175
178
vaccinations at the health region and province levels. Data are collected from
176
179
publicly available sources such as government datasets and news releases.
177
180
Unfortunately, there is no simple versioned source, so we have created our own
178
-
from the Commit history.
181
+
from the Github commit history.
179
182
180
183
First, we load versioned case rates at the provincial level. After converting
181
184
these to 7-day averages (due to highly variable provincial reporting
0 commit comments