Fix smoother imputing polynomial order #490

dshemetov · 2020-11-10T23:09:55Z

Description

Update the smoother's imputation function to perform better. Depends on #478.

Changelog

Gives the smoother's imputation function its own polynomial order, separate from the smoother's.
Sets the imputer's default polynomial order to 2.

Fixes

Arguably, the imputer should interpolate values rather than smooth them. This change avoids situations like this:

>>> signal = np.array([i if i % 3 else np.nan for i in range(1, 40)])
>>> Smoother().impute(signal, impute_order=0)
array([ 1.        ,  2.        ,  1.50520814,  4.        ,  5.        ,
        2.77986492,  7.        ,  8.        ,  4.19921816, 10.        ,
       11.        ,  5.82790041, 13.        , 14.        ,  7.70752267,
       16.        , 17.        ,  9.84727107, 19.        , 20.        ,
       12.22315472, 22.        , 23.        , 14.78905031, 25.        ,
       26.        , 17.4936213 , 28.        , 29.        , 20.29881603,
       31.        , 32.        , 23.17118576, 34.        , 35.        ,
       26.08145499, 37.        , 38.        , 29.01928305])
>>> Smoother().impute(signal, impute_order=2)
array([ 1.,  2.,  3.,  4.,  5.,  6.,  7.,  8.,  9., 10., 11., 12., 13.,
       14., 15., 16., 17., 18., 19., 20., 21., 22., 23., 24., 25., 26.,
       27., 28., 29., 30., 31., 32., 33., 34., 35., 36., 37., 38., 39.])

Without this change, setting poly_fit_degree=0 on the smoother would also apply degree 0 to the imputer and produce the poor imputation shown above.
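To illustrate why the imputer's polynomial order matters, here is a minimal sketch. It is not the `Smoother` implementation: `impute_with_polyfit` and its `window` parameter are hypothetical, and it uses an unweighted `np.polyfit` on the preceding values, so the degree-0 numbers will differ from the output above. The qualitative behavior is the same: degree 0 fills each gap with an average of earlier values and lags a linear trend, while degree 2 recovers it.

```python
import numpy as np


def impute_with_polyfit(signal, degree, window=7):
    """Hypothetical helper (not the delphi_utils API): fill each NaN by
    fitting a polynomial to up to `window` preceding values and
    extrapolating one step ahead. Previously imputed values are reused."""
    filled = np.array(signal, dtype=float)
    for i in range(len(filled)):
        if not np.isnan(filled[i]):
            continue
        start = max(0, i - window)
        y = filled[start:i]
        # x is measured relative to the missing point, so the prediction
        # at the missing point is the polynomial evaluated at x = 0.
        x = np.arange(start - i, 0)
        ok = ~np.isnan(y)
        if not ok.any():
            continue  # nothing to fit on (e.g. leading NaNs); leave as NaN
        # Assumption: cap the degree so the fit is well determined.
        d = min(degree, int(ok.sum()) - 1)
        coeffs = np.polyfit(x[ok], y[ok], d)
        filled[i] = coeffs[-1]  # constant term is the value at x = 0
    return filled


signal = np.array([i if i % 3 else np.nan for i in range(1, 40)], dtype=float)
order0 = impute_with_polyfit(signal, degree=0)  # running average of the past
order2 = impute_with_polyfit(signal, degree=2)  # follows the linear trend
```

In this sketch, `order2` reproduces the sequence 1 through 39 up to floating-point error, while `order0` falls progressively further below the trend at every imputed position.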

chinandrew · 2020-11-10T23:16:49Z

_delphi_utils_python/delphi_utils/smooth.py

-                    )
-                # Otherwise, use savgol fitting on the largest window prior
+                # Otherwise, use savgol fitting on the largest window prior,
+                # reduce the polynomial degree if needed


Add a note explaining why the polynomial degree needs to be reduced.
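The thread does not spell out the reason, so the following is an assumption rather than the author's explanation. A degree-d polynomial has d + 1 coefficients, so a window with fewer than d + 1 points leaves the least-squares normal equations singular. The sketch below shows this with plain NumPy; `fit_degree_for_window` is a hypothetical helper, not a function from `smooth.py`.

```python
import numpy as np


def fit_degree_for_window(n_points, requested_degree):
    """Hypothetical helper: largest polynomial degree that n_points
    samples can determine, capped at the requested degree."""
    return max(0, min(requested_degree, n_points - 1))


# Only two prior observations are available near the start of a signal.
x = np.array([0.0, 1.0])
y = np.array([1.0, 2.0])

# Degree-2 design matrix: 2 rows, 3 columns, so X^T X (3x3) has rank 2
# and cannot be inverted.
X = np.vander(x, 3)
rank = np.linalg.matrix_rank(X.T @ X)

# Reducing the degree to 1 gives a uniquely determined fit.
degree = fit_degree_for_window(len(x), requested_degree=2)
coeffs = np.polyfit(x, y, degree)
next_value = np.polyval(coeffs, 2.0)
```

With the degree reduced to 1, the two points define a line and the extrapolated value at x = 2 continues the trend.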

chinandrew

just one comment otherwise lgtm

chinandrew · 2020-11-11T00:37:46Z

This depends on #476-8, right? Maybe we should add a section for that to the PR template.

dshemetov · 2020-11-11T00:38:23Z

Oops, meant to do that

dshemetov requested a review from chinandrew November 10, 2020 23:10

chinandrew reviewed Nov 10, 2020

dshemetov added 3 commits November 10, 2020 15:51

Updated the smoother to remove a non-invertible design matrix bug

a5f81ba

Update smoother to gracefully handle nans:

4ad4c39

* entire array of nans is handled
* left-padded nans are now ignored
* a few other edge cases
* add tests to match

Fixes the bug where the smoother drops the pandas series index:

ad01388

* restore the index after smoothing
* test to match

dshemetov force-pushed the fix_smoother_imputing_polyorder branch from a5c18a1 to 45ab905 Compare November 11, 2020 00:20

Update smoother imputer:

fa0b6e2

* separate out the smoother's polynomial fit degree from the imputer's
* default the imputer's fit degree to 2
* add tests

dshemetov force-pushed the fix_smoother_imputing_polyorder branch from 45ab905 to fa0b6e2 Compare November 11, 2020 00:23

chinandrew approved these changes Nov 11, 2020

krivard merged commit 95ece40 into main Nov 11, 2020

krivard deleted the fix_smoother_imputing_polyorder branch November 11, 2020 14:38
