inverse_transform() in BaseNEncoder does not raise an exception #121

janmotl · 2018-09-06T16:54:15Z

inverse_transform() in BaseNEncoder does not raise an exception when the test set contains a new value:

        X = create_dataset(n_rows=100, has_none=False)
        X_t = create_dataset(n_rows=50, has_none=False)
        X_t_extra = create_dataset(n_rows=50, extras=True, has_none=False)
        cols = ['underscore', 'none', 'extra', 321]

        enc = category_encoders.BaseNEncoder(verbose=1, cols=cols)
        enc.fit(X, y)
        with self.assertRaises(ValueError):
            _ = enc.inverse_transform(enc.transform(X_t_extra))

janmotl · 2018-09-11T16:08:05Z

It looks like an error could be in the remaining encoders. OneHot, Binary and Ordinal use the following code:

for col in self.cols:
    if any(X[col] == 0):
        raise ValueError("inverse_transform is not supported because transform impute "
                         "the unknown category -1 when encode %s" % (col,))

While BaseN uses:

for col in self.cols:
    if any(X[col] == -1):
        raise ValueError("inverse_transform is not supported because transform impute "
                         "the unknown category -1 when encode %s" % (col,))

Note the difference between X[col] == 0 and X[col] == -1.

My hypothesis is that it should be -1 everywhere, and that 0 is merely a historical artifact from the time when 0 marked unknown categories. Am I right?
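To make the effect of the mismatch concrete, here is a minimal standard-library sketch. It is not the category_encoders implementation: `check_invertible`, the dict-of-lists stand-in for a DataFrame, and the sample values are all hypothetical, and it assumes that transform marks unseen categories with -1. Under that assumption, a guard that looks for 0 never fires, while a guard that looks for -1 does.

```python
UNKNOWN = -1  # assumption: sentinel written by transform for unseen categories


def check_invertible(columns, cols, sentinel):
    """Raise ValueError if any listed column contains the sentinel value.

    `columns` is a plain dict of lists standing in for a DataFrame
    (hypothetical helper, for illustration only).
    """
    for col in cols:
        if any(value == sentinel for value in columns[col]):
            raise ValueError(
                "inverse_transform is not supported because transform imputed "
                "the unknown category %d when encoding %s" % (sentinel, col)
            )


def guard_fires(columns, cols, sentinel):
    """Return True if the guard raises for the given sentinel, else False."""
    try:
        check_invertible(columns, cols, sentinel)
    except ValueError:
        return True
    return False


# Hypothetical encoded column: 1..3 are known categories, -1 is an unseen one.
encoded = {"extra": [1, 2, 3, UNKNOWN]}

legacy_check_fires = guard_fires(encoded, ["extra"], 0)         # compares against 0
current_check_fires = guard_fires(encoded, ["extra"], UNKNOWN)  # compares against -1

print(legacy_check_fires, current_check_fires)  # prints: False True
```

With the legacy comparison the unseen value passes through silently, which matches the behaviour reported for inverse_transform() at the top of this issue.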

wdm0006 · 2018-09-14T18:20:50Z

Ah yes, it should be -1 everywhere; 0 is legacy and causes issues. We should switch it to -1.

janmotl · 2018-09-16T14:45:17Z

Resolved with commit 037c7cd

janmotl mentioned this issue Sep 6, 2018

Data Driven Testing #117

Closed

janmotl added a commit that referenced this issue Sep 15, 2018

issue #121

037c7cd

janmotl closed this as completed Sep 16, 2018
