inplace clipping does not account for mixed precision #21476

crepererum · 2018-06-14T11:55:07Z

Code Sample, a copy-pastable example if possible

This is just one case that can go wrong, but other edge cases are possible (see listing below)

import numpy as np
import pandas as pd

series = pd.Series([0], dtype=np.float32)
upper = pd.Series([-np.finfo(np.float64).tiny], dtype=np.float64)
series.clip(upper, inplace=True)
assert (series <= upper).all()

Problem description

Series.clip, Series.clip_upper and Series.clip_lower may return wrong results if the bound arguments have a higher precision than the series (e.g. float64 VS float32, but others are possible as well) and inplace=True was given. This is due to the fact that pandas cannot change the dtype of the clipped series in that case and seems to do run some additional conversation. The following edge cases are possible:

the upper bound gets larger when converted to the lower precision (this is the example shown above)
the lower bound gets larger when converted to the lower precision
the upper bound is negative and is that "large" that it cannot be represented by the lower precision float
the lower bound is positive and is that "large" that it cannot be represented by the lower precision float
the range between lower and upper is that tiny that there is no lower precision float possible

Expected Output

For 1. and 2.: find the closest lower-precision float that satisfies the bound check

For 3. and 4.: return -/+inf

For 5.: return NaN

Output of `pd.show_versions()`

``` INSTALLED VERSIONS ------------------ commit: None python: 3.6.4.final.0 python-bits: 64 OS: Linux OS-release: 4.9.49-moby machine: x86_64 processor: byteorder: little LC_ALL: en_US.UTF-8 LANG: en_US.UTF-8 LOCALE: en_US.UTF-8

pandas: 0.23.1
pytest: 3.4.0
pip: 10.0.1
setuptools: 39.1.0
Cython: 0.28.2
numpy: 1.14.3
scipy: 1.0.0
pyarrow: 0.9.0
xarray: None
IPython: 6.1.0
sphinx: 1.6.7
patsy: 0.5.0
dateutil: 2.7.3
pytz: 2018.4
blosc: None
bottleneck: 1.2.1
tables: None
numexpr: 2.6.5
feather: None
matplotlib: 2.2.2
openpyxl: None
xlrd: None
xlwt: None
xlsxwriter: None
lxml: None
bs4: None
html5lib: 1.0.1
sqlalchemy: 1.2.5
pymysql: None
psycopg2: 2.7.4 (dt dec pq3 ext lo64)
jinja2: 2.8.1
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: None

</details>

The text was updated successfully, but these errors were encountered:

gfyoung · 2018-06-15T00:56:49Z

@crepererum : Could you post the expected output for series.clip_upper(upper, inplace=True) ?

cc @jreback

crepererum · 2018-06-15T09:46:31Z

"Fun" fact: the same issue is present in numpy:

import numpy as np

array = np.array([0], dtype=np.float32)
out = array.copy()
upper = np.array([-np.finfo(np.float64).tiny], dtype=np.float64)
np.clip(array, None, upper, out)

assert (out <= upper).all()

The following seems to work though, and is also the answer to the expected output:

import numpy as np

def clip_upper_inplace(array, upper):
    np.clip(array, None, upper, array)
    wrong = (array > upper)
    array[wrong] = np.nextafter(array[wrong], -np.inf)

array = np.array([0], dtype=np.float32)
upper = np.array([-np.finfo(np.float64).tiny], dtype=np.float64)
clip_upper_inplace(array, upper)

assert (array <= upper).all()

(array is [-1.e-45] at this point)

gfyoung added Numeric Operations Arithmetic, Comparison, and Logical operations Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Bug and removed Bug labels Jun 15, 2018

crepererum mentioned this issue Jun 20, 2018

inplace clipping does not account for mixed precision numpy/numpy#11387

Open

jreback mentioned this issue Dec 28, 2018

BUG: clip doesn't preserve dtype by column #24458

Merged

4 tasks

mroeschke added the Bug label May 13, 2020

mroeschke removed Numeric Operations Arithmetic, Comparison, and Logical operations Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff labels Jun 20, 2021

jbrockmendel added the inplace Relating to inplace parameter or equivalent label Oct 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inplace clipping does not account for mixed precision #21476

inplace clipping does not account for mixed precision #21476

crepererum commented Jun 14, 2018 •

edited by mroeschke

Loading

gfyoung commented Jun 15, 2018

crepererum commented Jun 15, 2018

inplace clipping does not account for mixed precision #21476

inplace clipping does not account for mixed precision #21476

Comments

crepererum commented Jun 14, 2018 • edited by mroeschke Loading

Code Sample, a copy-pastable example if possible

Problem description

Expected Output

Output of pd.show_versions()

gfyoung commented Jun 15, 2018

crepererum commented Jun 15, 2018

crepererum commented Jun 14, 2018 •

edited by mroeschke

Loading

Output of `pd.show_versions()`