Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #32904

SultanOrazbayev · 2020-03-22T14:07:53Z

Code Sample, a copy-pastable example if possible

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

df = pd.DataFrame(np.random.random((10,3))*100, columns=['a', 'b', 'c'])

plt.scatter(x='a', y='b', s='c', data=df, alpha=0.3)

df.plot.scatter(x='a', y='b', s='c')

Problem description

Calling plot.scatter on a pandas dataframe with option for size results in a TypeError. The same data can be plotted directly with matplotlib.

Output of `pd.show_versions()`


INSTALLED VERSIONS
------------------
commit           : None
python           : 3.7.6.final.0
python-bits      : 64
OS               : Darwin
OS-release       : 19.3.0
machine          : x86_64
processor        : i386
byteorder        : little
LC_ALL           : None
LANG             : en_GB.UTF-8
LOCALE           : en_GB.UTF-8

pandas           : 1.0.3
numpy            : 1.18.1
pytz             : 2019.3
dateutil         : 2.8.1
pip              : 19.3.1
setuptools       : 46.0.0.post20200311
Cython           : None
pytest           : None
hypothesis       : None
sphinx           : None
blosc            : None
feather          : None
xlsxwriter       : None
lxml.etree       : 4.5.0
html5lib         : None
pymysql          : None
psycopg2         : None
jinja2           : 2.11.1
IPython          : 7.13.0
pandas_datareader: None
bs4              : None
bottleneck       : None
fastparquet      : 0.3.3
gcsfs            : None
lxml.etree       : 4.5.0
matplotlib       : 3.2.0
numexpr          : None
odfpy            : None
openpyxl         : 3.0.3
pandas_gbq       : None
pyarrow          : 0.16.0
pytables         : None
pytest           : None
pyxlsb           : None
s3fs             : None
scipy            : 1.4.1
sqlalchemy       : 1.3.15
tables           : None
tabulate         : None
xarray           : None
xlrd             : 1.2.0
xlwt             : None
xlsxwriter       : None
numba            : 0.48.0

The text was updated successfully, but these errors were encountered:

TomAugspurger · 2020-03-22T16:29:40Z

Can you post the full traceback?

SultanOrazbayev · 2020-03-22T16:32:11Z

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-1-2a07d407e318> in <module>
      7 plt.scatter(x='a', y='b', s='c', data=df, alpha=0.3)
      8 
----> 9 df.plot.scatter(x='a', y='b', s='c')

~/miniconda/envs/mydef/lib/python3.7/site-packages/pandas/plotting/_core.py in scatter(self, x, y, s, c, **kwargs)
   1497             ...                       colormap='viridis')
   1498         """
-> 1499         return self(kind="scatter", x=x, y=y, s=s, c=c, **kwargs)
   1500 
   1501     def hexbin(self, x, y, C=None, reduce_C_function=None, gridsize=None, **kwargs):

~/miniconda/envs/mydef/lib/python3.7/site-packages/pandas/plotting/_core.py in __call__(self, *args, **kwargs)
    790         if kind in self._dataframe_kinds:
    791             if isinstance(data, ABCDataFrame):
--> 792                 return plot_backend.plot(data, x=x, y=y, kind=kind, **kwargs)
    793             else:
    794                 raise ValueError(f"plot kind {kind} can only be used for data frames")

~/miniconda/envs/mydef/lib/python3.7/site-packages/pandas/plotting/_matplotlib/__init__.py in plot(data, kind, **kwargs)
     59             kwargs["ax"] = getattr(ax, "left_ax", ax)
     60     plot_obj = PLOT_CLASSES[kind](data, **kwargs)
---> 61     plot_obj.generate()
     62     plot_obj.draw()
     63     return plot_obj.result

~/miniconda/envs/mydef/lib/python3.7/site-packages/pandas/plotting/_matplotlib/core.py in generate(self)
    261         self._compute_plot_data()
    262         self._setup_subplots()
--> 263         self._make_plot()
    264         self._add_table()
    265         self._make_legend()

~/miniconda/envs/mydef/lib/python3.7/site-packages/pandas/plotting/_matplotlib/core.py in _make_plot(self)
    973             label=label,
    974             cmap=cmap,
--> 975             **self.kwds,
    976         )
    977         if cb:

~/miniconda/envs/mydef/lib/python3.7/site-packages/matplotlib/__init__.py in inner(ax, data, *args, **kwargs)
   1541     def inner(ax, *args, data=None, **kwargs):
   1542         if data is None:
-> 1543             return func(ax, *map(sanitize_sequence, args), **kwargs)
   1544 
   1545         bound = new_sig.bind(ax, *args, **kwargs)

~/miniconda/envs/mydef/lib/python3.7/site-packages/matplotlib/cbook/deprecation.py in wrapper(*args, **kwargs)
    356                 f"%(removal)s.  If any parameter follows {name!r}, they "
    357                 f"should be pass as keyword, not positionally.")
--> 358         return func(*args, **kwargs)
    359 
    360     return wrapper

~/miniconda/envs/mydef/lib/python3.7/site-packages/matplotlib/axes/_axes.py in scatter(self, x, y, s, c, marker, cmap, norm, vmin, vmax, alpha, linewidths, verts, edgecolors, plotnonfinite, **kwargs)
   4427                 offsets=offsets,
   4428                 transOffset=kwargs.pop('transform', self.transData),
-> 4429                 alpha=alpha
   4430                 )
   4431         collection.set_transform(mtransforms.IdentityTransform())

~/miniconda/envs/mydef/lib/python3.7/site-packages/matplotlib/collections.py in __init__(self, paths, sizes, **kwargs)
    912         Collection.__init__(self, **kwargs)
    913         self.set_paths(paths)
--> 914         self.set_sizes(sizes)
    915         self.stale = True
    916 

~/miniconda/envs/mydef/lib/python3.7/site-packages/matplotlib/collections.py in set_sizes(self, sizes, dpi)
    884             self._sizes = np.asarray(sizes)
    885             self._transforms = np.zeros((len(self._sizes), 3, 3))
--> 886             scale = np.sqrt(self._sizes) * dpi / 72.0 * self._factor
    887             self._transforms[:, 0, 0] = scale
    888             self._transforms[:, 1, 1] = scale

TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

TomAugspurger · 2020-03-23T16:11:24Z

We're eventually passing the string s='c' to matplotlib, rather than extracting the column. In

pandas/pandas/plotting/_matplotlib/core.py

Lines 933 to 936 in d3e5485

    
           if s is None: 
        
               # hide the matplotlib default for size, in case we want to change 
        
               # the handling of this argument later 
        
               s = 20

I think we need an else clause checking for the user providing a column as a string.

As a workaround, you can pass the column directly with s=df['c'].

SultanOrazbayev · 2020-03-23T16:28:14Z

Thank you for the pointer! Submitted a PR here: #32937

TomAugspurger · 2020-03-23T18:18:26Z

We'll leave this open until it's fixed.

TomAugspurger added good first issue Visualization plotting labels Mar 23, 2020

TomAugspurger added this to the Contributions Welcome milestone Mar 23, 2020

SultanOrazbayev mentioned this issue Mar 23, 2020

Fixing scatter plot size (#32904) #32937

Closed

5 tasks

SultanOrazbayev closed this as completed Mar 23, 2020

TomAugspurger reopened this Mar 23, 2020

This was referenced Mar 26, 2020

fix #32904 #33033

Closed

Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #33034

Closed

SultanOrazbayev mentioned this issue Mar 28, 2020

ENH/VIZ: Allowing s parameter of scatter plots to be a column name #33107

Merged

6 tasks

TomAugspurger closed this as completed in #33107 Mar 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #32904

Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #32904

SultanOrazbayev commented Mar 22, 2020 •

edited

Loading

TomAugspurger commented Mar 22, 2020

SultanOrazbayev commented Mar 22, 2020

TomAugspurger commented Mar 23, 2020

SultanOrazbayev commented Mar 23, 2020 •

edited

Loading

TomAugspurger commented Mar 23, 2020

Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #32904

Bug: TypeError: ufunc 'sqrt' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe'' #32904

Comments

SultanOrazbayev commented Mar 22, 2020 • edited Loading

Code Sample, a copy-pastable example if possible

Problem description

Output of pd.show_versions()

TomAugspurger commented Mar 22, 2020

SultanOrazbayev commented Mar 22, 2020

TomAugspurger commented Mar 23, 2020

SultanOrazbayev commented Mar 23, 2020 • edited Loading

TomAugspurger commented Mar 23, 2020

SultanOrazbayev commented Mar 22, 2020 •

edited

Loading

Output of `pd.show_versions()`

SultanOrazbayev commented Mar 23, 2020 •

edited

Loading