Plotting Int64 columns with nulled integers (NAType) fails #32073 #32387

jeandersonbc · 2020-03-01T18:08:35Z

closes Plotting Int64 columns with nulled integers (NAType) fails #32073
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

MarcoGorelli · 2020-03-02T11:10:53Z

However, before this PR can be accepted, you need to write tests / make sure you don't break existing ones - see contributing to the code base:

pandas is serious about testing and strongly encourages contributors to embrace test-driven development (TDD). This development process “relies on the repetition of a very short development cycle: first the developer writes an (initially failing) automated test case that defines a desired improvement or new function, then produces the minimum amount of code to pass that test.” So, before actually writing any code, you should write your tests. Often the test can be taken from the original GitHub issue. However, it is always worth considering additional use cases and writing corresponding tests.

Adding tests is one of the most common requests after code is pushed to pandas. Therefore, it is worth getting in the habit of writing tests ahead of time so this is never an issue.

jeandersonbc · 2020-03-02T12:42:55Z

Thanks for the feedback @MarcoGorelli! I'm wondering whether the example provided in the related issue (first comment in #32073) would be sufficient to add as a test in the TestDataFramePlots class (test_frame.py). I started the discussion on tests in the issue yesterday (although here might be more appropriate).
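For reference, a test along these lines might look roughly like the sketch below. It is only an illustration: the function name and the frame contents are made up (not copied from #32073), it assumes matplotlib is installed, and the real test would live inside the existing test class and use its helpers.

```python
import pandas as pd


def test_plot_nullable_int_column():
    # Hypothetical test name; imported lazily so the module loads without matplotlib.
    import matplotlib

    matplotlib.use("Agg")  # headless backend, no display needed

    # Made-up data: a nullable Int64 column containing a missing value.
    df = pd.DataFrame(
        {
            "A": [1, 2, 3],
            "B": pd.array([4, None, 6], dtype="Int64"),
        }
    )

    # Without a fix this call is expected to raise; with one it should draw a line.
    ax = df.plot(x="A", y="B")

    assert len(ax.lines) == 1
    assert len(ax.lines[0].get_ydata()) == 3
```

Written first, a test like this should fail on a branch without the fix and pass once the conversion is in place.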

TomAugspurger

Thanks for working on this. We'll need a test and a release note in 1.1.0.rst.

TomAugspurger · 2020-03-02T20:38:07Z

pandas/plotting/_matplotlib/core.py

+            if values.isna().any().all():
+                values = values.astype(float)
+


It seems like date-like data can be included here, and we don't want to convert those to floats.

I think this should be restricted to

    if is_integer_dtype(values.dtype):
        values = values.to_numpy(dtype="float", na_value=np.nan)
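To illustrate the suggestion, here is a minimal sketch of the dtype-restricted conversion outside the plotting code. The helper name `to_plottable` and the sample data are hypothetical; only `is_integer_dtype` and `to_numpy(dtype="float", na_value=np.nan)` come from the comment above.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_integer_dtype


def to_plottable(values: pd.Series):
    """Hypothetical helper: convert integer data to float, leave the rest alone."""
    if is_integer_dtype(values.dtype):
        # pd.NA becomes np.nan, which matplotlib can handle.
        return values.to_numpy(dtype="float", na_value=np.nan)
    # Date-like (and any other non-integer) data is returned unchanged.
    return values


ints = pd.Series([1, 2, pd.NA], dtype="Int64")
dates = pd.Series(pd.to_datetime(["2020-03-01", None]))

converted = to_plottable(ints)
untouched = to_plottable(dates)

print(converted.dtype)   # float dtype for the nullable integers
print(untouched.dtype)   # datetime dtype is preserved
```

The datetime column also contains a missing value (NaT), so a check based only on `isna()` would pick it up; the dtype check is what keeps it from being cast to float.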

According to https://docs.python.org/3/library/pickle.html#object.__reduce__,

> If a string is returned, the string should be interpreted as the name of a global variable. It should be the object’s local name relative to its module; the pickle module searches the module namespace to determine the object’s module. This behaviour is typically useful for singletons.

Closes #31847

jeandersonbc · 2020-03-03T10:48:21Z

Well, I'm going to close this pull request and work in a single branch, as this is not the first time I've messed up after updating my branch.

jeandersonbc added 3 commits March 1, 2020 19:01

BUG: fixes unhandled NAType when plotting (#32073)

05ab972

BUG: fixes unhandled NAType when plotting (#32073)

c7756b1

Fixed bad formatting

0e738b0

TomAugspurger reviewed Mar 2, 2020

jbrockmendel and others added 24 commits March 3, 2020 11:44

REF: collect+parametrize reorder_levels tests (#32373)

26d3297

TST: Allow definition of pd.CategoricalDtype with a specific `categories.dtype` (#32115)

0ff6b96

TYP: annotations for internals, set_axis (#32376)

64f76e9

misplaced DataFrame.join test (#32375)

70b840b

DOC: Fixed ES01, PR07, SA04 error in pandas.core.groupby.DataFrameGroupBy.shift (#32356)

8262397

DOC: Fix SA04 errors in docstrings #28792 (#32182)

ba7895e

CLN: remove _igetitem_cache (#32319)

f2f8605

Avoid unnecessary values_from_object (#32398)

6a48be6

ENH: infer freq in timedelta_range (#32377)

bc4c189

BUG: 2D DTA/TDA arithmetic with object-dtype (#32185)

6023860

TST: broken off from #32187 (#32258)

66422c4

Co-authored-by: Simon Hawkins

REF: simplify PeriodIndex._shallow_copy (#32280)

7210810

CLN: setitem_with_indexer cleanups (#32341)

e262a71

BUG: None / Timedelta incorrectly returning NaT (#32340)

143faa0

TST: Using more fixtures in of tests/base/test_ops.py (#32313)

d2413f9

CLN: remove unused values from interpolate call (#32400)

0a7ebfd

CLN: some code cleanups to pandas/_libs/missing.pyx (#32367)

3f9b4e8

BUG: fixes bug when using sep=None and comment keyword for read_csv (#31667)

0d41a23

Don't create _join_functions (#32336)

a57de43

API: replace() should raise an exception if invalid argument is given (#31946)

5759ad9

BUG: Fix __ne__ comparison for Categorical (#32304)

1c5b03f

CLN: clean-up show_versions and consistently use null for json output (#32042)

b03a910

Add missing newline (#32404)

1973ddb

Added simple test case

9f71755

jeandersonbc closed this Mar 3, 2020
