ENH: Add the decimal.Decimal type to infer_dtypes (#15690) #16426

margaret · 2017-05-22T18:55:46Z

closes Infer Decimal dtype #15690
tests added / passed
passes git diff upstream/master --name-only -- '*.py' | flake8 --diff
whatsnew entry

TomAugspurger · 2017-05-22T19:12:45Z

pandas/tests/dtypes/test_inference.py

@@ -462,6 +463,11 @@ def test_floats(self):
        result = lib.infer_dtype(arr)
        assert result == 'floating'

+    def test_decimals(self):
+        arr = np.array([Decimal(1), Decimal(2), Decimal(3)])


Can you add a comment with a link to the github issue? (#15690)

TomAugspurger · 2017-05-22T19:24:15Z

pandas/tests/dtypes/test_inference.py

+        arr = np.array([Decimal(1), Decimal(2), Decimal(3)])
+        result = lib.infer_dtype(arr)
+        assert result == 'decimal'
+


Sorry, could you add a test with a mix of Decimal and non-decimal (like floats) and make sure that the return is mixed? Or search through and see if we have a test that already covers this. Other than that, this looks great.
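As an illustration of the behaviour being requested here, the following is a minimal pure-Python sketch. It is not pandas' implementation: `infer_kind` is a hypothetical stand-in for `lib.infer_dtype`, and it assumes only the rule discussed in this thread, namely that a homogeneous collection reports its own kind and any combination of kinds reports 'mixed'.

```python
from decimal import Decimal


def infer_kind(values):
    """Toy stand-in for dtype inference (hypothetical, NOT pandas' code)."""
    kinds = set()
    for v in values:
        if isinstance(v, Decimal):
            kinds.add("decimal")
        elif isinstance(v, float):
            kinds.add("floating")
        else:
            # Anything this sketch does not recognise counts as mixed.
            kinds.add("mixed")
    # Exactly one kind seen: report it. Otherwise fall back to 'mixed'.
    if len(kinds) == 1:
        return kinds.pop()
    return "mixed"


# All Decimals: the new label.
print(infer_kind([Decimal(1), Decimal(2), Decimal(3)]))
# Decimals combined with floats: falls back to 'mixed'.
print(infer_kind([1.0, 2.0, Decimal(3)]))
```

The real test would make the same two assertions against `lib.infer_dtype`, once with only Decimals and once with Decimals and floats together.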

jorisvandenbossche · 2017-05-22T20:00:02Z

What I am wondering is what the potential consequences of this change are. There are quite a few places in the codebase where we check whether the inferred type is 'mixed', so this change could alter behaviour for objects holding Decimals.
But I don't know whether those code paths are relevant for Decimal objects.

margaret · 2017-05-22T20:47:53Z

https://travis-ci.org/pandas-dev/pandas/jobs/234948322#L1392 failed one of the vector resizing tests (Test for memory errors after internal vector reallocations), but only on the Python 3.6 run. Any ideas on how that would be related to this? The parameter it failed on seems to just be int64.

TomAugspurger · 2017-05-22T21:04:22Z

@jorisvandenbossche are there places where we explicitly check == 'mixed'? I haven't been able to find any by just grepping (other than in tests).

@margaret I'm not able to reproduce that. I'll restart that worker.

jorisvandenbossche · 2017-05-22T21:09:43Z

According to my pycharm search, there are 13 occurrences of 'mixed' (usage as a string constant)

codecov · 2017-05-22T22:09:34Z

Codecov Report

Merging #16426 into master will decrease coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #16426      +/-   ##
==========================================
- Coverage   90.42%   90.42%   -0.01%     
==========================================
  Files         161      161              
  Lines       51023    51023              
==========================================
- Hits        46138    46137       -1     
- Misses       4885     4886       +1

Flag	Coverage Δ
#multiple	`88.26% <ø> (-0.01%)`	⬇️
#single	`40.17% <ø> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/core/common.py	`91.05% <0%> (-0.34%)`	⬇️

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 49ec31b...6f158cb. Read the comment docs.

jreback · 2017-05-22T22:15:43Z

doc/source/whatsnew/v0.21.0.txt

@@ -34,6 +34,7 @@ Other Enhancements
 - ``Series.to_dict()`` and ``DataFrame.to_dict()`` now support an ``into`` keyword which allows you to specify the ``collections.Mapping`` subclass that you would like returned.  The default is ``dict``, which is backwards compatible. (:issue:`16122`)
 - ``RangeIndex.append`` now returns a ``RangeIndex`` object when possible (:issue:`16212`)
 - :func:`to_pickle` has gained a protocol parameter (:issue:`16252`). By default, this parameter is set to `HIGHEST_PROTOCOL <https://docs.python.org/3/library/pickle.html#data-stream-format>`__
+- ``lib.infer_dtype`` now infers decimals. (:issue: `15690`)


:func:`api.types.infer_dtype`

jreback · 2017-05-22T22:15:59Z

pandas/_libs/src/inference.pyx

@@ -308,7 +308,6 @@ def infer_dtype(object value):
    'categorical'

    """


pls update the doc-string & add an example

jreback · 2017-05-22T22:18:04Z

If the suite passes this is prob ok. The times when we infer are:

- construction (IOW we have something like Series([Decimal(..), Decimal(..)])), but since decimal is an object dtype this won't matter (IOW, we don't specially handle it, so it will fall through as object anyhow).
- indexing, with an Index of decimals. But again it's the same as above: we don't do anything special, so I think this is ok for now.

jorisvandenbossche · 2017-05-22T22:26:35Z

One further thought: would we like to call this "mixed-decimal"? (or "object-decimal", but we seem to use 'mixed' as the general indicator for object-dtyped columns)
This is more explicit about the fact that it is still object dtype (since we don't have a decimal dtype, nor do we plan to have one), but lets you specifically check for decimals, and also lets you more easily test for object dtypes in general (`"mixed" in inferred_dtype`, instead of `inferred_dtype in ['decimal', 'mixed', 'mixed-..', ..]`)
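To make the trade-off concrete, here is a small sketch of the two checking styles. The helper names and the `object_labels` list are hypothetical and exist only for illustration; 'mixed-decimal' is the label proposed in this comment, not one pandas returns.

```python
def is_object_like_by_substring(inferred):
    # Works if every object-dtype label contains the word 'mixed'.
    return "mixed" in inferred


def is_object_like_by_membership(inferred, object_labels):
    # Needs an explicit list that must be kept up to date.
    return inferred in object_labels


# Hypothetical list a caller would have to maintain under the 'decimal' naming.
object_labels = ["decimal", "mixed", "mixed-integer"]

# With the proposed 'mixed-decimal' label the substring check is enough.
print(is_object_like_by_substring("mixed-decimal"))
# With the plain 'decimal' label the substring check misses it...
print(is_object_like_by_substring("decimal"))
# ...so callers fall back to the membership check.
print(is_object_like_by_membership("decimal", object_labels))
```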

jreback · 2017-05-22T22:47:46Z

i think decimal is just fine ; don't need to get more complicated

jreback · 2017-05-23T00:18:34Z

lgtm. ping on green.

jreback · 2017-05-23T10:56:34Z

thanks!

…andas-dev#16426) closes pandas-dev#15690

ENH: Add the decimal.Decimal type to infer_dtypes (pandas-dev#15690)

bc15e85

TomAugspurger reviewed May 22, 2017

margaret added 2 commits May 22, 2017 12:16

Reference GH15690 in test_decimals

e7e67e8

Add enhancement from issue pandas-dev#15690

b291b62

TomAugspurger added this to the 0.21.0 milestone May 22, 2017

TomAugspurger added Dtype Conversions Unexpected or buggy dtype conversions Enhancement labels May 22, 2017

TomAugspurger reviewed May 22, 2017

Check that decimal and non-decimal returns 'mixed'

024feac

jreback requested changes May 22, 2017

jreback mentioned this pull request May 22, 2017

Infer Decimal dtype #15690

Closed

Fix reference to infer_dtype func

cc9ac39

update infer_dtype docstring with decimal example

6f158cb

jreback approved these changes May 23, 2017

jreback merged commit c53d00f into pandas-dev:master May 23, 2017

stangirala pushed a commit to stangirala/pandas that referenced this pull request Jun 11, 2017

ENH: Add the decimal.Decimal type to infer_dtypes (pandas-dev#15690) (p…

82e21a3

…andas-dev#16426) closes pandas-dev#15690
