BUG: Retain tz-aware dtypes with melt (#15785) #20292

mroeschke · 2018-03-12T02:00:24Z

closes BUG: melt changes type of tz-aware columns #15785
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

.values call was converting tz aware data to tz naive data (by casting to a numpy array). Added an additional test for Categorical data as well.

WillAyd · 2018-03-12T03:22:48Z

pandas/tests/reshape/test_melt.py

+    @pytest.mark.parametrize("col", [
+        pd.Series(pd.date_range('2010', periods=5, tz='US/Pacific')),
+        pd.Series(["a", "b", "c", "a", "d"], dtype="category")])
+    def test_pandas_dtypes_id_var(self, col):


If you wanted to reduce duplication of code further could parametrize something like "as_val" with parameters of True and False and then just add a conditional at the top of the function to set attr2 either to either the col or [0, 1, 0, 0, 0]

ahh I see @WillAyd suggested. in any event this makes this much harder to read.

codecov · 2018-03-12T05:39:54Z

Codecov Report

Merging #20292 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #20292      +/-   ##
==========================================
+ Coverage   91.72%   91.73%   +<.01%     
==========================================
  Files         150      150              
  Lines       49165    49174       +9     
==========================================
+ Hits        45099    45108       +9     
  Misses       4066     4066

Flag	Coverage Δ
#multiple	`90.11% <100%> (ø)`	⬆️
#single	`41.86% <28.57%> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/core/reshape/melt.py	`97.34% <100%> (+0.14%)`	⬆️
pandas/core/generic.py	`95.84% <0%> (-0.01%)`	⬇️
pandas/core/frame.py	`97.18% <0%> (ø)`	⬆️
pandas/core/indexing.py	`93.02% <0%> (ø)`	⬆️
pandas/core/indexes/base.py	`96.66% <0%> (ø)`	⬆️
pandas/core/strings.py	`98.32% <0%> (ø)`	⬆️
pandas/core/resample.py	`96.43% <0%> (ø)`	⬆️
pandas/core/groupby.py	`92.14% <0%> (ø)`	⬆️
pandas/core/indexes/datetimes.py	`95.64% <0%> (ø)`	⬆️
pandas/plotting/_core.py	`82.27% <0%> (ø)`	⬆️
... and 2 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 31afaf8...abe48c9. Read the comment docs.

jreback · 2018-03-12T10:36:10Z

pandas/core/reshape/melt.py

+        id_data = frame.pop(col)
+        if is_extension_type(id_data):
+            # Preserve pandas dtype by not converting to a numpy array
+            id_data = concat([id_data] * K, ignore_index=True)


@TomAugspurger do we have this in ExtensionArray ATM? .tile()? or could emulate like this

No .tile. We do have _concat_same_type.

jreback · 2018-03-12T10:36:18Z

pandas/core/reshape/melt.py

-        mdata[col] = np.tile(frame.pop(col).values, K)
+        id_data = frame.pop(col)
+        if is_extension_type(id_data):
+            # Preserve pandas dtype by not converting to a numpy array


comment is not needed

jreback · 2018-03-12T10:38:17Z

pandas/tests/reshape/test_melt.py

+        df = DataFrame({'klass': range(5),
+                        'col': col,
+                        'attr1': [1, 0, 0, 0, 0]})
+        if pandas_dtype_value:


can you move what this does into the parameterize, maybe by using a fixture. this defeats the purpose of being able to look at the parameterize and see what cases are being tested

jschendel · 2018-03-12T13:58:29Z

doc/source/whatsnew/v0.23.0.txt

@@ -896,6 +896,7 @@ Timezones
 - Bug in :func:`Timestamp.tz_localize` where localizing a timestamp near the minimum or maximum valid values could overflow and return a timestamp with an incorrect nanosecond value (:issue:`12677`)
 - Bug when iterating over :class:`DatetimeIndex` that was localized with fixed timezone offset that rounded nanosecond precision to microseconds (:issue:`19603`)
 - Bug in :func:`DataFrame.diff` that raised an ``IndexError`` with tz-aware values (:issue:`18578`)
+- Bug in :func:`melt` that coverted tz-aware dtypes to tz-naive (:issue:`15785`)


coverted --> converted

Add additional tests

jreback · 2018-03-13T10:31:13Z

thanks!

WillAyd reviewed Mar 12, 2018

View reviewed changes

jreback added Bug Timezones Timezone data dtype labels Mar 12, 2018

jreback requested changes Mar 12, 2018

View reviewed changes

jreback added the Reshaping Concat, Merge/Join, Stack/Unstack, Explode label Mar 12, 2018

jschendel reviewed Mar 12, 2018

View reviewed changes

mroeschke added 4 commits March 12, 2018 18:22

BUG: Retain tz-aware dtypes with melt

be66786

Add additional tests

clarify comment

2d28685

Reduce duplication and dict insertion order fix

78a2d84

address review

abe48c9

mroeschke force-pushed the melt_tz branch from a46eef1 to abe48c9 Compare March 13, 2018 01:58

jreback added this to the 0.23.0 milestone Mar 13, 2018

jreback approved these changes Mar 13, 2018

View reviewed changes

jreback merged commit 53bf291 into pandas-dev:master Mar 13, 2018

mroeschke deleted the melt_tz branch March 13, 2018 15:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Retain tz-aware dtypes with melt (#15785) #20292

BUG: Retain tz-aware dtypes with melt (#15785) #20292

mroeschke commented Mar 12, 2018

WillAyd Mar 12, 2018

jreback Mar 12, 2018

codecov bot commented Mar 12, 2018 •

edited

Loading

jreback Mar 12, 2018

TomAugspurger Mar 12, 2018

jreback Mar 12, 2018

jreback Mar 12, 2018

jschendel Mar 12, 2018

jreback commented Mar 13, 2018

BUG: Retain tz-aware dtypes with melt (#15785) #20292

BUG: Retain tz-aware dtypes with melt (#15785) #20292

Conversation

mroeschke commented Mar 12, 2018

WillAyd Mar 12, 2018

Choose a reason for hiding this comment

jreback Mar 12, 2018

Choose a reason for hiding this comment

codecov bot commented Mar 12, 2018 • edited Loading

Codecov Report

jreback Mar 12, 2018

Choose a reason for hiding this comment

TomAugspurger Mar 12, 2018

Choose a reason for hiding this comment

jreback Mar 12, 2018

Choose a reason for hiding this comment

jreback Mar 12, 2018

Choose a reason for hiding this comment

jschendel Mar 12, 2018

Choose a reason for hiding this comment

jreback commented Mar 13, 2018

codecov bot commented Mar 12, 2018 •

edited

Loading