CLN: Consistent and Annotated Return Type of _iterate_slices #28958

WillAyd · 2019-10-13T21:47:03Z

General pre-cursor to getting block management out of groupby. This is also a pre-cursor to fixing #21668 but needs to be coupled with a few more changes as a follow up

On master calls to _iterate_slices look up by label, potentially yielding a DataFrame if there were duplicated columns. This takes the surprise out of that and simply returns a Tuple of label / series for each item along the axis

@jbrockmendel

jreback

can u perf check for groupby

jreback · 2019-10-13T21:49:54Z

pandas/core/groupby/generic.py

-                slice_axis = self._selection_list
-            slicer = lambda x: self.obj[x]
+    def _iterate_slices(self) -> Iterator[Tuple[Hashable, Series]]:
+        obj = self._selected_obj.copy()


Mistake on my end. Didn’t want to mutate passed frame in the axis=1 case but I can get rid of this

Off topic: can we make the axis=1 case unnecessary by just transposing when we see it? Not sure exactly how that would work, but it would let us get rid of some lambda-wrapping, which would be nice.

jbrockmendel · 2019-10-13T22:09:45Z

Might want to look at DataFrameGroupBy._transform_item_by_item. If I'm reading it correctly, that will screw up if we have non-unique columns. (I think that's relevant to what you're doing here)

pandas/core/groupby/generic.py

WillAyd · 2019-10-13T23:32:52Z

can u perf check for groupby

Ran entire GroupBy benchmark and showed no changes

Might want to look at DataFrameGroupBy._transform_item_by_item

Yea looks it could be consolidated to use this, though leaving to a follow up

Also w.r.t. your comment on axis=1 case seems logical but might be some complications around managing selection state with that and how things are constructed. May take a look down the road, though if you see a way to make it work in the meantime makes sense to me

jreback · 2019-10-16T12:24:49Z

thanks

…dev#28958)

Consistent return type of _iterate_slices

4a60a68

WillAyd added the Groupby label Oct 13, 2019

jreback requested changes Oct 13, 2019

View reviewed changes

Removed copy

5e29af0

Annoted DataFrame.items as well

3f68794

simonjayhawkins reviewed Oct 13, 2019

View reviewed changes

pandas/core/groupby/generic.py Outdated Show resolved Hide resolved

WillAyd added 2 commits October 13, 2019 15:13

Iterator -> Iterable

4d94f63

mypy fixes

4862742

jreback added this to the 1.0 milestone Oct 16, 2019

jreback added the Typing type annotations, mypy/pyright type checking label Oct 16, 2019

jreback approved these changes Oct 16, 2019

View reviewed changes

jreback merged commit c903e5e into pandas-dev:master Oct 16, 2019

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

CLN: Consistent and Annotated Return Type of _iterate_slices (pandas-…

048043c

…dev#28958)

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

CLN: Consistent and Annotated Return Type of _iterate_slices (pandas-…

9af692f

…dev#28958)

bongolegend pushed a commit to bongolegend/pandas that referenced this pull request Jan 1, 2020

CLN: Consistent and Annotated Return Type of _iterate_slices (pandas-…

527f38c

…dev#28958)

WillAyd deleted the consistent-grp-iter branch January 16, 2020 00:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CLN: Consistent and Annotated Return Type of _iterate_slices #28958

CLN: Consistent and Annotated Return Type of _iterate_slices #28958

Uh oh!

WillAyd commented Oct 13, 2019

Uh oh!

jreback left a comment

Uh oh!

jreback Oct 13, 2019

Uh oh!

WillAyd Oct 13, 2019

Uh oh!

jbrockmendel Oct 13, 2019

Uh oh!

jbrockmendel commented Oct 13, 2019

Uh oh!

Uh oh!

WillAyd commented Oct 13, 2019 •

edited

Loading

Uh oh!

jreback commented Oct 16, 2019

Uh oh!

Uh oh!

Uh oh!

CLN: Consistent and Annotated Return Type of _iterate_slices #28958

CLN: Consistent and Annotated Return Type of _iterate_slices #28958

Uh oh!

Conversation

WillAyd commented Oct 13, 2019

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

jreback Oct 13, 2019

Choose a reason for hiding this comment

Uh oh!

WillAyd Oct 13, 2019

Choose a reason for hiding this comment

Uh oh!

jbrockmendel Oct 13, 2019

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Oct 13, 2019

Uh oh!

Uh oh!

WillAyd commented Oct 13, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jreback commented Oct 16, 2019

Uh oh!

Uh oh!

WillAyd commented Oct 13, 2019 •

edited

Loading