DOC-Modified documentation for the issue GH42106 #42110

raghuabhishek · 2021-06-18T18:06:48Z

closes Use better example in pd.lookup replacement #42106

mzeitlin11 · 2021-06-18T18:29:09Z

doc/source/user_guide/indexing.rst

-    melt = melt.loc[melt['col'] == melt['variable'], 'value']
-    melt.reset_index(drop=True)
+    idx, cols = pd.factorize(df['col'])
+    df.reindex(cols, axis=1).to_numpy()[np.arange(len(df)), idx]


The text above the example will also need to be changed?

Yes, you are right. I have modified the text which is there above the example,as per my understanding. Can you review it.

show the result in the PR itself as this doesn't look to replicate

On pandas 1.1.5:

In [1]: df = pd.DataFrame({'col': ["A", "A", "B", "B"], ...: ...: 'A': [80, 23, np.nan, 22], ...: ...: 'B': [80, 55, 76, 67]}) ...: In [2]: df.lookup(df.index, df['col']) Out[2]: array([80., 23., 76., 67.])

the example on master:

In [2]: melt = df.melt('col') ...: melt = melt.loc[melt['col'] == melt['variable'], 'value'] ...: melt.reset_index(drop=True) Out[2]: 0 80.0 1 23.0 2 76.0 3 67.0 Name: value, dtype: float64

the example here:

In [3]: idx, cols = pd.factorize(df['col']) ...: df.reindex(cols, axis=1).to_numpy()[np.arange(len(df)), idx] Out[3]: array([80., 23., 76., 67.])

MarcoGorelli · 2021-06-19T07:57:07Z

doc/source/user_guide/indexing.rst

+and column labels, this can be achieved by ``pandas.factorize`` which extracts the distinct values of the intended column and can be indexed by passing the length of the dataframe to the 
+``numpy.arange`` function and the distinct value array.  For instance:


this is very complicated wording...how about

Suggested change

and column labels, this can be achieved by ``pandas.factorize`` which extracts the distinct values of the intended column and can be indexed by passing the length of the dataframe to the

``numpy.arange`` function and the distinct value array. For instance:

and column labels, this can be achieved by using ``pandas.factorize`` and NumPy indexing. For instance:

I have modified the file as per your suggestion.

MarcoGorelli

Looks good to me

raghuabhishek · 2021-06-19T19:04:04Z

Looks good to me

Hi MarcoGorelli, I just wanted to know when can this PR be merged with the master/main.

Commit for updating PR

raghuabhishek · 2021-06-20T13:54:51Z

Hi MarcoGorelli
Web/doc CI check was failing and as a result I had to update this PR with the master branch. Can you please approve the workflows.

MarcoGorelli · 2021-06-20T13:56:13Z

Hey @raghuabhishek

yup, done

I just wanted to know when can this PR be merged with the master/main.

this looks good to me, but I'll leave it open for a bit in case others have comments

raghuabhishek · 2021-06-20T13:57:31Z

Hey @raghuabhishek

yup, done

I just wanted to know when can this PR be merged with the master/main.

this looks good to me, but I'll leave it open for a bit in case others have comments

Thanks @MarcoGorelli

jreback · 2021-06-21T14:31:16Z

thanks @raghuabhishek

jreback · 2021-06-21T14:31:23Z

@meeseeksdev backport 1.3.x

…e GH42106

lumberbot-app · 2021-06-21T14:31:30Z

Something went wrong ... Please have a look at my logs.

…42161) Co-authored-by: Abhishek R <[email protected]>

DOC-Modified documentation for the issue GH42106

316262f

mzeitlin11 reviewed Jun 18, 2021

View reviewed changes

mzeitlin11 added Docs Indexing Related to indexing on series/frames, not to indexes themselves labels Jun 18, 2021

DOC-Modified documentation for the issue GH42106

9dd6e41

raghuabhishek requested a review from mzeitlin11 June 19, 2021 05:52

raghuabhishek mentioned this pull request Jun 19, 2021

Use better example in pd.lookup replacement #42106

Closed

DOC-Modified documentation for the issue GH42106

d3364d7

MarcoGorelli suggested changes Jun 19, 2021

View reviewed changes

raghuabhishek requested a review from MarcoGorelli June 19, 2021 18:19

MarcoGorelli approved these changes Jun 19, 2021

View reviewed changes

Merge remote-tracking branch 'upstream/master' into modify-pd.lookup-doc

5e327bc

Commit for updating PR

raghuabhishek requested a review from jreback June 20, 2021 12:51

raghuabhishek requested a review from MarcoGorelli June 20, 2021 13:54

jreback added this to the 1.3 milestone Jun 21, 2021

jreback merged commit 00af20a into pandas-dev:master Jun 21, 2021

meeseeksmachine mentioned this pull request Jun 21, 2021

Backport PR #42110 on branch 1.3.x (DOC-Modified documentation for the issue GH42106) #42161

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Jun 21, 2021

Backport PR pandas-dev#42110: DOC-Modified documentation for the issu…

6fd854a

…e GH42106

simonjayhawkins pushed a commit that referenced this pull request Jun 21, 2021

Backport PR #42110: DOC-Modified documentation for the issue GH42106 (#…

cdc235f

…42161) Co-authored-by: Abhishek R <[email protected]>

neinkeinkaffee pushed a commit to neinkeinkaffee/pandas that referenced this pull request Jun 21, 2021

DOC-Modified documentation for the issue GH42106 (pandas-dev#42110)

31a0957

JulianWgs pushed a commit to JulianWgs/pandas that referenced this pull request Jul 3, 2021

DOC-Modified documentation for the issue GH42106 (pandas-dev#42110)

3f33c74

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC-Modified documentation for the issue GH42106 #42110

DOC-Modified documentation for the issue GH42106 #42110

raghuabhishek commented Jun 18, 2021

mzeitlin11 Jun 18, 2021

raghuabhishek Jun 18, 2021

jreback Jun 19, 2021

MarcoGorelli Jun 19, 2021

MarcoGorelli Jun 19, 2021

raghuabhishek Jun 19, 2021

MarcoGorelli left a comment

raghuabhishek commented Jun 19, 2021

raghuabhishek commented Jun 20, 2021

MarcoGorelli commented Jun 20, 2021

raghuabhishek commented Jun 20, 2021

jreback commented Jun 21, 2021

jreback commented Jun 21, 2021

lumberbot-app bot commented Jun 21, 2021

		and column labels, this can be achieved by ``pandas.factorize`` which extracts the distinct values of the intended column and can be indexed by passing the length of the dataframe to the
		``numpy.arange`` function and the distinct value array. For instance:

DOC-Modified documentation for the issue GH42106 #42110

DOC-Modified documentation for the issue GH42106 #42110

Conversation

raghuabhishek commented Jun 18, 2021

mzeitlin11 Jun 18, 2021

Choose a reason for hiding this comment

raghuabhishek Jun 18, 2021

Choose a reason for hiding this comment

jreback Jun 19, 2021

Choose a reason for hiding this comment

MarcoGorelli Jun 19, 2021

Choose a reason for hiding this comment

MarcoGorelli Jun 19, 2021

Choose a reason for hiding this comment

raghuabhishek Jun 19, 2021

Choose a reason for hiding this comment

MarcoGorelli left a comment

Choose a reason for hiding this comment

raghuabhishek commented Jun 19, 2021

raghuabhishek commented Jun 20, 2021

MarcoGorelli commented Jun 20, 2021

raghuabhishek commented Jun 20, 2021

jreback commented Jun 21, 2021

jreback commented Jun 21, 2021

lumberbot-app bot commented Jun 21, 2021