add dropna=False to crosstab example #21413

testvinder · 2018-06-10T20:03:30Z

>>> foo = pd.Categorical(['a', 'b'], categories=['a', 'b', 'c'])
>>> bar = pd.Categorical(['d', 'e'], categories=['d', 'e', 'f'])
>>> crosstab(foo, bar)  # 'c' and 'f' are not represented in the data,
    ...                     # but they still will be counted in the output
col_0  d  e  f
row_0
a      1  0  0
b      0  1  0
c      0  0  0

The above example code does not produce the output shown because dropna=True is default. Changing crosstab(foo, bar) to crosstab(foo, bar, dropna=False) fixes that and produces the shown output (which is also the expected and correct output).

closes #xxxx
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

pep8speaks · 2018-06-10T20:03:33Z

Hello @testvinder! Thanks for updating the PR.

Cheers ! There are no PEP8 issues in this Pull Request. 🍻

Comment last updated on June 11, 2018 at 09:29 Hours UTC

uds5501

Please refer to the comments.

@gfyoung , are the suggestions viable?

uds5501 · 2018-06-10T21:08:38Z

pandas/core/reshape/pivot.py

@@ -429,8 +429,9 @@ def crosstab(index, columns, values=None, rownames=None, colnames=None,

    >>> foo = pd.Categorical(['a', 'b'], categories=['a', 'b', 'c'])
    >>> bar = pd.Categorical(['d', 'e'], categories=['d', 'e', 'f'])
-    >>> crosstab(foo, bar)  # 'c' and 'f' are not represented in the data,
-    ...                     # but they still will be counted in the output
+    >>> crosstab(foo, bar, dropna=False)  # 'c' and 'f' are not represented


@testvinder , just a suggestion, you could've actually just changed the output regarding the given code and mentioned in comments about the default value of dropnabeing set to True

I would actually just add an example instead of modifying this one.

@uds5501 I actually think it's better to set ´dropnatoFalse`. The whole point of specifying the extra categories is to include them in the crosstab. It's a small change but the original is very confusing because the output cannot be reproduced.

@testvinder : Ah, yes, your point is well-taken. Can you update the original example with the correct output and add your example with dropna=False ?

@gfyoung Sure, will do.

codecov · 2018-06-10T22:04:23Z

Codecov Report

Merging #21413 into master will decrease coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #21413      +/-   ##
==========================================
- Coverage   91.89%   91.89%   -0.01%     
==========================================
  Files         153      153              
  Lines       49596    49596              
==========================================
- Hits        45576    45574       -2     
- Misses       4020     4022       +2

Flag	Coverage Δ
#multiple	`90.29% <ø> (-0.01%)`	⬇️
#single	`41.86% <ø> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/core/reshape/pivot.py	`97.03% <ø> (ø)`	⬆️
pandas/util/testing.py	`84.6% <0%> (-0.21%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 415012f...4d7e36f. Read the comment docs.

gfyoung · 2018-06-11T00:06:14Z

I completely misread this PR and thought this was a larger change than it actually was. Sorry!

testvinder · 2018-06-11T06:30:07Z

@gfyoung yes, it's really just a tiny change that makes the documentation more clear.

gfyoung · 2018-06-11T08:32:13Z

@testvinder : I just realized: your PR should be against the master branch, not 0.23.x. Could you fix that?

testvinder · 2018-06-11T08:49:01Z

@gfyoung Done:-) Sorry for the mistake. This is all new to me.

gfyoung · 2018-06-11T09:29:41Z

@testvinder : I re-applied your commits against master because the base-switching was a lot uglier than I was hoping it would be.

gfyoung

With the base fixed, LGTM!

cc @jreback

jreback · 2018-06-12T11:31:14Z

thanks @testvinder

jorisvandenbossche · 2018-06-12T11:37:55Z

@jreback just a small note, can you make sure the final commits gets a reasonable message? (normally you should see it in the github merge box), because now it is "Reapply all patches by @testvinder against master" :-)

jreback · 2018-06-12T11:44:51Z

normally i remove all comments - guess missed that one

jorisvandenbossche · 2018-06-12T11:50:42Z

Yeah, it's not about the comments (those go into the commit message body) but the title. But it's a bit strange github did it this wrongly, because normally the title is fine (probably because the commit was rebased and squashed)

uds5501 reviewed Jun 10, 2018

View reviewed changes

gfyoung added the Categorical Categorical Data Type label Jun 10, 2018

gfyoung requested review from jreback and removed request for jreback June 10, 2018 22:04

gfyoung changed the base branch from 0.23.x to master June 11, 2018 08:22

gfyoung changed the base branch from master to 0.23.x June 11, 2018 08:22

testvinder changed the base branch from 0.23.x to master June 11, 2018 08:47

gfyoung changed the base branch from master to 0.23.x June 11, 2018 09:24

Reapply all patches by @testvinder against master

4d7e36f

gfyoung force-pushed the patch-1 branch from e92d816 to 07c3d85 Compare June 11, 2018 09:28

gfyoung changed the base branch from 0.23.x to master June 11, 2018 09:28

gfyoung force-pushed the patch-1 branch from 07c3d85 to 4d7e36f Compare June 11, 2018 09:29

gfyoung approved these changes Jun 11, 2018

View reviewed changes

gfyoung added the Docs label Jun 11, 2018

jreback added this to the 0.23.2 milestone Jun 12, 2018

jreback merged commit ffffa5c into pandas-dev:master Jun 12, 2018

jorisvandenbossche modified the milestones: 0.23.2, 0.24.0 Jun 12, 2018

david-liu-brattle-1 pushed a commit to david-liu-brattle-1/pandas that referenced this pull request Jun 18, 2018

Reapply all patches by @testvinder against master (pandas-dev#21413)

12720db

Sup3rGeo pushed a commit to Sup3rGeo/pandas that referenced this pull request Oct 1, 2018

Reapply all patches by @testvinder against master (pandas-dev#21413)

aac047b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add dropna=False to crosstab example #21413

add dropna=False to crosstab example #21413

testvinder commented Jun 10, 2018 •

edited

Loading

pep8speaks commented Jun 10, 2018 •

edited

Loading

uds5501 left a comment

uds5501 Jun 10, 2018

gfyoung Jun 11, 2018

testvinder Jun 11, 2018

gfyoung Jun 11, 2018

testvinder Jun 11, 2018

codecov bot commented Jun 10, 2018 •

edited

Loading

gfyoung commented Jun 11, 2018

testvinder commented Jun 11, 2018

gfyoung commented Jun 11, 2018

testvinder commented Jun 11, 2018

gfyoung commented Jun 11, 2018

gfyoung left a comment

jreback commented Jun 12, 2018

jorisvandenbossche commented Jun 12, 2018

jreback commented Jun 12, 2018

jorisvandenbossche commented Jun 12, 2018

add dropna=False to crosstab example #21413

add dropna=False to crosstab example #21413

Conversation

testvinder commented Jun 10, 2018 • edited Loading

pep8speaks commented Jun 10, 2018 • edited Loading

Comment last updated on June 11, 2018 at 09:29 Hours UTC

uds5501 left a comment

Choose a reason for hiding this comment

uds5501 Jun 10, 2018

Choose a reason for hiding this comment

gfyoung Jun 11, 2018

Choose a reason for hiding this comment

testvinder Jun 11, 2018

Choose a reason for hiding this comment

gfyoung Jun 11, 2018

Choose a reason for hiding this comment

testvinder Jun 11, 2018

Choose a reason for hiding this comment

codecov bot commented Jun 10, 2018 • edited Loading

Codecov Report

gfyoung commented Jun 11, 2018

testvinder commented Jun 11, 2018

gfyoung commented Jun 11, 2018

testvinder commented Jun 11, 2018

gfyoung commented Jun 11, 2018

gfyoung left a comment

Choose a reason for hiding this comment

jreback commented Jun 12, 2018

jorisvandenbossche commented Jun 12, 2018

jreback commented Jun 12, 2018

jorisvandenbossche commented Jun 12, 2018

testvinder commented Jun 10, 2018 •

edited

Loading

pep8speaks commented Jun 10, 2018 •

edited

Loading

codecov bot commented Jun 10, 2018 •

edited

Loading