Option for crosstab/pivot_table to include empty columns #3820

hayd · 2013-06-09T10:55:22Z

I don't think you can do this easily atm. crosstab is just a [convenience for pivot_table]((https://github.com/pydata/pandas/blob/master/pandas/tools/pivot.py#L218)...

http://stackoverflow.com/questions/17003034/missing-data-in-pandas-crosstab

a = np.array(['foo', 'foo', 'foo', 'bar', 'bar', 'foo', 'foo'], dtype=object)
b = np.array(['one', 'one', 'two', 'one', 'two', 'two', 'two'], dtype=object)
c = np.array(['dull', 'dull', 'dull', 'dull', 'dull', 'shiny', 'shiny'], dtype=object)

pd.crosstab(a, [b, c], rownames=['a'], colnames=['b', 'c'])
b     one   two       
c    dull  dull  shiny
a                     
bar     1     1      0
foo     2     1      2

# but wants
b     one        two       
c    dull  shiny dull  shiny
a                     
bar     1     0    1      0
foo     2     0    1      2

Maybe we could have an option in pivot_table (and crosstab) to include empty columns.

The text was updated successfully, but these errors were encountered:

hayd · 2013-07-02T09:27:12Z

Also see: https://groups.google.com/forum/#!searchin/pydata/Edwin$20Haver/pydata/Fda9mEGpVKM/rPYnJSUrs3YJ

jreback · 2013-07-02T10:49:22Z

I think something like this might be done in tools/merge
an outer join produces the Cartesian product of the keys

hayd · 2013-07-02T18:11:17Z

@jreback I'm not sure how to do that merge trick for multi-dimensional things?

My awful hack was:

def cartesian_product(X):
    lenX = map(len, X)
    cumprodX = np.cumproduct(lenX)
    a = np.insert(cumprodX, 0, 1)
    b = a[-1] / a[1:]
    return [np.tile(np.repeat(x, b[i]), 
                    np.product(a[i]))
               for i, x in enumerate(X)]

faster than itertools/compat.product for larger inputs (slower for smaller). Also works on things which aren't necessarily Series/DataFrames... basically I'm doing

cartesian_product(table.index.levels)

Have that as a component to a PR on the way, easy to drop in something else instead for that part.

hayd · 2013-07-02T22:51:44Z

this probably could be cythonized...

This was referenced Jun 9, 2013

Flesh out examples included in docstrings #3439

Closed

ENH: provide keyword to groupby to reindex by all possible indicies for a multi-index #3835

Closed

hayd mentioned this issue Jul 2, 2013

ENH: add dropna argument to pivot_table #4106

Merged

hayd closed this as completed in #4106 Jul 10, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option for crosstab/pivot_table to include empty columns #3820

Option for crosstab/pivot_table to include empty columns #3820

hayd commented Jun 9, 2013

hayd commented Jul 2, 2013

jreback commented Jul 2, 2013

hayd commented Jul 2, 2013

hayd commented Jul 2, 2013

Option for crosstab/pivot_table to include empty columns #3820

Option for crosstab/pivot_table to include empty columns #3820

Comments

hayd commented Jun 9, 2013

hayd commented Jul 2, 2013

jreback commented Jul 2, 2013

hayd commented Jul 2, 2013

hayd commented Jul 2, 2013