ENH: drop function now has errors keyword for non-existing column handling #6736

sinhrks · 2014-03-29T12:21:38Z

Closes #5300.

Currently drop raises ValueError when non-existing label is passed. I think it is useful if drop has an option to suppress error and drop existing labels only.

For example, I sometimes process lots of files which has slightly different columns, and want to drop if data has unnecessary columns. Previously, I have to prepare different drop arguments checking columns existing each data.

jreback · 2014-03-29T12:24:39Z

this could be added on after #6599
@hayd

jtratner · 2014-03-30T05:22:22Z

pandas/tests/test_frame.py

@@ -6762,6 +6762,26 @@ def test_drop_names(self):
            self.assertEqual(obj.columns.name, 'second')
        self.assertEqual(list(df.columns), ['d', 'e', 'f'])

+        self.assertRaises(ValueError, df.drop, ['g'])
+        self.assertRaises(ValueError, df.drop, ['g'], 1)


let's give at least one of these an errors='raise' argument (and add it to an existing test that doesn't raise) just for completeness.

jtratner · 2014-03-30T05:25:57Z

This all looks fine to me - thanks for the patch.

We might want to reduce the number of test cases (just because we don't necessarily need to duplicate), but aside from that, quite good. - somebody else want to take a look too? I may be a bit rusty :P

hayd · 2014-03-30T06:59:10Z

Looks all good to me. But let's wait for #6599. :)

jreback · 2014-03-30T14:12:00Z

yep....so @hayd will revisit after that #6599 merge

jreback · 2015-01-25T22:58:22Z

@sinhrks this idea is ok, can you rebase

sinhrks · 2015-01-26T14:45:56Z

@jreback Yes, rebased.

jreback · 2015-01-29T11:28:52Z

pandas/core/index.py

@@ -2226,7 +2228,9 @@ def drop(self, labels):
        indexer = self.get_indexer(labels)
        mask = indexer == -1
        if mask.any():
-            raise ValueError('labels %s not contained in axis' % labels[mask])


hmm, shouldn't this raise a KeyError? @jorisvandenbossche @shoyer to be consistent with say .loc and other indexers? @hayd

This is kind of a gray area to me. We're not doing actual indexing here. I would probably stick with ValueError, especially to remain consistent with the existing implementation.

jreback · 2015-03-05T23:47:22Z

have to think about this

jreback · 2015-04-04T18:59:52Z

@sinhrks can you rebase this.

…dling

sinhrks · 2015-04-04T21:53:26Z

Sure, rebased.

ENH: drop function now has errors keyword for non-existing column handling

jreback · 2015-04-08T14:36:03Z

@sinhrks thanks!

jreback added API Design labels Mar 29, 2014

jtratner reviewed Mar 30, 2014
View reviewed changes

jreback added this to the 0.14.0 milestone Mar 30, 2014

sinhrks mentioned this pull request Apr 2, 2014

ENH: rename function now has errors keyword #6767

Closed

jreback modified the milestones: 0.14.1, 0.14.0 May 5, 2014

jreback modified the milestones: 0.15.0, 0.14.1 Jun 5, 2014

sinhrks force-pushed the drop branch from 71ee633 to 016453f Compare November 22, 2014 07:10

sinhrks force-pushed the drop branch from 016453f to 0c03326 Compare January 19, 2015 15:42

sinhrks force-pushed the drop branch from 0c03326 to ed0f310 Compare January 26, 2015 13:52

jreback reviewed Jan 29, 2015
View reviewed changes

sinhrks force-pushed the drop branch 2 times, most recently from 44421ca to 58b4a4d Compare February 15, 2015 03:29

jreback modified the milestones: 0.16.1, 0.16.0 Mar 5, 2015

sinhrks force-pushed the drop branch 2 times, most recently from fc6794d to 3fdca30 Compare April 1, 2015 12:05

ENH: drop function now has errors keyword for non-existing column han…

a2620d7

…dling

sinhrks force-pushed the drop branch from 3fdca30 to a2620d7 Compare April 4, 2015 21:49

jreback added a commit that referenced this pull request Apr 8, 2015

Merge pull request #6736 from sinhrks/drop

a4ae0cf

ENH: drop function now has errors keyword for non-existing column handling

jreback merged commit a4ae0cf into pandas-dev:master Apr 8, 2015

sinhrks deleted the drop branch April 11, 2015 13:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: drop function now has errors keyword for non-existing column handling #6736

ENH: drop function now has errors keyword for non-existing column handling #6736

Uh oh!

sinhrks commented Mar 29, 2014

Uh oh!

jreback commented Mar 29, 2014

Uh oh!

jtratner Mar 30, 2014

Uh oh!

jtratner commented Mar 30, 2014

Uh oh!

hayd commented Mar 30, 2014

Uh oh!

jreback commented Mar 30, 2014

Uh oh!

jreback commented Jan 25, 2015

Uh oh!

sinhrks commented Jan 26, 2015

Uh oh!

jreback Jan 29, 2015

Uh oh!

shoyer Jan 29, 2015

Uh oh!

jreback commented Mar 5, 2015

Uh oh!

jreback commented Apr 4, 2015

Uh oh!

sinhrks commented Apr 4, 2015

Uh oh!

jreback commented Apr 8, 2015

Uh oh!

Uh oh!

Uh oh!

ENH: drop function now has errors keyword for non-existing column handling #6736

ENH: drop function now has errors keyword for non-existing column handling #6736

Uh oh!

Conversation

sinhrks commented Mar 29, 2014

Uh oh!

jreback commented Mar 29, 2014

Uh oh!

jtratner Mar 30, 2014

Choose a reason for hiding this comment

Uh oh!

jtratner commented Mar 30, 2014

Uh oh!

hayd commented Mar 30, 2014

Uh oh!

jreback commented Mar 30, 2014

Uh oh!

jreback commented Jan 25, 2015

Uh oh!

sinhrks commented Jan 26, 2015

Uh oh!

jreback Jan 29, 2015

Choose a reason for hiding this comment

Uh oh!

shoyer Jan 29, 2015

Choose a reason for hiding this comment

Uh oh!

jreback commented Mar 5, 2015

Uh oh!

jreback commented Apr 4, 2015

Uh oh!

sinhrks commented Apr 4, 2015

Uh oh!

jreback commented Apr 8, 2015

Uh oh!

Uh oh!