Method/option for deduplicating index, a la drop_duplicates #2825
Comments
The workarounds I've found for emulating Maybe drop_duplicates could have a new boolean parameter which shortcuts like so
Of course the I'm wholly ignorant of what implementation barriers exist; this is just speculation about how it could be done. |
Now we can do as below. Should document somewhere.
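The snippet that followed this comment did not survive here. A minimal sketch of one common way to drop rows with repeated index labels, assuming `Index.duplicated` is what was meant; the frame, its labels, and the variable names are invented for illustration:

```python
import pandas as pd

# Hypothetical example data: the label "a" appears twice in the index.
df = pd.DataFrame({"value": [10, 20, 30, 40]}, index=["a", "b", "a", "c"])

# Index.duplicated() marks every repeat of a label after its first occurrence.
dup_mask = df.index.duplicated()

# Negating the mask keeps only the first row seen for each index label.
deduped = df[~dup_mask]
print(deduped)
```

Unlike `drop_duplicates`, which compares column values, this filters on the index labels only, so rows with identical labels but different data are still collapsed to one.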
|
Seeing how the index has |
I hadn't noticed but totally agree. Labeling the two with higher index numbers as dups is arbitrary. On August 11, 2015 1:05:51 PM PDT, Christoph Buchner [email protected] wrote:
|
@bilderbuchi You can specify the behavior with
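The option named in this reply was lost in extraction. Going by the later remark about "the boolean False", it was presumably the `keep` argument of `duplicated` / `drop_duplicates`. A small sketch under that assumption (pandas 0.17 or later), showing the three settings on a made-up index:

```python
import pandas as pd

# Hypothetical index in which the label "a" occurs three times.
idx = pd.Index(["a", "b", "a", "c", "a"])

# keep="first" (the default): every occurrence except the first is flagged.
flag_later = idx.duplicated(keep="first")

# keep="last": every occurrence except the last is flagged.
flag_earlier = idx.duplicated(keep="last")

# keep=False: all occurrences of a repeated label are flagged.
flag_all = idx.duplicated(keep=False)

print(flag_later, flag_earlier, flag_all)
```

With `keep=False` no single occurrence is singled out as the "original", which addresses the concern that marking only the later rows as duplicates is arbitrary.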
|
Ah - that's subtle, but a very good compromise.
|
ah, interesting, thanks for the clarification. |
Thanks for confirming. #10236 is included in v0.17, which is not released yet, so we can discuss if you have better alternatives. |
Thanks. Well, as I said, I'd have gone for a 'none' string instead of the boolean False - as a Python intermediate I find neither more pythonic than the other. I'm just a casual user of pandas, though, and I'm sure that @jreback has more investment in pandas, so I guess that it fits better with the general pandas API or somesuch... |
@bilderbuchi having a string and a Boolean possible in a keyword is no big deal. 'none', however, is very confusing. In any event, @sinhrks put up a nice doc about how to do this. |
see e.g. #2763