ENH: Add columns parameter to from_dict #19802

reidy-p · 2018-02-20T21:56:46Z

xref DEPR: Deprecate from_items #18529
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

@jorisvandenbossche it seems to have been quite straightforward to implement this but let me know if I missed anything.

jorisvandenbossche

Thanks a lot! Looks good, small comment

jorisvandenbossche · 2018-02-20T22:47:35Z

doc/source/whatsnew/v0.23.0.txt

@@ -589,6 +589,7 @@ Other API Changes
 - :func:`Series.to_csv` now accepts a ``compression`` argument that works in the same way as the ``compression`` argument in :func:`DataFrame.to_csv` (:issue:`18958`)
 - Set operations (union, difference...) on :class:`IntervalIndex` with incompatible index types will now raise a ``TypeError`` rather than a ``ValueError`` (:issue:`19329`)
 - :class:`DateOffset` objects render more simply, e.g. ``<DateOffset: days=1>`` instead of ``<DateOffset: kwds={'days': 1}>`` (:issue:`19403`)
+- :func:`DataFrame.from_dict` now accepts a ``columns`` argument that can be used to specify the column names when ``orient='index'`` is used (:issue:`18529`)


you can move it to the 'Other Enhancements' section

jorisvandenbossche · 2018-02-20T22:48:59Z

pandas/core/frame.py

+        elif orient == 'columns':
+            if columns is not None:
+                raise ValueError("cannot use columns parameter with "
+                                 "orient='columns'")


In principle, we could even allow it for orient='columns', as DataFrame(dict, columns=..) will just handle this fine.
But, not really sure of the value here.

Could you expand on what you mean? If I pass a dict with columns to DataFrame() I get:

In [1]: pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]}, columns = ['one', 'two']) Out[1]: Empty DataFrame Columns: [one, two] Index: []

Yes, that is exactly what I meant. The columns keyword does a 'reindexing' operation, and does not 'overwrite' if the data has keys as well. That's the current behaviour of DataFrame(..). So we could follow here the same pattern, but since this is sometimes also somewhat surprising behaviour, not sure if it is needed to have that here as well.
So therefore my doubt :-)

Ah yes I understand now. It makes more sense to me to raise a ValueError in the case with from_dict(..., orient='columns', columns=[...]) but I can change it to not raise if we want to be consistent with DataFrame(dict, columns=[..])

codecov · 2018-02-21T19:30:09Z

Codecov Report

Merging #19802 into master will decrease coverage by 0.02%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #19802      +/-   ##
==========================================
- Coverage   91.61%   91.59%   -0.03%     
==========================================
  Files         150      150              
  Lines       48892    48895       +3     
==========================================
- Hits        44792    44783       -9     
- Misses       4100     4112      +12

Flag	Coverage Δ
#multiple	`89.96% <100%> (-0.03%)`	⬇️
#single	`41.78% <20%> (-0.01%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/frame.py	`97.23% <100%> (ø)`	⬆️
pandas/plotting/_converter.py	`65.22% <0%> (-1.74%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update aa59954...e755462. Read the comment docs.

jreback · 2018-02-22T02:22:16Z

I believe we need to do a small doc change in dsintro.rst? (which motivated this). can you add on. lgtm. ping on green.

jorisvandenbossche · 2018-02-22T10:34:40Z

I believe we need to do a small doc change in dsintro.rst? (which motivated this). can you add on. lgtm. ping on green.

I had that already in a branch, so will push that separately.

jorisvandenbossche · 2018-02-22T10:34:55Z

Thanks @reidy-p !

jorisvandenbossche · 2018-02-22T10:47:52Z

See #19837

jorisvandenbossche reviewed Feb 20, 2018

View reviewed changes

jreback added Enhancement Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Feb 21, 2018

reidy-p added 2 commits February 21, 2018 19:27

ENH: Add columns parameter to from_dict

db65af9

Move whatsnew note and use OrderedDict in test

e755462

reidy-p force-pushed the from_dict_columns branch from f7a0879 to e755462 Compare February 21, 2018 19:29

jorisvandenbossche approved these changes Feb 21, 2018

View reviewed changes

jreback added this to the 0.23.0 milestone Feb 22, 2018

jorisvandenbossche merged commit 02f6308 into pandas-dev:master Feb 22, 2018

jorisvandenbossche mentioned this pull request Feb 22, 2018

DOC: remove deprecated from_items from dsintro docs #19837

Merged

harisbal pushed a commit to harisbal/pandas that referenced this pull request Feb 28, 2018

ENH: Add columns parameter to from_dict (pandas-dev#19802)

f3836c4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: Add columns parameter to from_dict #19802

ENH: Add columns parameter to from_dict #19802

Uh oh!

reidy-p commented Feb 20, 2018 •

edited

Loading

Uh oh!

jorisvandenbossche left a comment

Uh oh!

jorisvandenbossche Feb 20, 2018

Uh oh!

jorisvandenbossche Feb 20, 2018

Uh oh!

reidy-p Feb 21, 2018

Uh oh!

jorisvandenbossche Feb 21, 2018

Uh oh!

reidy-p Feb 21, 2018

Uh oh!

codecov bot commented Feb 21, 2018 •

edited

Loading

Uh oh!

jreback commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

Uh oh!

Uh oh!

ENH: Add columns parameter to from_dict #19802

ENH: Add columns parameter to from_dict #19802

Uh oh!

Conversation

reidy-p commented Feb 20, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jorisvandenbossche left a comment

Choose a reason for hiding this comment

Uh oh!

jorisvandenbossche Feb 20, 2018

Choose a reason for hiding this comment

Uh oh!

jorisvandenbossche Feb 20, 2018

Choose a reason for hiding this comment

Uh oh!

reidy-p Feb 21, 2018

Choose a reason for hiding this comment

Uh oh!

jorisvandenbossche Feb 21, 2018

Choose a reason for hiding this comment

Uh oh!

reidy-p Feb 21, 2018

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Feb 21, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jreback commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

jorisvandenbossche commented Feb 22, 2018

Uh oh!

Uh oh!

reidy-p commented Feb 20, 2018 •

edited

Loading

codecov bot commented Feb 21, 2018 •

edited

Loading