fixed issue#59670. DOC #59714

StaticAccess · 2024-09-05T01:46:02Z

closes DOC: Document that DataFrame.from_records()'s columns argument also acts as "include" #59670

cjerdonek · 2024-09-05T04:08:30Z

pandas/core/frame.py

@@ -2126,7 +2126,8 @@ def from_records(
            associated with them, this argument provides names for the
            columns. Otherwise this argument indicates the order of the columns
            in the result (any names not found in the data will become all-NA
-            columns).
+            columns).Additionally,specifying `columns` will limit the DataFrame to only
+            include the specified columns, similar to an "include" or "usecols" functionality.


I would suggest simplifying this language as follows:

Otherwise this argument indicates the order of the columns in the result (any names not found in the data will become all-NA columns) and limits the data to these columns if not all column names are provided.

I don't think it's worth mentioning "include" or "usecols" because it's better to keep the description brief.

Current:
Column names to use. If the passed data do not have names associated with them, this argument provides names for the columns. Otherwise, this argument indicates the order of the columns in the result (any names not found in the data will become all-NA columns).

Propose 1:
Column names to use. If the passed data do not have names associated with them, this argument provides names for the columns. Otherwise, this argument indicates the order of the columns in the result (any names not found in the data will become all-NA columns) and limits the data to these columns if not all column names are provided.

Proposed 2:
The columns argument specifies the column names for the DataFrame. If the data does not have column names, this argument assigns them. If the data already includes column names, this argument determines the order of the columns and limits the DataFrame to include only the columns listed. Any columns not specified will be excluded.

Would this revision work, or do you think there's a better way to phrase it? I’d love to hear your thoughts. Thanks so much for your time!

I recommend leaving the first sentences alone, since they're not part of this issue, and also not limit the scope of the argument to DataFrames. If it's not working for other types, that's something that can be fixed rather than documenting the bug / limitation (see e.g. this issue that was just filed: #59717).

Here's a polite reply to that message:

Thank you for your feedback! I agree, keeping the first sentences unchanged makes sense, and addressing the broader scope beyond just DataFrames is the right approach. If this affects other types, fixing the issue rather than documenting a limitation would indeed be the best course of action. Thanks for pointing that out!

mroeschke · 2024-09-30T16:14:34Z

Looks like this issue has already been addressed. Thanks for the PR but closing

StaticAccess added 2 commits September 5, 2024 01:41

fixed issue#59670. DOC

9b4351c

fixed issue#59670.Adding information on doc

7e856e1

cjerdonek reviewed Sep 5, 2024

View reviewed changes

mroeschke closed this Sep 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed issue#59670. DOC #59714

fixed issue#59670. DOC #59714

StaticAccess commented Sep 5, 2024

cjerdonek Sep 5, 2024

StaticAccess Sep 5, 2024

cjerdonek Sep 5, 2024

StaticAccess Sep 5, 2024

mroeschke commented Sep 30, 2024

fixed issue#59670. DOC #59714

fixed issue#59670. DOC #59714

Conversation

StaticAccess commented Sep 5, 2024

cjerdonek Sep 5, 2024

Choose a reason for hiding this comment

StaticAccess Sep 5, 2024

Choose a reason for hiding this comment

cjerdonek Sep 5, 2024

Choose a reason for hiding this comment

StaticAccess Sep 5, 2024

Choose a reason for hiding this comment

mroeschke commented Sep 30, 2024