ENH: Allow comment lines to be written to CSV outputs. Complements read_csv's comment param #53569
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
doc/source/whatsnew/vX.X.X.rst
file if fixing a bug or adding a new feature.There are many use cases (especially in the scientific community) where the best/only course of action is to enable to embedding of configuration parameters and/or other metadata into the beginning of a CSV file itself. These are typically prefaced with some comment-indication prefix such as
#
. This maintains human readability while attaching the metadata to the generated file itself.Pandas'
read_csv
method already implements a feature to read such files and ignore these lines when parsing the the data into a dataframe. This PR implements the complement of this feature. It allows users to write these metadata and/or comment lines in their CSV outputs as well.The
to_csv
method was extended to allow acomment
param matching that inread_csv
along with acomment_lines
param which takes an iterator of strings, each of which will be written to the beginning of the file as a single line, prior to and CSV header or body data being written.