DOC: add Comparison with Excel #38554

afeld · 2020-12-18T05:22:13Z

closes Equivalent to the Excel fill handle #22993
tests added / passed
~~passes black pandas~~
~~passes git diff upstream/master -u -- "*.py" | flake8 --diff~~
~~whatsnew entry~~

Background

I teach a class on pandas for public policy students, and for many of them, spreadsheets are the only point of reference they have for working with tabular data. It would be very helpful to have (official) document comparing the two to point them to.

This is my first contribution to pandas and first time using reStructuredText, so feedback welcome. Thanks in advance!

TODOs

Making a running checklist to show what I've done already, and what else I plan to do. Hoping for some preliminary feedback (like is there still interest in having this page) before spending too much more time on it.

Happy to continue in this pull request until complete with all of them, or get this merged sooner than later and take care of the others in follow-up pull requests. Slight preference for the latter (some documentation being better than none, less to review at once, etc.), but open to whatever.

Questions

Which whatsnew file should I add to?
I noticed that doc/source/_static is in the .gitignore, but there are files checked into that folder. Is that intentional? - CLN: remove duplicate banklist.html file #38739
Some of the comparison documentation refers to "columns", while other refer to "Series". Is there a preference, or can they be used interchangeably?
Since spreadsheet software is largely interchangeable/compatible, would it make sense to make the page more general as "Comparison to spreadsheets"?
Thoughts about including slightly more subjective content, such as why one might want to use spreadsheets vs. pandas?

MarcoGorelli

Thanks @afeld - I built this locally and it generally looks good

To answer some questions:

Which whatsnew file should I add to?

It shouldn't be necessary to add one, that's usually for new features / bug and regression fixes

Since spreadsheet software is largely interchangeable/compatible, would it make sense to make the page more general as "Comparison to spreadsheets"?

I think so, perhaps it could be "Comparison with spreadsheets (e.g. Excel)"?

EDIT

I don't think this needs a whatsnew note

MarcoGorelli · 2020-12-18T11:33:42Z

doc/source/getting_started/comparison/comparison_boilerplate.rst

+If you're new to pandas, you might want to first read through :ref:`10 Minutes to pandas<10min>`
+to familiarize yourself with the library.
+
+As is customary, we import pandas and NumPy as follows:
+
+.. ipython:: python
+
+    import pandas as pd
+    import numpy as np


jreback · 2020-12-21T23:50:53Z

@afeld can you post a rendered picture here of the new docs page here

jreback

@afeld looks great. ideally if you can add the excel (and pandas) references. am happy to merge (and can keep adding things in later PRs).

jreback · 2020-12-21T23:52:09Z

doc/source/getting_started/comparison/comparison_with_excel.rst

+
+    ``DataFrame``, worksheet
+    ``Series``, column
+    ``Index``, row headings


should also indicate that the row labels themselves are akin to the default RangeIndex

Added a mention of RangeIndex below. That work?

jreback · 2020-12-21T23:53:16Z

doc/source/getting_started/comparison/comparison_with_excel.rst

+General terminology translation
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+.. csv-table::


it might be useful to include .png's here if it helps explain material

Sure, will put something together. Ok as a follow-up?

doc/source/getting_started/comparison/comparison_with_excel.rst

- Format Excel comparison code samples with [blacken-docs](https://github.com/asottile/blacken-docs) - Fix `SettingWithCopyWarning`s

- Mention apply() in documentation around deriving columns - Simplify code for doing column subtraction in Excel doc

afeld · 2020-12-26T01:11:48Z

Screenshot of the new page

Let me know what you think! Nudge on the questions up top. I know there are a lot of commits in here; let me know if you want me to squash.

More I want to do with the page, but hoping this is close to being merge-able.

afeld · 2020-12-26T01:13:22Z