DOC: Fix examples in documentation #31472

ShaharNaveh · 2020-01-30T20:36:31Z

pep8speaks · 2020-01-30T20:36:39Z

Hello @MomIsBestFriend! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-02-27 17:35:24 UTC

ShaharNaveh · 2020-01-30T20:54:19Z

pandas/core/generic.py

-                     "pandas_version": "0.20.0"},
-          "data": [{"index": "row 1", "col 1": "a", "col 2": "b"},
-                   {"index": "row 2", "col 1": "c", "col 2": "d"}]}'
+        '{"schema":{"fields":[{"name":"index","type":"string"},{"name":"col 1","type":"string"},{"name":"col 2","type":"string"}],"primaryKey":["index"],"pandas_version":"0.20.0"},"data":[{"index":"row 1","col 1":"a","col 2":"b"},{"index":"row 2","col 1":"c","col 2":"d"}]}'


Because of lines like these (and basically every other output line, that's received by df.to_json()), I think it's a good idea that we include a "pprint" example under each one, so it will look somewhat like this:

def example(): """ Examples -------- Encoding with table schema >>> df = pd.DataFrame( ... [["a", "b"], ["c", "d"]], ... index=["row 1", "row 2"], ... columns=["col 1", "col 2"], ... ) >>> df.to_json(orient='table') '{"schema":{"fields":[{"name":"index","type":"string"},{"name":"col 1","type":"string"},{"name":"col 2","type":"string"}],"primaryKey":["index"],"pandas_version":"0.20.0"},"data":[{"index":"row 1","col 1":"a","col 2":"b"},{"index":"row 2","col 1":"c","col 2":"d"}]}' Pretty print version: >>> import json >>> result = df.to_json(orient="table") >>> parsed = json.loads(result) >>> json.dumps(parsed, indent=4) { "schema": { "fields": [ { "name": "index", "type": "string" }, { "name": "col 1", "type": "string" }, { "name": "col 2", "type": "string" } ], "primaryKey": [ "index" ], "pandas_version": "0.20.0" }, "data": [ { "index": "row 1", "col 1": "a", "col 2": "b" }, { "index": "row 2", "col 1": "c", "col 2": "d" } ] } """

alimcmaster1 · 2020-02-02T15:03:06Z

Flake8 errors in CI:

##[error]./pandas/core/generic.py:2217:89:E501:line too long (94 > 88 characters)

jreback

also pls merge master

jreback · 2020-02-09T20:56:42Z

pandas/core/generic.py

-          "index":["row 1","row 2"],
-          "data":[["a","b"],["c","d"]]}'
+        '{"columns":["col 1","col 2"],\
+"index":["row 1","row 2"],\


can you use ... instead here?

jreback · 2020-02-09T20:57:16Z

pandas/core/generic.py

-          "data": [{"index": "row 1", "col 1": "a", "col 2": "b"},
-                   {"index": "row 2", "col 1": "c", "col 2": "d"}]}'
+        '{"schema":{"fields":[{"name":"index","type":"string"},\
+{"name":"col 1","type":"string"},{"name":"col 2","type":"string"}],\


can you use ... here? (else could do a json prettify, e.g. json.dump(...., indent=4)

jreback · 2020-02-09T20:57:42Z

pandas/core/generic.py

-        freq                        2
-        first     2000-01-01 00:00:00
-        last      2010-01-01 00:00:00
+        count                      3


are these on the doctest list that we check?

Since 4d66fa8 they are.

alimcmaster1 · 2020-02-14T18:24:11Z

pandas/core/generic.py

@@ -9589,16 +9685,16 @@ def describe(

        Excluding numeric columns from a ``DataFrame`` description.

-        >>> df.describe(exclude=[np.number])
+        >>> df.describe(exclude=[np.number]) # doctest: +SKIP


How come you are adding doctest skips opposed to our current pytest -k approach? Think we should be consistent.

The problem with describe, is that the output can be random, If you know how to skip a specific line in the output, it would be great!

ShaharNaveh · 2020-02-22T10:40:46Z

Restarting azure

datapythonista · 2020-02-26T20:22:32Z

A bit unsure about this, but I think it's reasonable. At least we test all what we can test.

You have conflicts to fix. Also, for inline comments (with #), please leave two spaces instead of one before the hash. That's PEP-8, not sure why the PEP-8 validation is not complaining, I guess we don't have it active in the CI because there are failing cases.

Also, why describe output is not deterministic? I think it should.

REF: pandas-dev#31472 (comment)

datapythonista · 2020-02-27T18:59:44Z

Thanks for the fixes @MomIsBestFriend. About the describe, why do we need to skip the tests? Isn't the output deterministic?

@jreback, if you want to have another look and see if your comments were addressed...

ShaharNaveh · 2020-02-27T19:12:39Z

Thanks for the fixes @MomIsBestFriend. About the describe, why do we need to skip the tests? Isn't the output deterministic?

I haven't got it to work without the skip maybe one of the core developers knows something?

datapythonista · 2020-02-27T19:14:30Z

I haven't got it to work without the skip maybe one of the core developers knows something?

Do you know what was the error?

datapythonista · 2020-03-07T20:11:44Z

Thanks for fixing those @MomIsBestFriend.

Do you mind opening an issue for the describe? I think the output should be deterministic and we shouldn't need the SKIP there. Would be good to have a look and know what's wrong.

DOC: Fix examples in documentation

d6072be

ShaharNaveh commented Jan 30, 2020

View reviewed changes

ShaharNaveh added the Docs label Jan 30, 2020

MomIsBestFriend added 3 commits February 1, 2020 12:28

Fix merge conflicts

65d29cd

Merge remote-tracking branch 'upstream/master' into DOC-core-py-1

a62da8e

Fixed pandas/core/base.py

9e5af03

MomIsBestFriend added 2 commits February 7, 2020 12:31

Merge remote-tracking branch 'upstream/master' into DOC-core-py-1

4b90c82

Fixed lint issues

316f850

jreback requested changes Feb 9, 2020

View reviewed changes

MomIsBestFriend added 4 commits February 14, 2020 12:19

Fixed merge conflicts

a81897d

Reverted change from fixing merge conflicts

144caa8

to_json examples are now pretty printed

f3dd043

Added checks to the CI

4d66fa8

alimcmaster1 reviewed Feb 14, 2020

View reviewed changes

MomIsBestFriend added 3 commits February 15, 2020 13:29

Merge remote-tracking branch 'upstream/master' into DOC-core-py-1

a04c848

Skipping "clipboard" examples as there's no clipborad in the CI

4c304a6

Merge remote-tracking branch 'upstream/master' into DOC-core-py-1

52ecbef

ShaharNaveh closed this Feb 22, 2020

ShaharNaveh reopened this Feb 22, 2020

ShaharNaveh and others added 4 commits February 27, 2020 19:21

Merge branch 'master' into DOC-core-py-1

9856dcc

Reverted the wrong merge error

d489b45

Added extra space to the doctest skip comment

44ca5d5

REF: pandas-dev#31472 (comment)

Removed a single space

94ec83d

datapythonista merged commit 6852012 into pandas-dev:master Mar 7, 2020

ShaharNaveh mentioned this pull request Mar 8, 2020

DataFrame.describe() output is not deterministic #32528

Open

simonjayhawkins added this to the 1.1 milestone Mar 9, 2020

ShaharNaveh mentioned this pull request Mar 13, 2020

Improve docs for pandas_version in Dataframe to_json(orient='table') #26637

Closed

ShaharNaveh deleted the DOC-core-py-1 branch March 14, 2020 14:05

SeeminSyed pushed a commit to CSCD01-team01/pandas that referenced this pull request Mar 22, 2020

DOC: Fix examples in documentation (pandas-dev#31472)

1586d41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: Fix examples in documentation #31472

DOC: Fix examples in documentation #31472

ShaharNaveh commented Jan 30, 2020 •

edited

Loading

pep8speaks commented Jan 30, 2020 •

edited

Loading

ShaharNaveh Jan 30, 2020

alimcmaster1 commented Feb 2, 2020

jreback left a comment

jreback Feb 9, 2020

jreback Feb 9, 2020

jreback Feb 9, 2020

ShaharNaveh Feb 14, 2020

alimcmaster1 Feb 14, 2020

ShaharNaveh Feb 14, 2020

ShaharNaveh commented Feb 22, 2020

datapythonista commented Feb 26, 2020

datapythonista commented Feb 27, 2020

ShaharNaveh commented Feb 27, 2020

datapythonista commented Feb 27, 2020

datapythonista commented Mar 7, 2020

DOC: Fix examples in documentation #31472

DOC: Fix examples in documentation #31472

Conversation

ShaharNaveh commented Jan 30, 2020 • edited Loading

pep8speaks commented Jan 30, 2020 • edited Loading

Comment last updated at 2020-02-27 17:35:24 UTC

ShaharNaveh Jan 30, 2020

Choose a reason for hiding this comment

alimcmaster1 commented Feb 2, 2020

jreback left a comment

Choose a reason for hiding this comment

jreback Feb 9, 2020

Choose a reason for hiding this comment

jreback Feb 9, 2020

Choose a reason for hiding this comment

jreback Feb 9, 2020

Choose a reason for hiding this comment

ShaharNaveh Feb 14, 2020

Choose a reason for hiding this comment

alimcmaster1 Feb 14, 2020

Choose a reason for hiding this comment

ShaharNaveh Feb 14, 2020

Choose a reason for hiding this comment

ShaharNaveh commented Feb 22, 2020

datapythonista commented Feb 26, 2020

datapythonista commented Feb 27, 2020

ShaharNaveh commented Feb 27, 2020

datapythonista commented Feb 27, 2020

datapythonista commented Mar 7, 2020

ShaharNaveh commented Jan 30, 2020 •

edited

Loading

pep8speaks commented Jan 30, 2020 •

edited

Loading