Modernize IO using only the API #164

bashtage · 2022-07-22T18:02:59Z

This is an attempt to modernize IO using only the API functions. Just a start

pandas-stubs/io/parsers/readers.pyi

twoertwein · 2022-07-22T21:29:53Z

pandas-stubs/io/parsers/readers.pyi

+    chunksize = ...  # Incomplete
+    nrows = ...  # Incomplete
+    squeeze = ...  # Incomplete
+    handles = ...  # Incomplete


I wouldn't mind removing those. Might be good to be in sync with @phofl's PR pandas-dev/pandas#46308

twoertwein · 2022-07-22T21:31:37Z

pandas-stubs/io/pickle.pyi

-    obj,
-    filepath_or_buffer: FilePathOrBuffer,
-    compression: str | None = ...,
+    obj: Any,


I think object might be better here than Any. I think the rule of thumb is use object if it can literally be any object and use Any when it is too complex to type.

Looking at typeshed they use Any both ways for dump and load.

https://github.com/python/typeshed/blob/d4287a7f08305d95d24d3e0487941ec9bb35f4ae/stdlib/pickle.pyi#L121-L128

Seems object can cause problems with type checking

https://stackoverflow.com/a/39817126/2551705

although this is probably more to do with loading than saving.

Dr-Irv

Can you confirm that read_csv() and read_table() are the same as what was previously in parsers.pyi ?

Make pickle consistent with upstream pandas Only include what is in the API

Fully add read table and supporting classes

twoertwein · 2022-08-10T20:01:28Z

@bashtage might be easier to break it into multiple PRs: easier to review and probably also easier for you (less rebasing since the individual PRs will be closed more quickly)

twoertwein reviewed Jul 22, 2022

View reviewed changes

pandas-stubs/io/parsers/readers.pyi Outdated Show resolved Hide resolved

twoertwein reviewed Jul 22, 2022

View reviewed changes

Dr-Irv reviewed Jul 23, 2022

View reviewed changes

bashtage and others added 4 commits July 25, 2022 00:20

ENH: Improve io/pickle

b4eadf5

Make pickle consistent with upstream pandas Only include what is in the API

ENH: Add read_table

2e25467

Fully add read table and supporting classes

ENH: Sync to/read gbp

65120b7

ENH: Add and update XML io interface

3f26825

bashtage force-pushed the io-api branch from 3ae5bfd to 3f26825 Compare July 25, 2022 00:08

bashtage added 5 commits July 25, 2022 10:07

ENH: Improve and clean stata io functions

79da23b

ENH: Improve and clean orc io functions

8a78a32

ENH: Improve and clean sql io functions

f60035f

CLN: Remove non-public class

d089b8b

ENH: Verify clipboard io functions

d1b3b1c

bashtage force-pushed the io-api branch from 9d9dc41 to d1b3b1c Compare July 25, 2022 12:53

bashtage added 2 commits July 26, 2022 08:40

ENH: Verify json functions

18cb482

ENH: Verify HDF functions

698190f

bashtage closed this Aug 22, 2022

bashtage deleted the io-api branch August 23, 2022 09:07

bashtage restored the io-api branch August 23, 2022 09:07

bashtage deleted the io-api branch September 1, 2022 06:32

DarioHett mentioned this pull request Dec 21, 2022

Series.to_json - Overload variants arguments raise error despite match #481

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modernize IO using only the API #164

Modernize IO using only the API #164

bashtage commented Jul 22, 2022 •

edited

Loading

twoertwein Jul 22, 2022

twoertwein Jul 22, 2022

bashtage Jul 24, 2022

bashtage Jul 24, 2022

Dr-Irv left a comment

twoertwein commented Aug 10, 2022

Modernize IO using only the API #164

Modernize IO using only the API #164

Conversation

bashtage commented Jul 22, 2022 • edited Loading

twoertwein Jul 22, 2022

Choose a reason for hiding this comment

twoertwein Jul 22, 2022

Choose a reason for hiding this comment

bashtage Jul 24, 2022

Choose a reason for hiding this comment

bashtage Jul 24, 2022

Choose a reason for hiding this comment

Dr-Irv left a comment

Choose a reason for hiding this comment

twoertwein commented Aug 10, 2022

bashtage commented Jul 22, 2022 •

edited

Loading