Skip to content

ENH: Distinguish widths argument in read_fwf #35057

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 633 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
633 commits
Select commit Hold shift + click to select a range
ecc5015
TYP/CLN: cleanup `_openpyxl.py`, add type annotation #36021 (#36022)
fangchenli Sep 1, 2020
2a624dc
CLN: _wrap_applied_output (#36053)
rhshadrach Sep 1, 2020
4f2251c
REF: implement Block._replace_list (#36020)
jbrockmendel Sep 1, 2020
bdd5d4c
BUG: PeriodIndex.get_loc incorrectly raising ValueError instead of Ke…
jbrockmendel Sep 1, 2020
67429d4
CI: Unpin MyPy (#36012)
simonjayhawkins Sep 1, 2020
79919db
ENH: vendor typing_extensions (#36000)
jbrockmendel Sep 1, 2020
0bc407a
Added numba as an argument (#35778)
erfannariman Sep 1, 2020
075ed8b
REF: handle axis=None case inside DataFrame.any/all to simplify _redu…
jbrockmendel Sep 2, 2020
2056900
BUG: BlockSlider not clearing index._cache (#35937)
jbrockmendel Sep 2, 2020
73c1d32
BUG: NDFrame.replace wrong exception type, wrong return when size==0 …
jbrockmendel Sep 2, 2020
1fc244f
Comma cleanup for #35925 (#36058)
JonathanShrek Sep 2, 2020
19c3d40
API: replace dropna=False option with na_sentinel=None in factorize (…
charlesdong1991 Sep 2, 2020
7d047c2
TYP: update setup.cfg (#36067)
simonjayhawkins Sep 2, 2020
09c6124
TYP: statically define attributes in plotting._matplotlib.core (#36068)
simonjayhawkins Sep 2, 2020
c41e500
BUG: frame._item_cache not cleared when Series is altered (#36051)
jbrockmendel Sep 2, 2020
abe6ad8
CLN remove trailing commas (#36057)
tiagohonorato Sep 2, 2020
a0d92b6
CLN remove unnecessary trailing commas in groupby tests (#36059)
xcz011 Sep 2, 2020
675a541
CLN: rename private functions used across modules (#36049)
jbrockmendel Sep 2, 2020
caca6e1
TYP: misc typing in core\indexes\base.py (#35991)
simonjayhawkins Sep 2, 2020
f635d17
DOC: Fix typo of `=!` to `!=` in docstring (#36077)
daytonb Sep 2, 2020
00483a1
cleared commas (#36073)
Sep 2, 2020
5227245
TYP: Postponed Evaluation of Annotations (PEP 563) (#36034)
simonjayhawkins Sep 2, 2020
76f74d5
BUG: Index.get_slice_bounds does not accept datetime.date or tz naive…
mroeschke Sep 2, 2020
81e3236
REF: use BlockManager.apply for cython_agg_blocks, apply_blockwise (#…
jbrockmendel Sep 3, 2020
76eb314
Optionally disallow duplicate labels (#28394)
TomAugspurger Sep 3, 2020
361166f
BUG/ENH: compression for google cloud storage in to_csv (#35681)
twoertwein Sep 3, 2020
309018c
CI: MyPy fixup (#36085)
simonjayhawkins Sep 3, 2020
f1d9497
Update contributing_docstring.rst (#36087)
albertvillanova Sep 3, 2020
a0c8168
DOC: minor fixes to whatsnew\v1.1.2.rst (#36086)
simonjayhawkins Sep 3, 2020
8b38239
CLN remove unnecessary trailing commas in pandas/io (#36052)
Sep 3, 2020
f3408ea
DOC: add mypy version to whatsnew\v1.2.0.rst (#36090)
simonjayhawkins Sep 3, 2020
497ede8
DOC: Add Notes about difference to numpy behaviour for ddof in std() …
timhunderwood Sep 3, 2020
b80dcbc
BUG: groupby and agg on read-only array gives ValueError: buffer sour…
jeet-parekh Sep 4, 2020
b53dc8f
CLN: use IS64 instead of is_platform_32bit #36108 (#36109)
fangchenli Sep 4, 2020
9ad6363
REGR: ensure closed attribute of IntervalIndex is preserved in pickle…
jorisvandenbossche Sep 4, 2020
b8e4a09
TYP: misc fixes for numpy types 2 (#36099)
simonjayhawkins Sep 4, 2020
1a21836
TYP: io (#36120)
jbrockmendel Sep 4, 2020
9dc1fd7
BUG: incorrect year returned in isocalendar for certain dates (#36050)
asishm Sep 4, 2020
7d73641
REF: use BlockManager.apply for DataFrameGroupBy.count (#35924)
jbrockmendel Sep 4, 2020
fadb72c
REGR: fix consolidation/cache issue with take operation (#36114)
jorisvandenbossche Sep 4, 2020
bc0e507
De-privatize (#36107)
jbrockmendel Sep 4, 2020
3ca6d8f
De-privatize functions in io.excel (#36104)
jbrockmendel Sep 4, 2020
6016b17
TYP: activate Check for missing error codes (#36088)
simonjayhawkins Sep 4, 2020
9820c42
CLN: remove xfails/skips for no-longer-supported numpys (#36128)
jbrockmendel Sep 4, 2020
de9c771
De-privatize (#36130)
jbrockmendel Sep 5, 2020
cab3e0e
TYP: misc fixes for numpy types (#36098)
simonjayhawkins Sep 5, 2020
6b2a860
Comma cleanup (#36082)
JonathanShrek Sep 5, 2020
b73489f
CLN: remove unused args/kwargs (#36129)
jbrockmendel Sep 5, 2020
d0de0c6
BUG: Fix DataFrame.groupby().apply() for NaN groups with dropna=False…
cwkwong Sep 5, 2020
c40bf02
Bug 29764 groupby loses index name sometimes (#36121)
phofl Sep 5, 2020
6f45304
STY: add code check for use of builtin filter function (#36089)
simonjayhawkins Sep 5, 2020
3967131
BUG: df.replace with numeric values and str to_replace (#36093)
jbrockmendel Sep 5, 2020
c43652e
CLN: resolve UserWarning in `pandas/plotting/_matplotlib/core.py` #35…
fangchenli Sep 5, 2020
9fea06c
add note about missing values to Categorical docstring (#36125)
arw2019 Sep 5, 2020
70c056b
CLN removing trailing commas (#36101)
Sep 5, 2020
c4f0c6f
Updated series documentation to close #35406 (#36139)
Sep 5, 2020
6fa1a45
BUG: repair 'style' kwd handling in DataFrame.plot (#21003) (#33821)
joooeey Sep 5, 2020
45cec12
BUG/ENH: to_pickle/read_pickle support compression for file ojects (#…
twoertwein Sep 5, 2020
16544c9
TYP: Check for use of Union[Series, DataFrame] instead of FrameOrSeri…
simonjayhawkins Sep 5, 2020
bdb6e26
TYP: remove string literals for type annotations in pandas\core\frame…
simonjayhawkins Sep 5, 2020
6e28008
STY+CI: check for private function access across modules (#36144)
jbrockmendel Sep 5, 2020
42289d0
CLN: unused case in compare_or_regex_search (#36143)
jbrockmendel Sep 5, 2020
cc54943
REF: window/test_dtypes.py with pytest idioms (#35918)
mroeschke Sep 5, 2020
8c3ad64
DOC: add userwarning doc about mpl #35684 (#36145)
fangchenli Sep 5, 2020
cf1aa9e
BUG: item_cache invalidation in get_numeric_data (#35882)
jbrockmendel Sep 5, 2020
ce6882f
Make MultiIndex.get_loc raise for unhashable type (#35914)
dsaxton Sep 5, 2020
cda2f54
ENH: Make explode work for sets (#35637)
dsaxton Sep 5, 2020
c688a0f
BUG: Don't raise when constructing Series from ordered set (#36054)
dsaxton Sep 5, 2020
337f3ca
REGR: append tz-aware DataFrame with tz-naive values (#36115)
jorisvandenbossche Sep 6, 2020
9228066
BUG: Respect errors="ignore" during extension astype (#35979)
dsaxton Sep 6, 2020
a7e39b8
De-privatize imported names (#36156)
jbrockmendel Sep 6, 2020
2b72203
REF: share more EA methods (#36154)
jbrockmendel Sep 6, 2020
9083281
CLN: Separate transform tests (#36146)
rhshadrach Sep 6, 2020
b431e9a
CLN: _wrap_applied_output (#36160)
rhshadrach Sep 6, 2020
ba552ec
BUG: allow missing values in Index when calling Index.sort_values (#3…
AlexKirko Sep 6, 2020
aca77f7
BUG: extra leading space in to_string when index=False (#36094)
onshek Sep 6, 2020
b9a9769
BUG: shows correct package name when import_optional_dependency is ca…
hs2361 Sep 7, 2020
773f64d
REF: simplify latex formatting (#35872)
ivanovmg Sep 7, 2020
2a5e3cc
Comma cleanup (#36168)
JonathanShrek Sep 7, 2020
b2fc5ac
TST: test_datetime64_factorize on 32bit (#36192)
simonjayhawkins Sep 7, 2020
dd5d945
TST: update test_series_factorize_na_sentinel_none for 32bit (#36191)
simonjayhawkins Sep 7, 2020
12d3453
DOC: move release note for #36155 (#36187)
simonjayhawkins Sep 7, 2020
7584e59
REF: use _validate_foo pattern in Categorical (#36181)
jbrockmendel Sep 7, 2020
971c60e
DTA/TDA/PA use self._data instead of self.asi8 for self._ndarray (#36…
jbrockmendel Sep 7, 2020
b37c9f8
TST verify groupby doesn't alter unit64s to floats #30859 (#36164)
Sep 7, 2020
ce0476f
Fix compressed multiindex for output of groupby.rolling (#36152)
phofl Sep 7, 2020
472e846
TST: DataFrame.replace: TypeError: Cannot compare types 'ndarray(dtyp…
phofl Sep 7, 2020
c1d7bbd
REF: collect methods by topic (#36173)
jbrockmendel Sep 7, 2020
405b5c5
REF: implement Categorical._validate_setitem_value (#36180)
jbrockmendel Sep 7, 2020
592126c
COMPAT: match numpy behavior for searchsorted on dt64/td64 (#36176)
jbrockmendel Sep 8, 2020
19f0a9f
pandas docs json_normalize example (#36194)
nzare Sep 8, 2020
fc75d9c
BUG: GroupbyRolling with an empty frame (#36208)
mroeschke Sep 8, 2020
d7fe22f
DOC: doc fix (#36205)
simonjayhawkins Sep 8, 2020
81394d3
DOC: release date for 1.1.2 (#36182)
simonjayhawkins Sep 8, 2020
6625b89
Fixed pandas.json_normalize doctests errors` (#36207)
ylin00 Sep 8, 2020
6d340a9
BUG: copying series into empty dataframe does not preserve dataframe …
Dr-Irv Sep 8, 2020
903ca22
CLN remove trailing commas (#36222)
tiagohonorato Sep 8, 2020
f4458f2
CLN: remove unused return value in _create_blocks (#36196)
jbrockmendel Sep 8, 2020
8c7efd1
Make to_numeric default to correct precision (#36149)
Dr-Irv Sep 8, 2020
490a999
REF: implement Categorical._box_func, make _box_func a method (#36206)
jbrockmendel Sep 8, 2020
d26f99a
STY: de-privatize names imported across modules (#36178)
jbrockmendel Sep 8, 2020
ab342c6
DOC: Start 1.1.3 (#36183)
simonjayhawkins Sep 8, 2020
7d16224
CLN: re-use invalid_comparison in Categorical comparisons (#36229)
jbrockmendel Sep 8, 2020
9e3b8df
CLN: w3 formatting (#36223)
attack68 Sep 8, 2020
b1d3897
REF: pass setitem to unbox_scalar to de-duplicate validation (#36234)
jbrockmendel Sep 9, 2020
b5a5268
STY: De-privatize imported names (#36235)
jbrockmendel Sep 9, 2020
44e933a
REF: simplify CSVFormatter (#36046)
ivanovmg Sep 9, 2020
faae2f0
CLN: remove unnecessary trailing commas on issues #35925 (#36193)
satrio-hw Sep 9, 2020
1003a5c
Fixed a broken JSON Table Schema link (#36246)
danchev Sep 9, 2020
6aa311d
DOC: Improve Index docstrings (#36239)
Nikhil1O1 Sep 9, 2020
03c7040
DOC: Rst Formatting, make sure continuation prompt are used. (#35317)
Carreau Sep 10, 2020
a09259b
BUG: DataFrame.any with axis=1 and bool_only=True (#36106)
jbrockmendel Sep 11, 2020
6c8b923
BUG: instantiation using a dict with a period scalar (#35966)
justinessert Sep 11, 2020
f8d5fba
REF: share more EA methods (#36209)
jbrockmendel Sep 11, 2020
49b342b
CLN: simplify Categorical comparisons (#36237)
jbrockmendel Sep 11, 2020
ddf2f05
DOC: Update groupby.rst (#36238)
Nikhil1O1 Sep 11, 2020
b2dda5a
ENH add na_action to DataFrame.applymap (#35704)
jnothman Sep 11, 2020
dc1c849
CI: xfail failing parquet test (#36272)
jbrockmendel Sep 11, 2020
da82aef
de-privatize (#36259)
jbrockmendel Sep 11, 2020
5450233
Update deprecation warnings, which were already removed (#36292)
phofl Sep 11, 2020
2067d7e
CLN: typo cleanups (#36276)
jbrockmendel Sep 12, 2020
15fd0e7
REF: de-duplicate _wrap_joined_index (#36282)
jbrockmendel Sep 12, 2020
21fe972
REF: de-duplicate sort_values (#36301)
jbrockmendel Sep 12, 2020
2da7c34
PERF: get_dtype_kinds (#36309)
jbrockmendel Sep 12, 2020
4dc5887
CLN: pandas/io/parsers.py (#36269)
ivanovmg Sep 12, 2020
06b3f5d
Resample fix dst transition (#36264)
Flix6x Sep 12, 2020
39c5e29
CLN: _wrap_applied_output (#36260)
rhshadrach Sep 12, 2020
7e0bf1c
REF: implement Categorical._validate_listlike (#36274)
jbrockmendel Sep 12, 2020
3a6aedc
CLN: simplify Categorical comparisons (#36250)
jbrockmendel Sep 12, 2020
65f78c7
searchsorted numpy compat for Period dtype (#36254)
jbrockmendel Sep 12, 2020
a9f8d3c
BUG: na parameter for str.startswith and str.endswith not propagating…
asishm Sep 12, 2020
cb58dbb
PERF: JoinUnit.is_na (#36312)
jbrockmendel Sep 12, 2020
6100425
PERF: creating string Series/Arrays from sequence with many strings (…
topper-123 Sep 12, 2020
c6e3af7
TST: add test case for sort_index on multiindexed Frame with sparse c…
ylin00 Sep 12, 2020
c104622
REF: use BlockManager.apply in csv code (#36150)
jbrockmendel Sep 12, 2020
4729d8f
STY/WIP: check for private imports/lookups (#36055)
jbrockmendel Sep 12, 2020
bed9656
DOC: Fix DataFrame.query contradiction on use of Python keywords as i…
ezebunandu Sep 12, 2020
ab5b38d
BUG/CLN: Decouple Series/DataFrame.transform (#35964)
rhshadrach Sep 12, 2020
b8f22ad
DEPR: Deprecate pandas/io/date_converters.py (#35741)
avinashpancham Sep 12, 2020
822dc6f
REGR: Series access with Index of tuples/frozenset (#36147)
rhshadrach Sep 12, 2020
229722e
ENH: consistently cast strings for DTA/TDA/PA.__setitem__ (#36261)
jbrockmendel Sep 12, 2020
e47d5eb
CI: install numpy from pip #36296 (#36323)
fangchenli Sep 13, 2020
5647251
REF: _convert_for_op -> _validate_fill_value (#36318)
jbrockmendel Sep 13, 2020
28aab65
REF: separate out helpers from iLoc._setitem_with_indexer (#36315)
jbrockmendel Sep 13, 2020
1478291
PERF: CategoricalDtype.__eq__ (#36280)
jbrockmendel Sep 13, 2020
23e28df
REF: de-duplicate _wrap_joined_index in MultiIndex (#36313)
jbrockmendel Sep 13, 2020
22374c3
BUG: Don't overflow with large int scalar (#36316)
dsaxton Sep 13, 2020
2d95908
PERF: constructing string Series (#36317)
topper-123 Sep 13, 2020
7a7b053
REF: de-duplicate get_indexer_non_unique (#36322)
jbrockmendel Sep 13, 2020
73cdfc4
REGR: Fix IntegerArray unary ops regression (#36303)
dsaxton Sep 13, 2020
70d0dd0
ENH: add set_td_classes method for CSS class addition to data cells (…
attack68 Sep 13, 2020
65074db
PERF: Allow groupby transform with numba engine to be fully paralleli…
mroeschke Sep 13, 2020
068d1b5
BUG: GH36212 DataFrame agg() raises error when DataFrame column name …
leonarduschen Sep 13, 2020
752cd42
BUG: Fixe unintentionally added suffix in DataFrame.apply/agg and Ser…
charlesdong1991 Sep 13, 2020
3007baf
CLN: remove CategoricalIndex._create_from_codes (#36342)
jbrockmendel Sep 13, 2020
fef1830
REF: move ShallowMixin to groupby.base (#36341)
jbrockmendel Sep 13, 2020
4859be9
DOC/CLN: remove versionadded/changed:: 0.23 (#36338)
simonjayhawkins Sep 13, 2020
aa6298f
ERR: Cartesian product error (#36335)
Sep 13, 2020
00353d5
Pd.series.map performance (#34948)
Rohith295 Sep 13, 2020
69ff179
BUG: Ensure read_spss accepts pathlib Paths (GH33666) (#36174)
drmrd Sep 13, 2020
b8448b2
BUG: iloc.__setitem__ with DataFrame value, multiple blocks, non-uniq…
jbrockmendel Sep 13, 2020
4c4db19
BUG: xticks unnecessarily rotated (#34334)
MarcoGorelli Sep 13, 2020
fc61aa9
[TST]: Groupy raised ValueError for ffill with duplicate column names…
phofl Sep 13, 2020
37b2694
CLN: replaced Appender with doc (#33633)
smartvinnetou Sep 13, 2020
5ad15f8
Ensure resource closure in all exceptional circumstances during const…
rxxg Sep 13, 2020
a3c4dc8
Change default of float_precision for read_csv and read_table to "hig…
Dr-Irv Sep 13, 2020
88bc2e4
BUG: read_excel for ods files raising UnboundLocalError in certain ca…
asishm Sep 13, 2020
e67220d
DOC: add type BinaryIO to path param #35505 (#35568)
alexhtn Sep 13, 2020
1f49b76
DOC: update DataFrame.to_feather docstring (#35408)
arw2019 Sep 13, 2020
1b2f1f4
Concatenating rows with Int64 datatype coerces to object
phofl Sep 13, 2020
8df0218
REF: use check_setitem_lengths in DTA.__setitem__ (#36339)
jbrockmendel Sep 14, 2020
fd20f7d
DOC: Added docstring for storage_options for read_csv GH36361 (#36364)
Sep 14, 2020
9189487
REF: _unbox_scalar, _unbox_listlike for Categorical (#36362)
jbrockmendel Sep 15, 2020
947c8f2
REF: _assert_can_do_op -> _validate_scalar (#36367)
jbrockmendel Sep 15, 2020
74c352c
REF: share code for __setitem__ (#36366)
jbrockmendel Sep 15, 2020
ce7b4c0
CI: Add stale PR action (#36336)
dsaxton Sep 15, 2020
3833dc5
BUG: add py39 compat check for ast.slice #32766 (#36080)
fangchenli Sep 15, 2020
6929e26
Move sort index to generic (#36177)
fangchenli Sep 15, 2020
078f88e
[BUG]: Implement Kahan summation for rolling().mean() to avoid numeri…
phofl Sep 15, 2020
8ea00fb
DOC: Example for natural sort using key argument (#36356)
erfannariman Sep 15, 2020
0bfede2
[TST]: Groupby raised error with duplicate column names (#36389)
phofl Sep 15, 2020
b53fa14
BUG: Fix MultiIndex column stacking with dupe names (#36371)
dsaxton Sep 15, 2020
51558a0
DOC: Add dataframe_sql to eco system page (#36370)
zbrookle Sep 15, 2020
0d4a1c1
CLN: Clean test_arithmetic.py (#36390)
dsaxton Sep 15, 2020
11d5fc9
BLD/CI: fix py39 ci #36296 (#36393)
fangchenli Sep 16, 2020
1b6879a
CLN: Numba internal routines (#36376)
mroeschke Sep 16, 2020
98e4a2b
DOC: move release note for #36175 (pt1) (#36378)
simonjayhawkins Sep 16, 2020
15285e7
DOC: move release note for #36175 (pt2) (#36379)
simonjayhawkins Sep 16, 2020
876f040
remove trailing commas for black update (#36399)
Sep 16, 2020
70d618c
BUG: Always cast to Categorical in lexsort_indexer (#36385)
dsaxton Sep 17, 2020
1a3a2c1
DEPR: DataFrame.lookup (#35224)
erfannariman Sep 17, 2020
b162caf
ENH/BUG: consistently cast strs to datetimelike for searchsorted (#36…
jbrockmendel Sep 17, 2020
3a15e47
Bump flake8 version in pre-commit-config.yaml (#36412)
dsaxton Sep 17, 2020
f36437f
REF: re-use validate_listlike for _convert_arr_indexer (#36415)
jbrockmendel Sep 17, 2020
6e3f698
REF: use BlockManager.to_native_types in formatting code (#36417)
jorisvandenbossche Sep 17, 2020
be6908e
REF: re-use _maybe_promote for _is_convertible_to_index_for_join (#36…
jbrockmendel Sep 17, 2020
cd26fe2
REF: _validate_foo pattern for IntervalArray (#36414)
jbrockmendel Sep 17, 2020
81d24c7
Update isort version in pre-commit config (#36428)
clbarnes Sep 17, 2020
5eb1add
CLN: Clean series/test_arithmetic.py (#36406)
dsaxton Sep 17, 2020
a76f701
BUG: Categorical.sort_values inplace breaking views (#36404)
jbrockmendel Sep 17, 2020
d4947a9
TYP: alias IndexLabel without Optional (#36401)
ivanovmg Sep 17, 2020
52c81a9
REF: implement putmask for CI/DTI/TDI/PI (#36400)
jbrockmendel Sep 17, 2020
28068da
REF: share __getitem__ for Categorical/PandasArray/DTA/TDA/PA (#36391)
jbrockmendel Sep 17, 2020
042515d
CLN: remove unnecessary _convert_index_indexer (#36394)
jbrockmendel Sep 17, 2020
b4d0ae5
PERF: StringArray construction (#36325)
topper-123 Sep 17, 2020
5481e6c
Fix typo in docstring 'handler' --> 'handle' (#36427)
y2kbugger Sep 17, 2020
6537ad8
ADMIN: Update stale PR action (#36382)
dsaxton Sep 17, 2020
a607bd7
Fix documentation for new float_precision on read_csv (#36358)
Dr-Irv Sep 17, 2020
970517e
BLD/CI fix arm64 build #36397 (#36403)
fangchenli Sep 17, 2020
234f5ac
BUG: FooIndex.insert casting datetimelike NAs incorrectly (#36374)
jbrockmendel Sep 17, 2020
d6678d1
REF: de-duplicate IntervalIndex compat code (#36372)
jbrockmendel Sep 17, 2020
87974c0
remove trailing comma (#36441)
lacrosse91 Sep 18, 2020
1b895ef
DOC: read_excel skiprows documentation matches read_csv (#36435) (#36…
ahgamut Sep 18, 2020
46ad4b7
CLN: 35925 rm trailing commas (#36446)
lacrosse91 Sep 18, 2020
1f89b64
REF: collect IntervalArray methods by topic (#36438)
jbrockmendel Sep 18, 2020
a0e0571
REF: share insert between DTI/TDI/PI (#36439)
jbrockmendel Sep 18, 2020
a44a704
CLN Upgrade pandas/core syntax (#36453)
MarcoGorelli Sep 18, 2020
ec93a02
CI: fix gbq test #36436 (#36443)
fangchenli Sep 18, 2020
dea9ff3
REGR: Series[numeric] comparison with str raising on numexpr path (#3…
jbrockmendel Sep 18, 2020
b03179c
[DOC]: Add warning about rolling sums with large values (#36433)
phofl Sep 18, 2020
d2e958b
CI: Auto-label PRs for review (#36349)
dsaxton Sep 18, 2020
56eb167
REF: _is_compatible_with_other -> _can_union_without_object_cast (#36…
jbrockmendel Sep 18, 2020
4caafd4
CLN: Update files (as per #36450) to Python 3.7+ syntax (#36457)
ahgamut Sep 18, 2020
51b0165
CI: Revert PR template (#36460)
dsaxton Sep 18, 2020
20ee6d2
BUG: Concat typing (#36409)
rhshadrach Sep 19, 2020
353ce7e
REF: MultiIndex._validate_insert_value, IntervaArray._validate_setite…
jbrockmendel Sep 19, 2020
3e1cc56
DEPR: Index.to_native_types (#36418)
jorisvandenbossche Sep 19, 2020
aed64e8
BUG: fix isin with nans and large arrays (#36266)
Hanspagh Sep 19, 2020
5b04079
Remove unnecessary trailing commas (#36463)
sm1899 Sep 19, 2020
ff11c05
PERF: styler uuid control and security (#36345)
attack68 Sep 19, 2020
80f0a74
Align cython and python reduction code paths (#36459)
jbrockmendel Sep 19, 2020
b3e2c6c
Turn on stale GitHub action (#36476)
dsaxton Sep 19, 2020
605efc6
PERF: construct DataFrame with string array and dtype=str (#36432)
topper-123 Sep 19, 2020
51ffcdb
ASV: added benchamark tests for DataFrame.to_numpy() and .values (#36…
hardikpnsp Sep 19, 2020
54f23e8
BUG: get_indexer returned dtype (#36431)
alexhlim Sep 19, 2020
4236c86
Use https for network checks (#36480)
alimcmaster1 Sep 19, 2020
a90d559
BUG: Python Parser skipping over items if BOM present in first elemen…
asishm Sep 19, 2020
c1484b1
PERF: pd.to_datetime, unit='s' much slower for float64 than for int64…
arw2019 Sep 19, 2020
a22cf43
BUG: Enable Series.equals to compare numpy arrays to scalars (#36161)
avinashpancham Sep 19, 2020
00a510b
[BUG]: Rolling.sum() calculated wrong values when axis is one and dty…
phofl Sep 19, 2020
475a980
BUG: fix duplicate entries in LaTeX List of Tables when using longtab…
jeschwar Sep 19, 2020
2705dd6
REF: pandas/io/formats/format.py (#36434)
ivanovmg Sep 19, 2020
c33c3c0
TST: #31922 assert no segmentation fault with numpy.array.__contains_…
ylin00 Sep 19, 2020
d38dc06
REF: de-duplicate IntervalArray._validate_foo (#36483)
jbrockmendel Sep 19, 2020
7e13b19
TYP: core.missing; PERF for needs_i8_conversion (#36485)
jbrockmendel Sep 20, 2020
15539fa
Don't unlabel stale PR on update (#36487)
dsaxton Sep 20, 2020
a0d6d06
BUG: Fix astype from float32 to string (#36464)
dsaxton Sep 21, 2020
5a75b1d
CI: troubleshoot segfault (#36511)
jbrockmendel Sep 21, 2020
d39a30d
merge conflicts
erfannariman Sep 21, 2020
01d778a
merge conflicts
erfannariman Sep 21, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
4 changes: 3 additions & 1 deletion .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,9 @@ on:
push:
branches: master
pull_request:
branches: master
branches:
- master
- 1.1.x

env:
ENV_FILE: environment.yml
Expand Down
21 changes: 21 additions & 0 deletions .github/workflows/stale-pr.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
name: "Stale PRs"
on:
schedule:
# * is a special character in YAML so you have to quote this string
- cron: "0 */6 * * *"

jobs:
stale:
runs-on: ubuntu-latest
steps:
- uses: actions/stale@v3
with:
repo-token: ${{ secrets.GITHUB_TOKEN }}
stale-pr-message: "This pull request is stale because it has been open for thirty days with no activity."
skip-stale-pr-message: true
stale-pr-label: "Stale"
exempt-pr-labels: "Needs Review,Blocked,Needs Discussion"
days-before-stale: 30
days-before-close: -1
remove-stale-when-updated: false
debug-only: false
16 changes: 2 additions & 14 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ repos:
- id: black
language_version: python3
- repo: https://gitlab.com/pycqa/flake8
rev: 3.7.7
rev: 3.8.3
hooks:
- id: flake8
language: python_venv
Expand All @@ -25,20 +25,8 @@ repos:
- file
args: [--append-config=flake8/cython-template.cfg]
- repo: https://github.com/pre-commit/mirrors-isort
rev: v4.3.21
rev: v5.2.2
hooks:
- id: isort
language: python_venv
exclude: ^pandas/__init__\.py$|^pandas/core/api\.py$
- repo: https://github.com/pre-commit/mirrors-mypy
rev: v0.730
hooks:
- id: mypy
args:
# As long as a some files are excluded from check-untyped-defs
# we have to exclude it from the pre-commit hook as the configuration
# is based on modules but the hook runs on files.
- --no-check-untyped-defs
- --follow-imports
- skip
files: pandas/
19 changes: 3 additions & 16 deletions .travis.yml
Original file line number Diff line number Diff line change
Expand Up @@ -42,10 +42,10 @@ matrix:

- arch: arm64
env:
- JOB="3.7, arm64" PYTEST_WORKERS=8 ENV_FILE="ci/deps/travis-37-arm64.yaml" PATTERN="(not slow and not network and not clipboard)"
- JOB="3.7, arm64" PYTEST_WORKERS=8 ENV_FILE="ci/deps/travis-37-arm64.yaml" PATTERN="(not slow and not network and not clipboard and not arm_slow)"

- env:
- JOB="3.6, locale" ENV_FILE="ci/deps/travis-36-locale.yaml" PATTERN="((not slow and not network and not clipboard) or (single and db))" LOCALE_OVERRIDE="zh_CN.UTF-8" SQL="1"
- JOB="3.7, locale" ENV_FILE="ci/deps/travis-37-locale.yaml" PATTERN="((not slow and not network and not clipboard) or (single and db))" LOCALE_OVERRIDE="zh_CN.UTF-8" SQL="1"
services:
- mysql
- postgresql
Expand All @@ -54,24 +54,11 @@ matrix:
# Enabling Deprecations when running tests
# PANDAS_TESTING_MODE="deprecate" causes DeprecationWarning messages to be displayed in the logs
# See pandas/_testing.py for more details.
- JOB="3.6, coverage" ENV_FILE="ci/deps/travis-36-cov.yaml" PATTERN="((not slow and not network and not clipboard) or (single and db))" PANDAS_TESTING_MODE="deprecate" COVERAGE=true SQL="1"
- JOB="3.7, coverage" ENV_FILE="ci/deps/travis-37-cov.yaml" PATTERN="((not slow and not network and not clipboard) or (single and db))" PANDAS_TESTING_MODE="deprecate" COVERAGE=true SQL="1"
services:
- mysql
- postgresql

- env:
- JOB="3.6, slow" ENV_FILE="ci/deps/travis-36-slow.yaml" PATTERN="slow" SQL="1"
services:
- mysql
- postgresql
allow_failures:
- arch: arm64
env:
- JOB="3.7, arm64" PYTEST_WORKERS=8 ENV_FILE="ci/deps/travis-37-arm64.yaml" PATTERN="(not slow and not network and not clipboard)"
- dist: bionic
env:
- JOB="3.9-dev" PATTERN="(not slow and not network and not clipboard)"


before_install:
- echo "before_install"
Expand Down
4 changes: 4 additions & 0 deletions LICENSES/XARRAY_LICENSE
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
Copyright 2014-2019, xarray Developers

--------------------------------------------------------------------------------

Apache License
Version 2.0, January 2004
http://www.apache.org/licenses/
Expand Down
13 changes: 13 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -25,3 +25,16 @@ doc:
cd doc; \
python make.py clean; \
python make.py html

check:
python3 scripts/validate_unwanted_patterns.py \
--validation-type="private_function_across_module" \
--included-file-extensions="py" \
--excluded-file-paths=pandas/tests,asv_bench/,pandas/_vendored \
pandas/

python3 scripts/validate_unwanted_patterns.py \
--validation-type="private_import_across_module" \
--included-file-extensions="py" \
--excluded-file-paths=pandas/tests,asv_bench/,pandas/_vendored,doc/
pandas/
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ its way towards this goal.
Here are just a few of the things that pandas does well:

- Easy handling of [**missing data**][missing-data] (represented as
`NaN`) in floating point as well as non-floating point data
`NaN`, `NA`, or `NaT`) in floating point as well as non-floating point data
- Size mutability: columns can be [**inserted and
deleted**][insertion-deletion] from DataFrame and higher dimensional
objects
Expand Down
5 changes: 3 additions & 2 deletions asv_bench/asv.conf.json
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@
// The Pythons you'd like to test against. If not provided, defaults
// to the current version of Python used to run `asv`.
// "pythons": ["2.7", "3.4"],
"pythons": ["3.6"],
"pythons": ["3.8"],

// The matrix of dependencies to test. Each key is the name of a
// package (in PyPI) and the values are version numbers. An empty
Expand All @@ -39,7 +39,7 @@
// followed by the pip installed packages).
"matrix": {
"numpy": [],
"Cython": ["0.29.16"],
"Cython": ["0.29.21"],
"matplotlib": [],
"sqlalchemy": [],
"scipy": [],
Expand All @@ -53,6 +53,7 @@
"xlwt": [],
"odfpy": [],
"pytest": [],
"jinja2": [],
// If using Windows with python 2.7 and want to build using the
// mingw toolchain (rather than MSVC), uncomment the following line.
// "libpython": [],
Expand Down
2 changes: 1 addition & 1 deletion asv_bench/benchmarks/frame_ctor.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
from .pandas_vb_common import tm

try:
from pandas.tseries.offsets import Nano, Hour
from pandas.tseries.offsets import Hour, Nano
except ImportError:
# For compatibility with older versions
from pandas.core.datetools import * # noqa
Expand Down
40 changes: 40 additions & 0 deletions asv_bench/benchmarks/frame_methods.py
Original file line number Diff line number Diff line change
Expand Up @@ -219,6 +219,46 @@ def time_to_html_mixed(self):
self.df2.to_html()


class ToNumpy:
def setup(self):
N = 10000
M = 10
self.df_tall = DataFrame(np.random.randn(N, M))
self.df_wide = DataFrame(np.random.randn(M, N))
self.df_mixed_tall = self.df_tall.copy()
self.df_mixed_tall["foo"] = "bar"
self.df_mixed_tall[0] = period_range("2000", periods=N)
self.df_mixed_tall[1] = range(N)
self.df_mixed_wide = self.df_wide.copy()
self.df_mixed_wide["foo"] = "bar"
self.df_mixed_wide[0] = period_range("2000", periods=M)
self.df_mixed_wide[1] = range(M)

def time_to_numpy_tall(self):
self.df_tall.to_numpy()

def time_to_numpy_wide(self):
self.df_wide.to_numpy()

def time_to_numpy_mixed_tall(self):
self.df_mixed_tall.to_numpy()

def time_to_numpy_mixed_wide(self):
self.df_mixed_wide.to_numpy()

def time_values_tall(self):
self.df_tall.values

def time_values_wide(self):
self.df_wide.values

def time_values_mixed_tall(self):
self.df_mixed_tall.values

def time_values_mixed_wide(self):
self.df_mixed_wide.values


class Repr:
def setup(self):
nrows = 10000
Expand Down
8 changes: 4 additions & 4 deletions asv_bench/benchmarks/gil.py
Original file line number Diff line number Diff line change
Expand Up @@ -7,14 +7,14 @@

try:
from pandas import (
rolling_median,
rolling_kurt,
rolling_max,
rolling_mean,
rolling_median,
rolling_min,
rolling_max,
rolling_var,
rolling_skew,
rolling_kurt,
rolling_std,
rolling_var,
)

have_rolling_methods = True
Expand Down
46 changes: 32 additions & 14 deletions asv_bench/benchmarks/groupby.py
Original file line number Diff line number Diff line change
Expand Up @@ -627,49 +627,63 @@ def time_first(self):


class TransformEngine:
def setup(self):

param_names = ["parallel"]
params = [[True, False]]

def setup(self, parallel):
N = 10 ** 3
data = DataFrame(
{0: [str(i) for i in range(100)] * N, 1: list(range(100)) * N},
columns=[0, 1],
)
self.parallel = parallel
self.grouper = data.groupby(0)

def time_series_numba(self):
def time_series_numba(self, parallel):
def function(values, index):
return values * 5

self.grouper[1].transform(function, engine="numba")
self.grouper[1].transform(
function, engine="numba", engine_kwargs={"parallel": self.parallel}
)

def time_series_cython(self):
def time_series_cython(self, parallel):
def function(values):
return values * 5

self.grouper[1].transform(function, engine="cython")

def time_dataframe_numba(self):
def time_dataframe_numba(self, parallel):
def function(values, index):
return values * 5

self.grouper.transform(function, engine="numba")
self.grouper.transform(
function, engine="numba", engine_kwargs={"parallel": self.parallel}
)

def time_dataframe_cython(self):
def time_dataframe_cython(self, parallel):
def function(values):
return values * 5

self.grouper.transform(function, engine="cython")


class AggEngine:
def setup(self):

param_names = ["parallel"]
params = [[True, False]]

def setup(self, parallel):
N = 10 ** 3
data = DataFrame(
{0: [str(i) for i in range(100)] * N, 1: list(range(100)) * N},
columns=[0, 1],
)
self.parallel = parallel
self.grouper = data.groupby(0)

def time_series_numba(self):
def time_series_numba(self, parallel):
def function(values, index):
total = 0
for i, value in enumerate(values):
Expand All @@ -679,9 +693,11 @@ def function(values, index):
total += value * 2
return total

self.grouper[1].agg(function, engine="numba")
self.grouper[1].agg(
function, engine="numba", engine_kwargs={"parallel": self.parallel}
)

def time_series_cython(self):
def time_series_cython(self, parallel):
def function(values):
total = 0
for i, value in enumerate(values):
Expand All @@ -693,7 +709,7 @@ def function(values):

self.grouper[1].agg(function, engine="cython")

def time_dataframe_numba(self):
def time_dataframe_numba(self, parallel):
def function(values, index):
total = 0
for i, value in enumerate(values):
Expand All @@ -703,9 +719,11 @@ def function(values, index):
total += value * 2
return total

self.grouper.agg(function, engine="numba")
self.grouper.agg(
function, engine="numba", engine_kwargs={"parallel": self.parallel}
)

def time_dataframe_cython(self):
def time_dataframe_cython(self, parallel):
def function(values):
total = 0
for i, value in enumerate(values):
Expand Down
Loading