Skip to content

Commit aa646d3

Browse files
bjornjorgensendongjoon-hyun
authored andcommitted
[SPARK-45941][PS] Upgrade pandas to version 2.1.3
### What changes were proposed in this pull request? Upgrade pandas from 2.1.2 to 2.1.3 ### Why are the changes needed? Fixed infinite recursion from operations that return a new object on some DataFrame subclasses ([GH 55763](pandas-dev/pandas#55763)) and Fix [read_parquet()](https://pandas.pydata.org/docs/reference/api/pandas.read_parquet.html#pandas.read_parquet) and [read_feather()](https://pandas.pydata.org/docs/reference/api/pandas.read_feather.html#pandas.read_feather) for [CVE-2023-47248](https://www.cve.org/CVERecord?id=CVE-2023-47248) ([GH 55894](pandas-dev/pandas#55894)) [Release notes for 2.1.3](https://pandas.pydata.org/docs/whatsnew/v2.1.3.html) ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Pass GA ### Was this patch authored or co-authored using generative AI tooling? No. Closes #43822 from bjornjorgensen/pandas-2_1_3. Authored-by: Bjørn Jørgensen <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>
1 parent 39fc610 commit aa646d3

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

dev/infra/Dockerfile

+2-2
Original file line numberDiff line numberDiff line change
@@ -84,8 +84,8 @@ RUN Rscript -e "devtools::install_version('roxygen2', version='7.2.0', repos='ht
8484
# See more in SPARK-39735
8585
ENV R_LIBS_SITE "/usr/local/lib/R/site-library:${R_LIBS_SITE}:/usr/lib/R/library"
8686

87-
RUN pypy3 -m pip install numpy 'pandas<=2.1.2' scipy coverage matplotlib
88-
RUN python3.9 -m pip install numpy pyarrow 'pandas<=2.1.2' scipy unittest-xml-reporting plotly>=4.8 'mlflow>=2.3.1' coverage matplotlib openpyxl 'memory-profiler==0.60.0' 'scikit-learn==1.1.*'
87+
RUN pypy3 -m pip install numpy 'pandas<=2.1.3' scipy coverage matplotlib
88+
RUN python3.9 -m pip install numpy pyarrow 'pandas<=2.1.3' scipy unittest-xml-reporting plotly>=4.8 'mlflow>=2.3.1' coverage matplotlib openpyxl 'memory-profiler==0.60.0' 'scikit-learn==1.1.*'
8989

9090
# Add Python deps for Spark Connect.
9191
RUN python3.9 -m pip install 'grpcio>=1.48,<1.57' 'grpcio-status>=1.48,<1.57' 'protobuf==3.20.3' 'googleapis-common-protos==1.56.4'

python/pyspark/pandas/supported_api_gen.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,7 @@ def generate_supported_api(output_rst_file_path: str) -> None:
9898
9999
Write supported APIs documentation.
100100
"""
101-
pandas_latest_version = "2.1.2"
101+
pandas_latest_version = "2.1.3"
102102
if LooseVersion(pd.__version__) != LooseVersion(pandas_latest_version):
103103
msg = (
104104
"Warning: Latest version of pandas (%s) is required to generate the documentation; "

0 commit comments

Comments
 (0)