Skip to content

DOC: Add Use Modin section on Scaling to large datasets page #57585

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
1 task done
YarShev opened this issue Feb 23, 2024 · 0 comments · Fixed by #57586
Closed
1 task done

DOC: Add Use Modin section on Scaling to large datasets page #57585

YarShev opened this issue Feb 23, 2024 · 0 comments · Fixed by #57586
Labels

Comments

@YarShev
Copy link
Contributor

YarShev commented Feb 23, 2024

Pandas version checks

  • I have checked that the issue still exists on the latest versions of the docs on main here

Location of the documentation

https://pandas.pydata.org/docs/user_guide/scale.html

Documentation problem

While there are different libraries to scale large datasets, the only option shown on scale.rst page is Dask. It could be useful and valuable for users to get more options out of the page. One of the options could be Modin, which has a drop-in replacement API for pandas. It would be nice to show how Modin can speed up pandas operations, as well as to be used along with pandas in a single flow to get ultimate performance and reduce users's waiting time.

Suggested fix for documentation

Add a section about Modin.

@YarShev YarShev added Docs Needs Triage Issue that has not been reviewed by a pandas team member labels Feb 23, 2024
YarShev added a commit to YarShev/pandas that referenced this issue Feb 23, 2024
YarShev added a commit to YarShev/pandas that referenced this issue Feb 23, 2024
@rhshadrach rhshadrach removed the Needs Triage Issue that has not been reviewed by a pandas team member label Feb 25, 2024
mroeschke pushed a commit that referenced this issue Mar 7, 2024
…ge (#57586)

* DOC-#57585: Add `Use Modin` section on `Scaling to large datasets` page

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Revert some changes

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

---------

Signed-off-by: Igoshev, Iaroslav <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Marc Garcia <[email protected]>
pmhatre1 pushed a commit to pmhatre1/pandas-pmhatre1 that referenced this issue May 7, 2024
…tasets` page (pandas-dev#57586)

* DOC-pandas-dev#57585: Add `Use Modin` section on `Scaling to large datasets` page

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Revert some changes

Signed-off-by: Igoshev, Iaroslav <[email protected]>

* Address comments

Signed-off-by: Igoshev, Iaroslav <[email protected]>

---------

Signed-off-by: Igoshev, Iaroslav <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Marc Garcia <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants