DOC: Add Use Modin section on Scaling to large datasets page #57585

Closed
@YarShev

Description

Pandas version checks

  • I have checked that the issue still exists on the latest versions of the docs on main here

Location of the documentation

https://pandas.pydata.org/docs/user_guide/scale.html

Documentation problem

While several libraries can scale pandas workflows to large datasets, the only one shown on the scale.rst page is Dask. Users would benefit from seeing more options there. One option is Modin, which offers a drop-in replacement API for pandas. It would be nice to show how Modin can speed up pandas operations, and how it can be used alongside pandas in a single workflow to improve performance and reduce users' waiting time.

Suggested fix for documentation

Add a section about Modin.
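
As a rough illustration of the drop-in idea, here is a minimal sketch of what such a section might show. It assumes Modin is installed together with one of its execution engines; the fallback to plain pandas, the helper `total_by_key`, and the sample data are hypothetical and exist only so the snippet runs anywhere. It is not a benchmark and makes no claim about speedup on data this small.

```python
import pandas

try:
    # Modin exposes the pandas API under modin.pandas, so the usual
    # alias can stay the same and the rest of the code is unchanged.
    import modin.pandas as pd

    # Building a tiny frame forces engine start-up, so a missing or
    # misconfigured engine is caught here rather than later on.
    pd.DataFrame({"probe": [0]})
except Exception:
    # Assumption for this sketch only: fall back to plain pandas when
    # Modin (or an execution engine) is unavailable.
    pd = pandas


def total_by_key(frame):
    """Sum the 'value' column per 'key' using only pandas-style calls."""
    return frame.groupby("key")["value"].sum()


# Hypothetical sample data; a real example would load a large dataset.
df = pd.DataFrame({"key": ["a", "b", "a", "b"], "value": [1, 2, 3, 4]})
totals = total_by_key(df)
print(totals.to_dict())
```

The only Modin-specific line is the import; everything after it is ordinary pandas code, which is the point the proposed section could make.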
