Skip to content

ENH/CI: Cache packages installed from conda #42580

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
lithomas1 opened this issue Jul 17, 2021 · 5 comments
Closed

ENH/CI: Cache packages installed from conda #42580

lithomas1 opened this issue Jul 17, 2021 · 5 comments
Labels
CI Continuous Integration Enhancement good first issue Performance Memory or execution speed performance

Comments

@lithomas1
Copy link
Member

Currently, I think we are downloading all the packages every time CI is run. This takes a long time(~5-10 min. on azure) and makes CI flakier, since sometimes we can't connect to the Anaconda servers, and sometimes we timeout on Azure. We should cache the packages downloaded and only check for updates everytime the CI is run. I think we're doing some form of this on GHA, but we should also do it on azure.

cc @fangchenli

@lithomas1 lithomas1 added Enhancement Needs Triage Issue that has not been reviewed by a pandas team member labels Jul 17, 2021
@fangchenli
Copy link
Member

It shouldn't be hard.
https://docs.microsoft.com/en-us/azure/devops/pipelines/release/caching?view=azure-devops#pythonanaconda

I'll try cache for the windows build in #42581

@lithomas1 lithomas1 added CI Continuous Integration Performance Memory or execution speed performance and removed Needs Triage Issue that has not been reviewed by a pandas team member labels Jul 17, 2021
@alimcmaster1
Copy link
Member

It shouldn't be hard.
https://docs.microsoft.com/en-us/azure/devops/pipelines/release/caching?view=azure-devops#pythonanaconda

I'll try cache for the windows build in #42581

Do you still want to work on this? If not I think we can mark as good first issue/contrib welcome as instructions above look comprehensive :)

@fangchenli

@amrutha-mukkatira
Copy link

  1. Export yaml : conda env export > env.yaml
    and then if there is a section of pip: packages, try to replace them all with Conda versions (editing the YAML). If there isn't a pip section, then it might be the conda clean -p thing. Either way, recreate your environment using the YAML

2)conda env create -f env.yaml -n new_env
This should force redownloading packages missing from the cache, as well as switch to non-PyPI versions of packages, assuming you removed all the pip: packages.

I would expect conda-pack to then work on this rebuilt version of the environment.

@Sara-cos
Copy link

Is it still open?

@mroeschke
Copy link
Member

I think we do with now with provision-with-micromamba and don't use azure now so closing this out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Continuous Integration Enhancement good first issue Performance Memory or execution speed performance
Projects
None yet
Development

Successfully merging a pull request may close this issue.

6 participants