-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Opening old workspaces stuck on pulling container image #8198
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@adg25 I think it would be helpful for our |
In classic bug ticket fashion now all of my workspaces are unaffected by this issue! But another member of my team is experiencing it, his workspace is https://belvederetradin-tribe-y684d5w18ri.ws-us31.gitpod.io/ Edit: 1:14pm it's currently happening to https://belvederetradin-tribe-tke67qfj000.ws-us31.gitpod.io/ |
I tried opening a workspace I was editing yesterday and I got this error message: Timed Out Workspace is https://belvederetradin-tribe-3rrw2k0grv6.ws-us32.gitpod.io/ |
@axonasif Any update on this? |
Might be one for @kylos101 - copying him in. |
Thanks! We may have some idea of what's going on here actually. We are using Bazel as the build system for our mono repo and modified the bazelrc so that
Which means that the bazel build cache lives in /workspace instead of the default /home directory. We did this because the cache would take up more than 5Gi of space and crash all of our workspaces, and a recommendation from the gitpod discord was to utilize the 30Gb we get from /workspace. However we think that when workspaces stop with a built cache the images are too large to be pulled again (and subsequently time-out. I've confirmed that if I execute a
|
Thanks for the heads up @adg25 ! How large of a |
@kylos101 Certainly the 12G there is a problem to re-pull the container image after it's stopped - and the 3.5G is good enough (less than a minute usually). I don't really have a metric for "at X size we start having these issues", but there's rarely an in-between for devs anyway since usually we either operate with a full bazel cache or an empty one. |
Thanks for the heads up, @adg25 ! I've scheduled this for our team to do some research. Have a nice weekend! Assuming we can recreate the problem, that should give us some better insight. While I'm thinking about it, if you're able, could you share a simple repo where you're able to reproduce the problem? It's okay to say no, just figured I would ask. |
Thanks @kylos101. Unfortunately I can't share the repo with you since it's private but I can share some relevant metrics. We currently have 165 bazel BUILD files that include java_* cc_* py_* and container_* (library/binary/image/push) modules and in total about 47k lines of code |
Thanks, @adg25 that should help us recreate a similarly sized |
I looked at logs and found interesting data point regarding
Internal link for logs: https://cloudlogging.app.goo.gl/nH3fexYGbsGBcAkj7 |
@sagor999 I don't have access to those logs, but that would make sense as a theory |
@aledbf when we start backup process, can you confirm if we ensure that all user processes in workspace have been stopped? I think this error above happens because some build process is still churning out files while we are trying to backup and cleanup. |
Bug description
Recently (last 2 weeks, maybe more) members of my team have not been able to open previously closed workspaces. Opening new pods with the gitlab browser extension works just fine, but re-opening pods gets stuck on the "Pulling container image" step. I don't think there's anything out of the ordinary with our setup, we have a .gitpod.yml which specifies pulling the latest image from our docker registry hosted in gitlab. Some members of the team don't have issues with this, some do, and some are on and off.
Steps to reproduce
Open a new gitpod workspace from a gitlab merge request, close the workspace and try to open it again. Gets hung on the "Pulling container image step"
Workspace affected
All workspaces
Expected behavior
I expect the workspace to pull the container image and start properly
Example repository
.gitpod.yml file content for image:
image: registry.gitlab.com/<my_company>/path/to/hosted_images:latest
Anything else?
No response
The text was updated successfully, but these errors were encountered: