-
Notifications
You must be signed in to change notification settings - Fork 159
Bug in NodeExpandVolume #433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
/assign @davidz627 |
The test failing is: Looks like every test requests the resize to a size This looks like some sort of error either in |
I've splunked through the |
The test runs in OSS K8s. You can run the standard kubetest with focus on it. |
|
I thought the failure was on node side. Do we know why node requested such a large number? If you let the test run in a loop with a focus, I think we should be able to reproduce this bug. |
The failure is on the node side but I was investigating to see if maybe the context cancel was causing something weird to happen with the size that gets passed along to node calls (since it goes through updating the PVC fields). Couldn't see anything obviously wrong there so I'll do some runs tomorrow to see if I can repro |
Ok I can repro the issue consistently by setting
Here's a particularly interesting part (after context timing out a couple times):
Looks like the driver is responding with an incorrect size |
Another interesting observation is that the PD driver is expanding the disk on cloud to the correct size |
Ah. I got it. If we resized successfully (but timeout the context) then resize again, we fall into this if statement: gcp-compute-persistent-disk-csi-driver/pkg/gce-cloud-provider/compute/gce-compute.go Line 634 in dd14907
Which returns the request size in bytes, not gb |
Is this fixed. Looks to be failing still on - https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/88488/pull-kubernetes-e2e-gce-csi-serial/1234883797321781250/
|
I am tentatively reopening this. If it turns out I am wrong or missed something, we can close it. /reopen |
@gnufied: Reopened this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Released a new version of the driver v0.7.0 with the fix. It needs to be pulled in to k8s |
upstream specs in k/k e2e have been updated |
@msau42: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
It looks like we may have a bug in CSI
NodeExpandVolume
implementation:Failure logs - https://storage.googleapis.com/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-cos-k8sbeta-serial/1198911611914424322/build-log.txt
cc @msau42
The text was updated successfully, but these errors were encountered: