docker-rootful: Increase inotify limits by default #1179

Open · wants to merge 1 commit into master

Conversation

carlosonunez-vmw

This resolves #1178.

@carlosonunez-vmw carlosonunez-vmw changed the title Increase inotify limits by default docker-rootful: Increase inotify limits by default Nov 18, 2022
@AkihiroSuda
Member

Thanks, but please sign the commit for DCO
https://github.com/apps/dco

(run git commit -a -s --amend, and make sure that the Signed-off-by: NAME <EMAIL> line with your real name is included in the commit message)
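For reference, a typical sequence for fixing this on an already-pushed PR branch (a generic git sketch, not specific to this repo):

git commit -a -s --amend      # re-creates the last commit with a Signed-off-by trailer from your git user.name/user.email
git push --force-with-lease   # replaces the PR branch with the amended commit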

# from crash looping.
echo 'fs.inotify.max_user_watches = 524288' >> /etc/sysctl.conf
echo 'fs.inotify.max_user_instances = 512' >> /etc/sysctl.conf
sysctl --system
Member

Can we replicate this to docker.yaml, podman*.yaml, k8s.yaml, k3s.yaml too?

Author

good idea!
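Once an instance is started from a template carrying this step, the new limits can be verified from the host (the instance name "docker" is an assumption here; substitute your own):

limactl shell docker sysctl fs.inotify.max_user_watches fs.inotify.max_user_instances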

@carlosonunez-vmw
Author

carlosonunez-vmw commented Nov 20, 2022 via email

@afbjorklund
Member

afbjorklund commented Nov 20, 2022

Seems needlessly complicated for the k3s and k8s examples, since they would have VMs as nodes (not containers)?

If I understand correctly, it is only for running containerd-in-docker or containerd-in-podman, as part of "kind".

@AkihiroSuda
Member

This resolves lima-vm#1178 and allows users to create multiple local Kubernetes
clusters through Kind or the Cluster API Docker provider.

Signed-off-by: Carlos Nunez <[email protected]>
@carlosonunez-vmw
Author

✅ Please sign off the commit for DCO: https://github.com/apps/dco
✅ Please squash commits
⚠️ Please consider doing the same for podman*.yaml

I'm not sure if Podman needs this treatment, as it uses crun instead of runc, which handles nested cgroup mounting differently. This would require additional testing.

Can that be a separate pull request, given that this behavior is known for containerd-based engines?

script: |
#!/bin/bash
# Increase inotify limits to prevent nested Kubernetes control planes
# from crash looping.
Member

Is this needed for k3s? If so, it should be needed for k8s.yaml too?

Member

As far as I know, it is only needed for k3d and kind - not for k3s and k8s

@BenTheElder Feb 28, 2025

As far as I know, it is only needed for k3d and kind - not for k3s and k8s

Not necessarily. It's used any time you're using a lot of inotify, which can happen with k3s as well: anything using ConfigMaps will need one watch per ConfigMap, and user workloads of other kinds may also run into this.

@BenTheElder Feb 28, 2025

kind usage is a common way to encounter it, because you often start multiple kubelets on the same kernel plus some system workloads with ConfigMaps, but that's only one way to run up usage. A single kubelet with many ConfigMaps could hit the same limit.


BTW, with Ubuntu's defaults, Kubernetes's e2e tests created enough pods to exceed the limit while running a Kubernetes worker node on the host (not kind, and not a single-node cluster); in particular, the max_user_instances default seems to be pretty low (128).

kubernetes/kubernetes#130990

I set up my fork the other day and have been meaning to work up a new PR. That hasn't happened yet, but I'm leaving this breadcrumb in the meantime. There are also some pointers in the linked issue, with example tuning from other cluster tools in the project.
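For anyone wanting to check how close a kernel is to these limits, here is a generic sketch using the standard /proc interfaces (PID and FD below are placeholders):

# Current per-user limits:
sysctl fs.inotify.max_user_instances fs.inotify.max_user_watches
# Inotify instances open across all visible processes (one matching fd per instance):
find /proc/*/fd -lname 'anon_inode:inotify' 2>/dev/null | wc -l
# Watches held by one instance: each "inotify wd:..." line in fdinfo is one watch:
grep -c '^inotify' /proc/PID/fdinfo/FD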

@chancez
Contributor

chancez commented Dec 8, 2022

Ah, this looks great. I've been doing something similar for ages.

@BenTheElder

I'm not sure if Podman needs this treatment, as it uses crun instead of runc, which handles nested cgroup mounting differently. This would require additional testing.

inotify isn't namespaced in the kernel; if you start another VM/kernel you'll have separate limits, but otherwise this applies to anything using inotify (consider also things like the inotify command-line tools, IDEs, etc.).

crun/runc/... shouldn't change that. Increasing the inotify limits is probably a good idea on all the templates, at the cost of increasing the template complexity and some kmem.
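For reference, the change under discussion amounts to a provision step along these lines in each template (script body reconstructed from the diff excerpts above; the surrounding provision/mode keys follow lima's template conventions and are assumed here):

provision:
- mode: system
  script: |
    #!/bin/bash
    # Increase inotify limits to prevent nested Kubernetes control planes
    # from crash looping.
    echo 'fs.inotify.max_user_watches = 524288' >> /etc/sysctl.conf
    echo 'fs.inotify.max_user_instances = 512' >> /etc/sysctl.conf
    sysctl --system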

@carlosonunez

carlosonunez commented Mar 1, 2025

Hello! I apologize for not signing the DCO with my VMware account. I've since moved on and no longer have access to that GitHub handle.

@AkihiroSuda, should I re-raise this PR with my personal GitHub account (i.e. this one) and sign off with that? I still increase my inotify limits in my Docker template; it would be nice to save others the work.

(I have not tested whether podman + kind needs this treatment, though I likely will soon.)

@jandubois
Member

should I re-raise this PR with my personal GitHub account (i.e. this one) and sign off with that?

Yes, please! We are unable to merge any commits without a valid DCO; it is a CNCF requirement.

@jandubois
Member

Increasing the inotify limits is probably a good idea on all the templates, at the cost of increasing the template complexity and some kmem.

We could do this in the internal provisioning scripts, if we can agree that it should be done for all distros. That way it would not complicate the templates. We would also have to decide what the new limits are supposed to be, and whether they can be the same for each distro.

Is there any downside to increasing the limits? If not, why are the default limits so low?

@AkihiroSuda any opinion on this?

An alternative in the future (when we have composable templates) would be to implement this in a mix-in template, and users could add it with base: template://options/inotify or whatever we want to call it. It could even be parameterized with param settings. But we won't be there until later this year.
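A sketch of what such a mix-in could look like (entirely hypothetical: the options/inotify name and the base: mechanism come from the comment above and are not implemented yet; the values are the ones proposed in this PR):

# options/inotify.yaml (hypothetical)
provision:
- mode: system
  script: |
    #!/bin/sh
    cat >/etc/sysctl.d/99-inotify.conf <<'EOF'
    fs.inotify.max_user_watches = 524288
    fs.inotify.max_user_instances = 512
    EOF
    sysctl --system

# A user template would then pull it in with (hypothetical syntax):
# base: template://options/inotify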

@BenTheElder

Is there any downside to increasing the limits? If not, why are the default limits so low?

Setting low limits caps the memory used by the kernel for this purpose (tracking inotify watches):

https://man7.org/linux/man-pages/man7/inotify.7.html#:~:text=/proc%20interfaces%0A%20%20%20%20%20%20%20The%20following%20interfaces%20can%20be%20used%20to%20limit%20the%20amount%20of%20kernel%0A%20%20%20%20%20%20%20memory%20consumed%20by%20inotify%3A

So there's definitely a downside to high limits. Since it's kernel memory it can't be swapped.

I don't remember where 524288 comes from, but that's probably... excessive.

@BenTheElder

AIUI that memory is only consumed if the user actually creates that many watches though.

The kernel default is pretty low:

https://www.monodevelop.com/documentation/inotify-watches-limit/#:~:text=Managed%20file%20watching%20is%20less,The%20default%20is%208192.

https://fleet-support.jetbrains.com/hc/en-us/articles/8084899752722-Inotify-Watches-Limit-Linux

https://www.suse.com/support/kb/doc/?id=000020048

524288 max_user_watches seems pretty common, but I think a lower value would be fine.

There's some discussion in https://patchwork.kernel.org/project/linux-fsdevel/patch/[email protected]/#23713335
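For a rough sense of the worst case, using the ~1 KiB-per-watch (64-bit) figure cited in the watchexec docs linked later in the thread (the per-watch size is approximate):

max kmem ≈ max_user_watches × ~1 KiB
         ≈ 524288 × 1 KiB = 512 MiB per user, and only if that many watches actually exist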

@carlosonunez

carlosonunez commented Mar 1, 2025 via email

@BenTheElder

A workload like a remote IDE, or one cluster running as many lighter workloads as multiple clusters combined, can also do it. We tend to notice it with multiple clusters because each has a minimum number of system workloads, but just adding workloads to one cluster can hit this instead, not to mention workloads that themselves use inotify.

Bumping inotify limits is common for other developer tools that use it, see the links above.

It also shouldn't cost anything if unused. It just caps the number of entries (~inodes) that are permitted, which consume kmem.

One thing to note: these limits are per user, so they only sort of work as a defensive limit; if you run workloads as lots of users, that will multiply the effective maximum anyhow...

https://watchexec.github.io/docs/inotify-limits.html

@BenTheElder

So actually users can increase this a bit by allocating more memory, but you have to add a lot, as the default scales with 1% of memory up to a hard cap:

torvalds/linux@9289012

See also abiosoft/colima#319
It appears Docker Desktop defaults to that maximum, based on the discussion there, but I haven't had it locally since the license change and I'm not finding it documented anywhere.
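Concretely, that commit makes the default scale with RAM, roughly (the ~1 KiB-per-watch figure is approximate; the cap is from the commit):

default max_user_watches ≈ min(1% of RAM / ~1 KiB per watch, 1048576)
# e.g. a 4 GiB VM: 1% ≈ 41 MiB → ~42000 watches; the 1048576 cap is only reached around 100 GiB of RAM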

@AkihiroSuda
Member

Increasing the inotify limits is probably a good idea on all the templates, at the cost of increasing the template complexity and some kmem.

Yes, this should probably be added to https://github.com/lima-vm/lima/tree/master/pkg/cidata/cidata.TEMPLATE.d/boot.
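A sketch of what such a boot script could look like (the file name and numbering are hypothetical, following the numbered-script convention in that directory; the values are the ones proposed in this PR):

# pkg/cidata/cidata.TEMPLATE.d/boot/NN-inotify-limits.sh (hypothetical name)
#!/bin/sh
set -eu
# Raise the per-user inotify limits for every instance, regardless of template.
sysctl -w fs.inotify.max_user_watches=524288
sysctl -w fs.inotify.max_user_instances=512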


Successfully merging this pull request may close these issues.

"too many open files" error upon creating multiple Kind clusters on Lima VMs.
7 participants