LXCFS and Rancher Resource Enumeration

ilhaan · August 21, 2019, 12:54am

I have a Rancher based Kubernetes cluster that consists of nodes that are LXD containers. I have been able to deploy services to these nodes successfully. However, Rancher seems to think that each of these nodes has all the resources of the host the LXD container is running on.

For example, I launched a few worker node containers. These were ensured to have limited CPU cores using lxc config set my-container limits.cpu 4 . This was verified by running grep -c processor /proc/cpuinfo in the container, which returned 4 as expected. However, Rancher shows the same node as having 56 cores, which is the number of cores on the bare metal where the LXD containers are running. This is shown in the screenshot below:

All the nodes shown in the screenshot were CPU limited as mentioned above, but they are listed as having 56 cores by Rancher. This is problematic, given that many of these containers are on the same host.

I am running LXD 3.0.4 (from the 3.0/stable channel). Based on what I have read about LXCFS, I believe it may solve my problem. However, I am not sure if LXCFS is already running on my snap installed LXD. I see the following process on my machine when I run ps aux | grep lxcfs:

root        9885  0.0  0.0 160920  1444 ?        Sl   17:20   0:00 lxcfs /var/snap/lxd/common/var/lib/lxcfs -p /var/snap/lxd/common/lxcfs.pid

/var/snap/lxd/common/var/lib/lxcfs shown above points to /var/snap/lxd/common/shmounts/lxcfs, which does not exist (/var/snap/lxd/common/shmounts/ is an empty directory).

As per this, LXCFS will ensure that /sys/devices/system/cpu/online is container aware. It seems to me that Rancher obtains resource information from this location, which is why I believe a working install of LXCFS will solve my problem. Please let me know if you have any thoughts on this.

How do I use LXCFS with my snap installed LXD? The instructions here do not seem to be specific to a snap installation.

Anything else I should be considering? Appreciate all feedback and suggestions.

gpatel-fr · August 21, 2019, 4:50am

that’s the ‘magic’ of snap.

sudo nsenter -t $(pgrep daemon.start) -m – ls /var/snap/lxd/common/shmounts/lxcfs
cgroup proc sys

ilhaan · August 21, 2019, 5:41pm

Running that gives me:

nsenter: failed to execute –: No such file or directory

gpatel-fr · August 21, 2019, 6:23pm

Well that’s discourse ‘magic’: turning pasted text into something else when one forgets to click the darn </> button. So the correct syntax is

sudo nsenter -t $(pgrep daemon.start) -m -- ls /var/snap/lxd/common/shmounts/lxcfs

ilhaan · August 21, 2019, 7:38pm

Ah gotcha. Thanks for clarifying.

Here is what I see now:

root@server:~# nsenter -t $(pgrep daemon.start) -m -- ls /var/snap/lxd/common/shmounts/lxcfs
cgroup	proc

inspecting files under proc by running nsenter -t $(pgrep daemon.start) -m -- cat /var/snap/lxd/common/shmounts/lxcfs/proc/cpuinfo I see all 56 cores of my server listed.

Is this correct behavior?
How do I verify if LXCFS is correctly mounting a container-aware version of these dirs into containers?

gpatel-fr · August 21, 2019, 8:42pm

Certainly it’s correct since this path don’t refer the container at all, and the limit you set is on a specific container.

If in a container you don’t see a reduced CPU number in /proc/cpuinfo, it’s a problem.

I can’t see it on snap LXD 3.16. I have an old LXD version installed through APT (so 3.03 with lxcfs 3.03) and /proc/cpuinfo in a container is showing a reduced proc count.
So it don’t seem to come from an lxcfs recent enhancement;

So it seems that your software don’t use /proc/cpuinfo to count processors.
A quick search in the rancher issues leads to this.

ilhaan · August 21, 2019, 9:10pm

That is an issue I posted on Rancher’s Github to try and figure this out before I found out about LXCFS. The response to my post is not a real solution to the problem (the responder didn’t seem to fully understand my question).

So, looking at /proc/cpuinfo in a running container shows:

root@server:~# lxc exec test -- grep -c "processor" /proc/cpuinfo 
1

Which is what I expected. However:

root@server:~# lxc exec test -- cat /sys/devices/system/cpu/online
0-55

That is not something I expected, since the container should only have access to one core.

This says that LXCFS will mount /sys/devices/system/cpu/online in a container aware manner, but that doesn’t seem to be the case here. Any way I can force this to happen?

gpatel-fr · August 21, 2019, 10:14pm

Yes this seems indeed the problem. It’s the same with apt LXD 3.03 and snap LXD 3.16. No obvious reason I can see.

ilhaan · August 21, 2019, 11:02pm

I just tested this on snap LXD 3.0.4 and 3.16 and both versions seem to have the same issue with /sys/ mounts.

Filing a bug report at https://github.com/lxc/lxcfs/issues. Will post an update here if I find a fix.

Thanks for your help @gpatel-fr!

gpatel-fr · August 22, 2019, 3:59pm

After looking a bit at that, I should add that it’s pretty normal in fact that /sys/devices/system/cpu/online is not container-aware on LXD 3.03 (Apt version) since the feature was not developped. The problem occurs also on LXD 3.16 and here it seems less normal.

Nonetheless, it’s not certain that fixing this problem would fix the Rancher issue.
There are other ways of looking at Cpu info, such as this that are just not covered by lxcfs.