Container with routed NIC can't ping its "neighbour" IP Address, since it's also its own broadcast address?

schnuppepampe · August 9, 2021, 1:15pm

Hi,
Sorry if this has been asked before, but I couldn’t find anything…

I’m running multiple containers on a server. Since the ISP doesn’t allow me to use mutliple mac addresses, I’m using a routed setup. Each container has its own profile like this:

devices:
 eth0:
   ipv4.address: 1.2.3.4
   nictype: routed
   type: nic

This works perfectly well for almost everything I need, but there is one issue:
Some of the containers are in the same subnet and therefore have consecutive IP addresses. Say 1.2.3.4, 1.2.3.5, 1.2.3.6, 1.2.3.7.
While e.g. .4 and .7 can communicate just fine, .4 and .5 can’t, .5 and .6 can’t, etc.
I think this is because .5 seems to be the broadcast address of .4.

If I am inside 1.2.3.4 and run ping 1.2.3.5, I get an error like this:

Do you want to ping broadcast? Then -b. If not, check your local firewall rules.

And when checking with ip addr, indeed this is the broadcast address:

inet 1.2.3.4/32 brd 1.2.3.5 scope global eth0

Is there any simple way to fix this? Right now, sadly, two discourse instances can’t access my mail server to send emails because they are its IP-neightbours…

Note: There are also containers on the host with completely different IPs. But some of them are consecutive.

Thanks,
Adrian

tomp · August 9, 2021, 3:42pm

That suggests what ever is configuring the IP inside the container isn’t getting it quite right as on my setup in LXC it is indeed possible to reach each adjacent IP from each one, see.

inet 192.168.31.4/32 brd 255.255.255.255 scope global eth0

schnuppepampe · August 9, 2021, 5:05pm

Hmm interesting. I thought LXD has the sole responsibility to configure this network interface?
At least I didn’t configure it anywhere inside the container.

All containers are debian buster systems and were imported via lxd-p2c.
Network-Manager was uninstalled after importing and /etc/network/interfaces is mostly empty:

auto lo
iface lo inet loopback

Funnily enough, I just noticed that not all containers have this issue.
Some list their own IP as broadcast, some list their own IP+1 as broadcast (none of them lists 255.255.255.255 though). Those who list their own IP don’t seem to have this issue.

What else could be responsible for configuring the broadcast address in this way?

tomp · August 9, 2021, 5:19pm

Interesting, I’ll try and recreate, it maybe that somehow LXD/liblxc isn’t setting the explicit broadcast address which then leaves the OS (incorrectly) guessing what it should be.

Either that or it may be a regression in liblxc that LXD uses.

What version of LXD are you using?

schnuppepampe · August 9, 2021, 5:22pm

This is lxd 4.16 installed from snap packages. Let me know if you need more info (e.g. systemctl status in a wrongly configured host or something to see what’s running)

tomp · August 9, 2021, 5:36pm

Actually this looks like a regression in liblxc (cc @brauner )

In LXC 4.0.6 (the package that is in Ubuntu Focal) the router veth interface gets configured as:

inet 192.168.31.7/32 brd 255.255.255.255 scope global eth0

And in current main branch (and the one bundled with LXD 4.16 it seems), it gets configured as:

inet 192.168.31.7/32 brd 192.168.31.7 scope global eth0

tomp · August 10, 2021, 9:53am

OK so I tracked it down to this commit @brauner

Before that the broadcast is set to 255.255.255.255 and after that its set to 192.168.31.7 (this particular value changes based on the IP address set and isn’t always the same as the address set, sometimes it is the IP after the address that is specified). Either way it is incorrect though.

tomp · August 10, 2021, 9:56am

Specifying in the liblxc config a zero broadcast address (or setting it to 255.255.255.255) seems to work.

lxc.net.0.ipv4.address = 192.168.31.7/32 0.0.0.0

But the change in result of calculation between the commits seems of concern.

tomp · August 10, 2021, 10:25am

This should fix it:

github.com/lxc/lxd

NIC: Works around routed NIC regression in liblxc by setting zero broadcast address

lxc:master ← tomponline:tp-nic-routed-ipv4-broadcast

opened 10:24AM - 10 Aug 21 UTC

tomponline

+7 -1

See https://discuss.linuxcontainers.org/t/container-cant-ping-its-neighbour-ip-a…ddress-since-its-also-its-own-broadcast-address/11829 Change was introduced in liblxc by https://github.com/lxc/lxc/commit/365136359f8bf991ed172b498909000ec18b32de which change how the broadcast address was automatically calculated. Previously when using a `routed` NIC with a /32 IPv4 address it was being set to `255.255.255.255` which allowed adjacent IP communication. However that commit changed it so that the broadcast address was either set to the same IP as the specified address or the address after it, which could lead adjacent IPs not being reachable if assigned on different instance NICs. This PR uses the undocumented feature in libxc to specify an all-zero broadcast address, which then works around the change in the automatic calculation. The reason I didn't specify the 255.255.255.255 address which was previous behaviour is that AFAIK there shouldn't be an explicit broadcast address set on the `routed` NIC point-to-point links anyway, and using an all-zero broadcast address replicates the behaviour I see when manually running: ``` ip a add n.n.n.n/32 dev eth0 ``` CC @brauner

schnuppepampe · August 10, 2021, 10:38am

Awesome, thank you!
Is there a simple way to get these patches into my snap install? Or alternatively: how often are the snaps updated/when can I expect this to reach the snap?

tomp · August 10, 2021, 10:48am

I’m sure @stgraber will be able to push it to the latest/stable channel shortly after its merged.

schnuppepampe · August 30, 2021, 7:02am

Sorry to bother again - I noticed that the latest/stable channel was updated last week, so I immediately installed that new version and restarted snap.lxd.daemon.service. However, the containers still seem to get these wrong broadcast addresses? Is the patch not included in the new latest/stable version? Do I have to switch to latest/edge to get that?

tomp · August 31, 2021, 8:41am

Yes it doesn’t look like @stgraber has cherry-pick this one yet into stable. It will be included in 4.18 at least.