Previously, containers worked just fine on my machine (arch linux, up to date, custom kernel based off of 5.17.4). After a system update, things seem to be thrown out of wack.
When I try to start an existing container:
~ lxc start fed
Error: Failed to run: /usr/bin/lxd forkstart fed /var/lib/lxd/containers /var/log/lxd/fed/lxc.conf:
Try `lxc info --show-log fed` for more info
Asking for info:
~ lxc info --show-log fed
Name: fed
Status: STOPPED
Type: container
Architecture: x86_64
Created: 2022/07/13 12:18 EDT
Last Used: 2022/07/13 17:40 EDT
Log:
lxc fed 20220713214051.633 ERROR cgfsng - ../src/lxc/cgroups/cgfsng.c:cgfsng_mount:1772 - No such file or directory - Failed to create cgroup at_mnt 24()
lxc fed 20220713214051.635 ERROR conf - ../src/lxc/conf.c:lxc_mount_auto_mounts:851 - No such file or directory - Failed to mount "/sys/fs/cgroup"
lxc fed 20220713214051.635 ERROR conf - ../src/lxc/conf.c:lxc_setup:4396 - Failed to setup remaining automatic mounts
lxc fed 20220713214051.635 ERROR start - ../src/lxc/start.c:do_start:1275 - Failed to setup container "fed"
lxc fed 20220713214051.635 ERROR sync - ../src/lxc/sync.c:sync_wait:34 - An error occurred in another process (expected sequence number 4)
lxc fed 20220713214051.660 WARN network - ../src/lxc/network.c:lxc_delete_network_priv:3631 - Failed to rename interface with index 0 from "eth0" to its initial name "vethce914e5c"
lxc fed 20220713214051.660 ERROR lxccontainer - ../src/lxc/lxccontainer.c:wait_on_daemonized_start:877 - Received container state "ABORTING" instead of "RUNNING"
lxc fed 20220713214051.660 ERROR start - ../src/lxc/start.c:__lxc_start:2074 - Failed to spawn container "fed"
lxc fed 20220713214051.660 WARN start - ../src/lxc/start.c:lxc_abort:1039 - No such process - Failed to send SIGKILL via pidfd 19 for process 82634
lxc 20220713214056.206 ERROR af_unix - ../src/lxc/af_unix.c:lxc_abstract_unix_recv_fds_iov:218 - Connection reset by peer - Failed to receive response
lxc 20220713214056.206 ERROR commands - ../src/lxc/commands.c:lxc_cmd_rsp_recv_fds:127 - Failed to receive file descriptors for command "get_state"
Any advice on this? I’ve ensured /etc/subgid and /etc/subuid have the appropriate values, and followed instructions on setting up unprivileged containers. That said, creating a new (privileged container) also fails with the same output.
Solved by downgrading to lxd 5.2. Looks like 5.3 still has some issues. If anyone knows how to debug 5.3 I’d prefer to be on the newer version if it can be made to work. Thanks!
tomp
(Thomas Parrott)
July 14, 2022, 7:46am
3
Seems like an issue with the underlying liblxc rather than LXD.
Can you upgrade to LXD 5.3 again and then get the output of lxc info
please.
Sure thing. Info for 5.3 is as follows:
config:
images.auto_update_interval: "0"
api_extensions:
- storage_zfs_remove_snapshots
- container_host_shutdown_timeout
- container_stop_priority
- container_syscall_filtering
- auth_pki
- container_last_used_at
- etag
- patch
- usb_devices
- https_allowed_credentials
- image_compression_algorithm
- directory_manipulation
- container_cpu_time
- storage_zfs_use_refquota
- storage_lvm_mount_options
- network
- profile_usedby
- container_push
- container_exec_recording
- certificate_update
- container_exec_signal_handling
- gpu_devices
- container_image_properties
- migration_progress
- id_map
- network_firewall_filtering
- network_routes
- storage
- file_delete
- file_append
- network_dhcp_expiry
- storage_lvm_vg_rename
- storage_lvm_thinpool_rename
- network_vlan
- image_create_aliases
- container_stateless_copy
- container_only_migration
- storage_zfs_clone_copy
- unix_device_rename
- storage_lvm_use_thinpool
- storage_rsync_bwlimit
- network_vxlan_interface
- storage_btrfs_mount_options
- entity_description
- image_force_refresh
- storage_lvm_lv_resizing
- id_map_base
- file_symlinks
- container_push_target
- network_vlan_physical
- storage_images_delete
- container_edit_metadata
- container_snapshot_stateful_migration
- storage_driver_ceph
- storage_ceph_user_name
- resource_limits
- storage_volatile_initial_source
- storage_ceph_force_osd_reuse
- storage_block_filesystem_btrfs
- resources
- kernel_limits
- storage_api_volume_rename
- macaroon_authentication
- network_sriov
- console
- restrict_devlxd
- migration_pre_copy
- infiniband
- maas_network
- devlxd_events
- proxy
- network_dhcp_gateway
- file_get_symlink
- network_leases
- unix_device_hotplug
- storage_api_local_volume_handling
- operation_description
- clustering
- event_lifecycle
- storage_api_remote_volume_handling
- nvidia_runtime
- container_mount_propagation
- container_backup
- devlxd_images
- container_local_cross_pool_handling
- proxy_unix
- proxy_udp
- clustering_join
- proxy_tcp_udp_multi_port_handling
- network_state
- proxy_unix_dac_properties
- container_protection_delete
- unix_priv_drop
- pprof_http
- proxy_haproxy_protocol
- network_hwaddr
- proxy_nat
- network_nat_order
- container_full
- candid_authentication
- backup_compression
- candid_config
- nvidia_runtime_config
- storage_api_volume_snapshots
- storage_unmapped
- projects
- candid_config_key
- network_vxlan_ttl
- container_incremental_copy
- usb_optional_vendorid
- snapshot_scheduling
- snapshot_schedule_aliases
- container_copy_project
- clustering_server_address
- clustering_image_replication
- container_protection_shift
- snapshot_expiry
- container_backup_override_pool
- snapshot_expiry_creation
- network_leases_location
- resources_cpu_socket
- resources_gpu
- resources_numa
- kernel_features
- id_map_current
- event_location
- storage_api_remote_volume_snapshots
- network_nat_address
- container_nic_routes
- rbac
- cluster_internal_copy
- seccomp_notify
- lxc_features
- container_nic_ipvlan
- network_vlan_sriov
- storage_cephfs
- container_nic_ipfilter
- resources_v2
- container_exec_user_group_cwd
- container_syscall_intercept
- container_disk_shift
- storage_shifted
- resources_infiniband
- daemon_storage
- instances
- image_types
- resources_disk_sata
- clustering_roles
- images_expiry
- resources_network_firmware
- backup_compression_algorithm
- ceph_data_pool_name
- container_syscall_intercept_mount
- compression_squashfs
- container_raw_mount
- container_nic_routed
- container_syscall_intercept_mount_fuse
- container_disk_ceph
- virtual-machines
- image_profiles
- clustering_architecture
- resources_disk_id
- storage_lvm_stripes
- vm_boot_priority
- unix_hotplug_devices
- api_filtering
- instance_nic_network
- clustering_sizing
- firewall_driver
- projects_limits
- container_syscall_intercept_hugetlbfs
- limits_hugepages
- container_nic_routed_gateway
- projects_restrictions
- custom_volume_snapshot_expiry
- volume_snapshot_scheduling
- trust_ca_certificates
- snapshot_disk_usage
- clustering_edit_roles
- container_nic_routed_host_address
- container_nic_ipvlan_gateway
- resources_usb_pci
- resources_cpu_threads_numa
- resources_cpu_core_die
- api_os
- container_nic_routed_host_table
- container_nic_ipvlan_host_table
- container_nic_ipvlan_mode
- resources_system
- images_push_relay
- network_dns_search
- container_nic_routed_limits
- instance_nic_bridged_vlan
- network_state_bond_bridge
- usedby_consistency
- custom_block_volumes
- clustering_failure_domains
- resources_gpu_mdev
- console_vga_type
- projects_limits_disk
- network_type_macvlan
- network_type_sriov
- container_syscall_intercept_bpf_devices
- network_type_ovn
- projects_networks
- projects_networks_restricted_uplinks
- custom_volume_backup
- backup_override_name
- storage_rsync_compression
- network_type_physical
- network_ovn_external_subnets
- network_ovn_nat
- network_ovn_external_routes_remove
- tpm_device_type
- storage_zfs_clone_copy_rebase
- gpu_mdev
- resources_pci_iommu
- resources_network_usb
- resources_disk_address
- network_physical_ovn_ingress_mode
- network_ovn_dhcp
- network_physical_routes_anycast
- projects_limits_instances
- network_state_vlan
- instance_nic_bridged_port_isolation
- instance_bulk_state_change
- network_gvrp
- instance_pool_move
- gpu_sriov
- pci_device_type
- storage_volume_state
- network_acl
- migration_stateful
- disk_state_quota
- storage_ceph_features
- projects_compression
- projects_images_remote_cache_expiry
- certificate_project
- network_ovn_acl
- projects_images_auto_update
- projects_restricted_cluster_target
- images_default_architecture
- network_ovn_acl_defaults
- gpu_mig
- project_usage
- network_bridge_acl
- warnings
- projects_restricted_backups_and_snapshots
- clustering_join_token
- clustering_description
- server_trusted_proxy
- clustering_update_cert
- storage_api_project
- server_instance_driver_operational
- server_supported_storage_drivers
- event_lifecycle_requestor_address
- resources_gpu_usb
- clustering_evacuation
- network_ovn_nat_address
- network_bgp
- network_forward
- custom_volume_refresh
- network_counters_errors_dropped
- metrics
- image_source_project
- clustering_config
- network_peer
- linux_sysctl
- network_dns
- ovn_nic_acceleration
- certificate_self_renewal
- instance_project_move
- storage_volume_project_move
- cloud_init
- network_dns_nat
- database_leader
- instance_all_projects
- clustering_groups
- ceph_rbd_du
- instance_get_full
- qemu_metrics
- gpu_mig_uuid
- event_project
- clustering_evacuation_live
- instance_allow_inconsistent_copy
- network_state_ovn
- storage_volume_api_filtering
- image_restrictions
- storage_zfs_export
- network_dns_records
- storage_zfs_reserve_space
- network_acl_log
- storage_zfs_blocksize
- metrics_cpu_seconds
- instance_snapshot_never
- certificate_token
- instance_nic_routed_neighbor_probe
- event_hub
- agent_nic_config
- projects_restricted_intercept
- metrics_authentication
- images_target_project
- cluster_migration_inconsistent_copy
- cluster_ovn_chassis
- container_syscall_intercept_sched_setscheduler
- storage_lvm_thinpool_metadata_size
- storage_volume_state_total
- instance_file_head
- instances_nic_host_name
- image_copy_profile
- container_syscall_intercept_sysinfo
- clustering_evacuation_mode
- resources_pci_vpd
- qemu_raw_conf
- storage_cephfs_fscache
api_status: stable
api_version: "1.0"
auth: trusted
public: false
auth_methods:
- tls
environment:
addresses: []
architectures:
- x86_64
- i686
certificate: |
-----BEGIN CERTIFICATE-----
MIICDzCCAZWgAwIBAgIQPP+US4+1gVh6ru4TKP9J3jAKBggqhkjOPQQDAzA4MRww
GgYDVQQKExNsaW51eGNvbnRhaW5lcnMub3JnMRgwFgYDVQQDDA9yb290QG1vdGhl
cnNoaXAwHhcNMjIwMTE5MjE0ODAxWhcNMzIwMTE3MjE0ODAxWjA4MRwwGgYDVQQK
ExNsaW51eGNvbnRhaW5lcnMub3JnMRgwFgYDVQQDDA9yb290QG1vdGhlcnNoaXAw
djAQBgcqhkjOPQIBBgUrgQQAIgNiAARXnc7TLJVIfWf+cSAcbtgejOc4IVULOvcU
B+0tq/Flpnho3H5p3i4pBGl8+cA45YjBMjbOrh58OSFzI3N7fNZLaL8G2ARRmuec
CQyNXFp9SpzgipW4oDVCZnugBCSNHDGjZDBiMA4GA1UdDwEB/wQEAwIFoDATBgNV
HSUEDDAKBggrBgEFBQcDATAMBgNVHRMBAf8EAjAAMC0GA1UdEQQmMCSCCm1vdGhl
cnNoaXCHBH8AAAGHEAAAAAAAAAAAAAAAAAAAAAEwCgYIKoZIzj0EAwMDaAAwZQIw
fjDzXucyFjwKpxpahVsQ/WwdkwmAIjzX8wLDaztubd5qzYSixt6Ihgr56omoLOml
AjEAp5fB1MQ5J2peHygnw1D/vr3GtYfZF3RZcsMuqX36rgUDkxJA2941UV1aKf4z
b1Vd
-----END CERTIFICATE-----
certificate_fingerprint: 34076e232c61350c040654ec2a228283624c1e25d136e977ac4ef9a2e9fcf10d
driver: lxc | qemu
driver_version: 5.0.0 | 7.0.50
firewall: nftables
kernel: Linux
kernel_architecture: x86_64
kernel_features:
idmapped_mounts: "true"
netnsid_getifaddrs: "true"
seccomp_listener: "true"
seccomp_listener_continue: "true"
shiftfs: "false"
uevent_injection: "true"
unpriv_fscaps: "true"
kernel_version: 5.17.4-arch1-1-x570-vfio-openrgb-sm2262+sm2263
lxc_features:
cgroup2: "true"
core_scheduling: "true"
devpts_fd: "true"
idmapped_mounts_v2: "true"
mount_injection_file: "true"
network_gateway_device_route: "true"
network_ipvlan: "true"
network_l2proxy: "true"
network_phys_macvlan_mtu: "true"
network_veth_router: "true"
pidfd: "true"
seccomp_allow_deny_syntax: "true"
seccomp_notify: "true"
seccomp_proxy_send_notify_fd: "true"
os_name: Arch Linux
os_version: ""
project: default
server: lxd
server_clustered: false
server_event_mode: full-mesh
server_name: mothership
server_pid: 1002
server_version: "5.3"
storage: btrfs
storage_version: 5.18.1
storage_supported_drivers:
- name: btrfs
version: 5.18.1
remote: false
- name: cephfs
version: 15.2.14
remote: true
- name: dir
version: "1"
remote: false
- name: lvm
version: 2.03.16(2) (2022-05-18) / 1.02.185 (2022-05-18) / 4.45.0
remote: false
- name: ceph
version: 15.2.14
remote: true
If it helps, 5.2 config (working) is as follows:
config:
images.auto_update_interval: "0"
api_extensions:
- storage_zfs_remove_snapshots
- container_host_shutdown_timeout
- container_stop_priority
- container_syscall_filtering
- auth_pki
- container_last_used_at
- etag
- patch
- usb_devices
- https_allowed_credentials
- image_compression_algorithm
- directory_manipulation
- container_cpu_time
- storage_zfs_use_refquota
- storage_lvm_mount_options
- network
- profile_usedby
- container_push
- container_exec_recording
- certificate_update
- container_exec_signal_handling
- gpu_devices
- container_image_properties
- migration_progress
- id_map
- network_firewall_filtering
- network_routes
- storage
- file_delete
- file_append
- network_dhcp_expiry
- storage_lvm_vg_rename
- storage_lvm_thinpool_rename
- network_vlan
- image_create_aliases
- container_stateless_copy
- container_only_migration
- storage_zfs_clone_copy
- unix_device_rename
- storage_lvm_use_thinpool
- storage_rsync_bwlimit
- network_vxlan_interface
- storage_btrfs_mount_options
- entity_description
- image_force_refresh
- storage_lvm_lv_resizing
- id_map_base
- file_symlinks
- container_push_target
- network_vlan_physical
- storage_images_delete
- container_edit_metadata
- container_snapshot_stateful_migration
- storage_driver_ceph
- storage_ceph_user_name
- resource_limits
- storage_volatile_initial_source
- storage_ceph_force_osd_reuse
- storage_block_filesystem_btrfs
- resources
- kernel_limits
- storage_api_volume_rename
- macaroon_authentication
- network_sriov
- console
- restrict_devlxd
- migration_pre_copy
- infiniband
- maas_network
- devlxd_events
- proxy
- network_dhcp_gateway
- file_get_symlink
- network_leases
- unix_device_hotplug
- storage_api_local_volume_handling
- operation_description
- clustering
- event_lifecycle
- storage_api_remote_volume_handling
- nvidia_runtime
- container_mount_propagation
- container_backup
- devlxd_images
- container_local_cross_pool_handling
- proxy_unix
- proxy_udp
- clustering_join
- proxy_tcp_udp_multi_port_handling
- network_state
- proxy_unix_dac_properties
- container_protection_delete
- unix_priv_drop
- pprof_http
- proxy_haproxy_protocol
- network_hwaddr
- proxy_nat
- network_nat_order
- container_full
- candid_authentication
- backup_compression
- candid_config
- nvidia_runtime_config
- storage_api_volume_snapshots
- storage_unmapped
- projects
- candid_config_key
- network_vxlan_ttl
- container_incremental_copy
- usb_optional_vendorid
- snapshot_scheduling
- snapshot_schedule_aliases
- container_copy_project
- clustering_server_address
- clustering_image_replication
- container_protection_shift
- snapshot_expiry
- container_backup_override_pool
- snapshot_expiry_creation
- network_leases_location
- resources_cpu_socket
- resources_gpu
- resources_numa
- kernel_features
- id_map_current
- event_location
- storage_api_remote_volume_snapshots
- network_nat_address
- container_nic_routes
- rbac
- cluster_internal_copy
- seccomp_notify
- lxc_features
- container_nic_ipvlan
- network_vlan_sriov
- storage_cephfs
- container_nic_ipfilter
- resources_v2
- container_exec_user_group_cwd
- container_syscall_intercept
- container_disk_shift
- storage_shifted
- resources_infiniband
- daemon_storage
- instances
- image_types
- resources_disk_sata
- clustering_roles
- images_expiry
- resources_network_firmware
- backup_compression_algorithm
- ceph_data_pool_name
- container_syscall_intercept_mount
- compression_squashfs
- container_raw_mount
- container_nic_routed
- container_syscall_intercept_mount_fuse
- container_disk_ceph
- virtual-machines
- image_profiles
- clustering_architecture
- resources_disk_id
- storage_lvm_stripes
- vm_boot_priority
- unix_hotplug_devices
- api_filtering
- instance_nic_network
- clustering_sizing
- firewall_driver
- projects_limits
- container_syscall_intercept_hugetlbfs
- limits_hugepages
- container_nic_routed_gateway
- projects_restrictions
- custom_volume_snapshot_expiry
- volume_snapshot_scheduling
- trust_ca_certificates
- snapshot_disk_usage
- clustering_edit_roles
- container_nic_routed_host_address
- container_nic_ipvlan_gateway
- resources_usb_pci
- resources_cpu_threads_numa
- resources_cpu_core_die
- api_os
- container_nic_routed_host_table
- container_nic_ipvlan_host_table
- container_nic_ipvlan_mode
- resources_system
- images_push_relay
- network_dns_search
- container_nic_routed_limits
- instance_nic_bridged_vlan
- network_state_bond_bridge
- usedby_consistency
- custom_block_volumes
- clustering_failure_domains
- resources_gpu_mdev
- console_vga_type
- projects_limits_disk
- network_type_macvlan
- network_type_sriov
- container_syscall_intercept_bpf_devices
- network_type_ovn
- projects_networks
- projects_networks_restricted_uplinks
- custom_volume_backup
- backup_override_name
- storage_rsync_compression
- network_type_physical
- network_ovn_external_subnets
- network_ovn_nat
- network_ovn_external_routes_remove
- tpm_device_type
- storage_zfs_clone_copy_rebase
- gpu_mdev
- resources_pci_iommu
- resources_network_usb
- resources_disk_address
- network_physical_ovn_ingress_mode
- network_ovn_dhcp
- network_physical_routes_anycast
- projects_limits_instances
- network_state_vlan
- instance_nic_bridged_port_isolation
- instance_bulk_state_change
- network_gvrp
- instance_pool_move
- gpu_sriov
- pci_device_type
- storage_volume_state
- network_acl
- migration_stateful
- disk_state_quota
- storage_ceph_features
- projects_compression
- projects_images_remote_cache_expiry
- certificate_project
- network_ovn_acl
- projects_images_auto_update
- projects_restricted_cluster_target
- images_default_architecture
- network_ovn_acl_defaults
- gpu_mig
- project_usage
- network_bridge_acl
- warnings
- projects_restricted_backups_and_snapshots
- clustering_join_token
- clustering_description
- server_trusted_proxy
- clustering_update_cert
- storage_api_project
- server_instance_driver_operational
- server_supported_storage_drivers
- event_lifecycle_requestor_address
- resources_gpu_usb
- clustering_evacuation
- network_ovn_nat_address
- network_bgp
- network_forward
- custom_volume_refresh
- network_counters_errors_dropped
- metrics
- image_source_project
- clustering_config
- network_peer
- linux_sysctl
- network_dns
- ovn_nic_acceleration
- certificate_self_renewal
- instance_project_move
- storage_volume_project_move
- cloud_init
- network_dns_nat
- database_leader
- instance_all_projects
- clustering_groups
- ceph_rbd_du
- instance_get_full
- qemu_metrics
- gpu_mig_uuid
- event_project
- clustering_evacuation_live
- instance_allow_inconsistent_copy
- network_state_ovn
- storage_volume_api_filtering
- image_restrictions
- storage_zfs_export
- network_dns_records
- storage_zfs_reserve_space
- network_acl_log
- storage_zfs_blocksize
- metrics_cpu_seconds
- instance_snapshot_never
- certificate_token
- instance_nic_routed_neighbor_probe
- event_hub
- agent_nic_config
- projects_restricted_intercept
- metrics_authentication
- images_target_project
- cluster_migration_inconsistent_copy
- cluster_ovn_chassis
- container_syscall_intercept_sched_setscheduler
- storage_lvm_thinpool_metadata_size
- storage_volume_state_total
- instance_file_head
- instances_nic_host_name
- image_copy_profile
- container_syscall_intercept_sysinfo
- clustering_evacuation_mode
- resources_pci_vpd
api_status: stable
api_version: "1.0"
auth: trusted
public: false
auth_methods:
- tls
environment:
addresses: []
architectures:
- x86_64
- i686
certificate: |
-----BEGIN CERTIFICATE-----
MIICDzCCAZWgAwIBAgIQPP+US4+1gVh6ru4TKP9J3jAKBggqhkjOPQQDAzA4MRww
GgYDVQQKExNsaW51eGNvbnRhaW5lcnMub3JnMRgwFgYDVQQDDA9yb290QG1vdGhl
cnNoaXAwHhcNMjIwMTE5MjE0ODAxWhcNMzIwMTE3MjE0ODAxWjA4MRwwGgYDVQQK
ExNsaW51eGNvbnRhaW5lcnMub3JnMRgwFgYDVQQDDA9yb290QG1vdGhlcnNoaXAw
djAQBgcqhkjOPQIBBgUrgQQAIgNiAARXnc7TLJVIfWf+cSAcbtgejOc4IVULOvcU
B+0tq/Flpnho3H5p3i4pBGl8+cA45YjBMjbOrh58OSFzI3N7fNZLaL8G2ARRmuec
CQyNXFp9SpzgipW4oDVCZnugBCSNHDGjZDBiMA4GA1UdDwEB/wQEAwIFoDATBgNV
HSUEDDAKBggrBgEFBQcDATAMBgNVHRMBAf8EAjAAMC0GA1UdEQQmMCSCCm1vdGhl
cnNoaXCHBH8AAAGHEAAAAAAAAAAAAAAAAAAAAAEwCgYIKoZIzj0EAwMDaAAwZQIw
fjDzXucyFjwKpxpahVsQ/WwdkwmAIjzX8wLDaztubd5qzYSixt6Ihgr56omoLOml
AjEAp5fB1MQ5J2peHygnw1D/vr3GtYfZF3RZcsMuqX36rgUDkxJA2941UV1aKf4z
b1Vd
-----END CERTIFICATE-----
certificate_fingerprint: 34076e232c61350c040654ec2a228283624c1e25d136e977ac4ef9a2e9fcf10d
driver: lxc | qemu
driver_version: 5.0.0 | 7.0.50
firewall: nftables
kernel: Linux
kernel_architecture: x86_64
kernel_features:
idmapped_mounts: "true"
netnsid_getifaddrs: "true"
seccomp_listener: "true"
seccomp_listener_continue: "true"
shiftfs: "false"
uevent_injection: "true"
unpriv_fscaps: "true"
kernel_version: 5.17.4-arch1-1-x570-vfio-openrgb-sm2262+sm2263
lxc_features:
cgroup2: "true"
core_scheduling: "true"
devpts_fd: "true"
idmapped_mounts_v2: "true"
mount_injection_file: "true"
network_gateway_device_route: "true"
network_ipvlan: "true"
network_l2proxy: "true"
network_phys_macvlan_mtu: "true"
network_veth_router: "true"
pidfd: "true"
seccomp_allow_deny_syntax: "true"
seccomp_notify: "true"
seccomp_proxy_send_notify_fd: "true"
os_name: Arch Linux
os_version: ""
project: default
server: lxd
server_clustered: false
server_event_mode: full-mesh
server_name: mothership
server_pid: 995
server_version: "5.2"
storage: btrfs
storage_version: 5.18.1
storage_supported_drivers:
- name: lvm
version: 2.03.16(2) (2022-05-18) / 1.02.185 (2022-05-18) / 4.45.0
remote: false
- name: ceph
version: 15.2.14
remote: true
- name: btrfs
version: 5.18.1
remote: false
- name: cephfs
version: 15.2.14
remote: true
- name: dir
version: "1"
remote: false
Both versions 5.2 and 5.3 are ran with lxc 5.0
tomp
(Thomas Parrott)
July 18, 2022, 8:32am
7
Seems like an issue with the host’s cgroup setup, the error “Failed to create cgroup at_mnt” was mentioned in here too:
opened 04:18PM - 09 Jul 22 UTC
closed 01:51PM - 11 Jul 22 UTC
# Required information
* Distribution: Gentoo
* The output of "lxc info" o… r if that fails:
```
config: {}
api_extensions:
- storage_zfs_remove_snapshots
- container_host_shutdown_timeout
- container_stop_priority
- container_syscall_filtering
- auth_pki
- container_last_used_at
- etag
- patch
- usb_devices
- https_allowed_credentials
- image_compression_algorithm
- directory_manipulation
- container_cpu_time
- storage_zfs_use_refquota
- storage_lvm_mount_options
- network
- profile_usedby
- container_push
- container_exec_recording
- certificate_update
- container_exec_signal_handling
- gpu_devices
- container_image_properties
- migration_progress
- id_map
- network_firewall_filtering
- network_routes
- storage
- file_delete
- file_append
- network_dhcp_expiry
- storage_lvm_vg_rename
- storage_lvm_thinpool_rename
- network_vlan
- image_create_aliases
- container_stateless_copy
- container_only_migration
- storage_zfs_clone_copy
- unix_device_rename
- storage_lvm_use_thinpool
- storage_rsync_bwlimit
- network_vxlan_interface
- storage_btrfs_mount_options
- entity_description
- image_force_refresh
- storage_lvm_lv_resizing
- id_map_base
- file_symlinks
- container_push_target
- network_vlan_physical
- storage_images_delete
- container_edit_metadata
- container_snapshot_stateful_migration
- storage_driver_ceph
- storage_ceph_user_name
- resource_limits
- storage_volatile_initial_source
- storage_ceph_force_osd_reuse
- storage_block_filesystem_btrfs
- resources
- kernel_limits
- storage_api_volume_rename
- macaroon_authentication
- network_sriov
- console
- restrict_devlxd
- migration_pre_copy
- infiniband
- maas_network
- devlxd_events
- proxy
- network_dhcp_gateway
- file_get_symlink
- network_leases
- unix_device_hotplug
- storage_api_local_volume_handling
- operation_description
- clustering
- event_lifecycle
- storage_api_remote_volume_handling
- nvidia_runtime
- container_mount_propagation
- container_backup
- devlxd_images
- container_local_cross_pool_handling
- proxy_unix
- proxy_udp
- clustering_join
- proxy_tcp_udp_multi_port_handling
- network_state
- proxy_unix_dac_properties
- container_protection_delete
- unix_priv_drop
- pprof_http
- proxy_haproxy_protocol
- network_hwaddr
- proxy_nat
- network_nat_order
- container_full
- candid_authentication
- backup_compression
- candid_config
- nvidia_runtime_config
- storage_api_volume_snapshots - storage_unmapped
- projects
- candid_config_key
- network_vxlan_ttl
- container_incremental_copy
- usb_optional_vendorid
- snapshot_scheduling
- snapshot_schedule_aliases
- container_copy_project
- clustering_server_address
- clustering_image_replication
- container_protection_shift
- snapshot_expiry
- container_backup_override_pool
- snapshot_expiry_creation
- network_leases_location
- resources_cpu_socket
- resources_gpu
- resources_numa
- kernel_features
- id_map_current
- event_location
- storage_api_remote_volume_snapshots
- network_nat_address
- container_nic_routes
- rbac
- cluster_internal_copy
- seccomp_notify
- lxc_features
- container_nic_ipvlan
- network_vlan_sriov
- storage_cephfs
- container_nic_ipfilter
- resources_v2
- container_exec_user_group_cwd
- container_syscall_intercept
- container_disk_shift
- storage_shifted
- resources_infiniband
- daemon_storage
- instances
- image_types
- resources_disk_sata
- clustering_roles
- images_expiry
- resources_network_firmware
- backup_compression_algorithm
- ceph_data_pool_name
- container_syscall_intercept_mount
- compression_squashfs
- container_raw_mount
- container_nic_routed
- container_syscall_intercept_mount_fuse
- container_disk_ceph
- virtual-machines
- image_profiles
- clustering_architecture
- resources_disk_id
- storage_lvm_stripes
- vm_boot_priority
- unix_hotplug_devices
- api_filtering
- instance_nic_network
- clustering_sizing
- firewall_driver
- projects_limits
- container_syscall_intercept_hugetlbfs
- limits_hugepages
- container_nic_routed_gateway
- projects_restrictions
- custom_volume_snapshot_expiry
- volume_snapshot_scheduling
- trust_ca_certificates
- snapshot_disk_usage
- clustering_edit_roles
- container_nic_routed_host_address
- container_nic_ipvlan_gateway
- resources_usb_pci
- resources_cpu_threads_numa
- resources_cpu_core_die
- api_os
- container_nic_routed_host_table
- container_nic_ipvlan_host_table
- container_nic_ipvlan_mode
- resources_system
- images_push_relay
- network_dns_search
- container_nic_routed_limits
- instance_nic_bridged_vlan
- network_state_bond_bridge
- usedby_consistency
- custom_block_volumes
- clustering_failure_domains
- resources_gpu_mdev
- console_vga_type
- projects_limits_disk
- network_type_macvlan
- network_type_sriov
- container_syscall_intercept_bpf_devices
- network_type_ovn
- projects_networks
- projects_networks_restricted_uplinks
- custom_volume_backup
- backup_override_name
- storage_rsync_compression
- network_type_physical
- network_ovn_external_subnets
- network_ovn_nat
- network_ovn_external_routes_remove
- tpm_device_type
- storage_zfs_clone_copy_rebase
- gpu_mdev
- resources_pci_iommu
- resources_network_usb
- resources_disk_address
- network_physical_ovn_ingress_mode
- network_ovn_dhcp
- network_physical_routes_anycast
- projects_limits_instances
- network_state_vlan
- instance_nic_bridged_port_isolation
- instance_bulk_state_change
- network_gvrp
- instance_pool_move
- gpu_sriov
- pci_device_type
- storage_volume_state
- network_acl
- migration_stateful
- disk_state_quota
- storage_ceph_features
- projects_compression
- projects_images_remote_cache_expiry
- certificate_project
- network_ovn_acl
- projects_images_auto_update
- projects_restricted_cluster_target
- images_default_architecture
- network_ovn_acl_defaults
- gpu_mig
- project_usage
- network_bridge_acl
- warnings
- projects_restricted_backups_and_snapshots
- clustering_join_token
- clustering_description
- server_trusted_proxy
- clustering_update_cert
- storage_api_project
- server_instance_driver_operational
- server_supported_storage_drivers
- event_lifecycle_requestor_address
- resources_gpu_usb
- clustering_evacuation
- network_ovn_nat_address
- network_bgp
- network_forward
- custom_volume_refresh
- network_counters_errors_dropped
- metrics
- image_source_project
- clustering_config
- network_peer
- linux_sysctl
- network_dns
- ovn_nic_acceleration
- certificate_self_renewal
- instance_project_move
- storage_volume_project_move
- cloud_init
- network_dns_nat
- database_leader
- instance_all_projects
- clustering_groups
- ceph_rbd_du
- instance_get_full
- qemu_metrics
- gpu_mig_uuid
- event_project
- clustering_evacuation_live
- instance_allow_inconsistent_copy
- network_state_ovn
- storage_volume_api_filtering
- image_restrictions
- storage_zfs_export
- network_dns_records
- storage_zfs_reserve_space
- network_acl_log
- storage_zfs_blocksize
- metrics_cpu_seconds
- instance_snapshot_never
- certificate_token
- instance_nic_routed_neighbor_probe
- event_hub
- agent_nic_config
- projects_restricted_intercept
- metrics_authentication
- images_target_project
- cluster_migration_inconsistent_copy
- cluster_ovn_chassis
- container_syscall_intercept_sched_setscheduler
- storage_lvm_thinpool_metadata_size
api_status: stable
api_version: "1.0"
auth: trusted
public: false
auth_methods:
- tls
environment:
addresses: []
architectures:
- x86_64
- i686
certificate: |
[...]
certificate_fingerprint: 31d1389a877fdb0758210471a80268d61d067cd889e4f46fa6a4296694f76fdd
driver: lxc | qemu
driver_version: 4.0.12 | 7.0.0
firewall: xtables
kernel: Linux
kernel_architecture: x86_64
kernel_features:
idmapped_mounts: "true"
netnsid_getifaddrs: "true"
seccomp_listener: "true"
seccomp_listener_continue: "true"
shiftfs: "false"
uevent_injection: "true"
unpriv_fscaps: "true"
kernel_version: 5.18.10-gentoo
lxc_features:
cgroup2: "true"
core_scheduling: "true"
devpts_fd: "true"
idmapped_mounts_v2: "true"
mount_injection_file: "true"
network_gateway_device_route: "true"
network_ipvlan: "true"
network_l2proxy: "true"
network_phys_macvlan_mtu: "true"
network_veth_router: "true"
pidfd: "true"
seccomp_allow_deny_syntax: "true"
seccomp_notify: "true"
seccomp_proxy_send_notify_fd: "true"
os_name: Gentoo
os_version: "2.6"
project: default
server: lxd
server_clustered: false
server_event_mode: full-mesh
server_name: host.example.com
server_pid: 4794
server_version: 5.0.0
storage: lvm
storage_version: 2.02.188(2) (2021-05-07) / 1.02.172 (2021-05-07) / 4.46.0
storage_supported_drivers:
- name: btrfs
version: 5.15.1
remote: false
- name: dir
version: "1"
remote: false
- name: lvm
version: 2.02.188(2) (2021-05-07) / 1.02.172 (2021-05-07) / 4.46.0
remote: false
```
# Issue description
For the past year or so my lxd setup on Gentoo was non-functional. Whenever I tried to start a container it would fail with a cgroup mount error such as:
```
lxc debian11 20220705053545.180 TRACE cgfsng - cgroups/cgfsng.c:cgfsng_mount:1608 - Read-write cgroup mounts requested
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:__fs_prepare:177 - Finished initializing new tmpfs filesystem context 22
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:fs_set_property:215 - Set "mode" to "0755" on filesystem context 22
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:fs_set_property:215 - Set "size" to "10240k" on filesystem context 22
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:fs_attach:266 - Mounted 24 onto 23
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:__fs_prepare:177 - Finished initializing new cgroup filesystem context 24
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:fs_set_property:215 - Set "name" to "portage" on filesystem context 24
lxc debian11 20220705053545.180 TRACE mount_utils - mount_utils.c:fs_attach:266 - Mounted 26 onto 25
lxc debian11 20220705053545.180 DEBUG cgfsng - cgroups/cgfsng.c:__cgroupfs_mount:1541 - Mounted cgroup filesystem cgroup onto 23(portage)
lxc debian11 20220705053545.180 ERROR cgfsng - cgroups/cgfsng.c:cgfsng_mount:1771 - No such file or directory - Failed to create cgroup at_mnt 23()
lxc debian11 20220705053545.180 ERROR conf - conf.c:lxc_mount_auto_mounts:851 - No such file or directory - Failed to mount "/sys/fs/cgroup"
lxc debian11 20220705053545.180 ERROR conf - conf.c:lxc_setup:4391 - Failed to setup remaining automatic mounts
lxc debian11 20220705053545.180 ERROR start - start.c:do_start:1275 - Failed to setup container "debian11"
lxc debian11 20220705053545.180 TRACE sync - sync.c:lxc_sync_wake_parent:104 - Child waking parent with sequence error
lxc debian11 20220705053545.181 TRACE conf - conf.c:lxc_idmapped_mounts_parent:4119 - Finished receiving idmapped mount file descriptors from child
lxc debian11 20220705053545.181 TRACE sync - sync.c:lxc_sync_wait_child:116 - Parent waiting for child with sequence cgroup-limits
lxc debian11 20220705053545.181 ERROR sync - sync.c:sync_wait:34 - An error occurred in another process (expected sequence number 4)
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_USER_NS=/proc/132343/fd/20
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_MNT_NS=/proc/132343/fd/21
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_PID_NS=/proc/132343/fd/22
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_UTS_NS=/proc/132343/fd/23
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_IPC_NS=/proc/132343/fd/24
lxc debian11 20220705053545.181 TRACE start - start.c:lxc_expose_namespace_environment:909 - Set environment variable LXC_NET_NS=/proc/132343/fd/4
lxc debian11 20220705053545.201 WARN network - network.c:lxc_delete_network_priv:3617 - Failed to rename interface with index 0 from "eth0" to its initial name "veth0a7d5a71"
lxc debian11 20220705053545.201 DEBUG network - network.c:lxc_delete_network:4159 - Deleted network devices
lxc debian11 20220705053545.201 TRACE start - start.c:lxc_serve_state_socket_pair:547 - Sent container state "ABORTING" to 7
lxc debian11 20220705053545.201 TRACE start - start.c:lxc_serve_state_clients:486 - Set container state to ABORTING
lxc debian11 20220705053545.201 TRACE start - start.c:lxc_serve_state_clients:489 - No state clients registered
lxc debian11 20220705053545.201 ERROR lxccontainer - lxccontainer.c:wait_on_daemonized_start:877 - Received container state "ABORTING" instead of "RUNNING"
lxc debian11 20220705053545.201 ERROR start - start.c:__lxc_start:2074 - Failed to spawn container "debian11"
```
For a long time I ignored the error. I thought maybe something broken with the transition to cgroups v2 as Gentoo was "only" offering the LXD 4.0.x LTS version.
Then recently LXD 5.0.0 became available so I gave it another try but unfortunately with the same behaviour. :cry:
So today I invested some time trying to figure out what's going wrong and I think I found something weird:
* This machine is using systemd which mounts cgroup v2 to `/sys/fs/cgroup`:
```
# mount | grep cgroup
cgroup2 on /sys/fs/cgroup type cgroup2 (rw,nosuid,nodev,noexec,relatime)
```
* If I also enable a [Gentoo portage FEATURE](https://dev.gentoo.org/~zmedico/portage/doc/man/make.conf.5.html) in `/etc/portage/make.conf` called `cgroup`...
> Use Linux control group to control processes spawned by ebuilds. This allows emerge to safely kill all subprocesses when ebuild phase exits
...then another cgroup mount is automatically setup after installing a package:
```
# mount | grep cgroup
cgroup2 on /sys/fs/cgroup type cgroup2 (rw,nosuid,nodev,noexec,relatime)
tmpfs on /sys/fs/cgroup/portage type cgroup (rw,nosuid,nodev,noexec,relatime,release_agent=/usr/lib/portage/python3.10/cgroup-release-agent,name=portage)
```
* The `/sys/fs/cgroup/portage` would remain mounted until the next reboot. As long as it is mounted containers fail to start with the error printed above. As soon as I unmount `/sys/fs/cgroup/portage` the containers can be started again.
Maybe you could hint me if this is a behaviour that could be fixed in lxd/lxcfs or if this is something that I need to bring to attention to the Gentoo portage developers?
# Steps to reproduce
1. Use a Gentoo Linux system with systemd that mounts cgroups v2 to `/sys/fs/cgroup`
2. Ensure `FEATURES="cgroup"` is enabled in `/etc/portage/make.conf`
3. Emerge a package
4. Verify that `/sys/fs/cgroup/portage` is mounted
5. Start a LXD container via `lxc launch ...` or `lxc start ...`
You could also try this:
This isn’t specific a standalone or clustered LXD setups, it is to do with the cgroup setup of the LXD host compared to what the image you’re trying to use requires.
Running older container OSes on more modern host systems that use CGroupV2 can produce this conflict.
To resolve it you need to boot your host system into CGroupV1 mode by modifying your kernel’s boot arguments to include:
systemd.unified_cgroup_hierarchy=false
This was indeed the issue! My cmdline was previously systemd.unified_cgroup_hierarchy=1. Changing it to systemd.unified_cgroup_hierarchy=false as suggested smooths the transition from 5.2 to 5.3. Thanks!
1 Like
That also worked on my machine, but I had this issue using and arch container image downloaded from linuxcontainers.org servers, where can I get a more up to date image that uses CGroupV2?