Building and Deploying Local Kata Artifacts¶

Overview¶

This guide shows how to build Kata guest artifacts (the guest kernel, rootfs image, and shim) locally, deploy them to a test node, and run them with a custom runtime config.

It also documents two failure modes that are easy to hit and hard to diagnose: a rootfs image that builds "successfully" but is empty, and a guest kernel panic caused by module signature enforcement. Both are covered in Troubleshooting.

Note

Examples use the NVIDIA GPU variant (nvidia-gpu), which exercises EROFS, measured rootfs (dm-verity), and signed modules. Substitute the variant name for others.

Build system¶

Artifacts are built from tools/packaging/kata-deploy/local-build/. Each target produces a kata-static-<component>.tar.zst under build/:

Bash

cd tools/packaging/kata-deploy/local-build

make kernel-nvidia-gpu-tarball          # guest kernel + modules
make rootfs-image-nvidia-gpu-tarball    # rootfs image (builds the kernel first)
make shim-v2-tarball                    # shim + configs

USE_CACHE¶

USE_CACHE defaults to yes, which pulls prebuilt artifacts from CI and skips the local build whenever a cached version matches. This means a make run may not contain your changes, and it mixes CI components with your own. Force a local build with:

Bash

export USE_CACHE=no
make rootfs-image-nvidia-gpu-tarball

Note

build/<component>-version, -sha256sum, and -builder-image-version files are the fingerprint of a cache pull. If you intended a local build and see them, you forgot USE_CACHE=no.

The GPU image build sets FS_TYPE=erofs, MEASURED_ROOTFS=yes, and SKIP_DAX_HEADER=yes automatically.

Rebuild only the rootfs image¶

When the rootfs contents are unchanged and you only need to repackage them (for example after editing image_builder.sh), build the image directly against the populated rootfs directory:

Bash

cd tools/osbuilder/image-builder

ROOTFS=../../packaging/kata-deploy/local-build/build/rootfs-image-nvidia-gpu/builddir/rootfs-image/ubuntu_rootfs

sudo USE_DOCKER=1 \
  FS_TYPE=erofs \
  MEASURED_ROOTFS=yes \
  SKIP_DAX_HEADER=yes \
  BUILD_VARIANT=nvidia-gpu \
  AGENT_INIT=no \
  ./image_builder.sh -o /tmp/kata-rootfs.image "$ROOTFS"

USE_DOCKER=1 runs the build in the image-builder-osbuilder container, which has the required tools and bind-mounts the on-disk image_builder.sh, so local edits take effect.

Always verify the image is populated (see Verifying an image is populated) before deploying.

Deploy to a node¶

Artifacts install under /opt/kata/share/kata-containers/.

Replace the default artifacts¶

Extract the kernel tarball at / so its paths land in place and the *-nvidia-gpu.container symlinks repoint to the new kernel:

Bash

sudo tar --zstd -xvf kata-static-kernel-nvidia-gpu.tar.zst -C /
cp kata-rootfs.image /opt/kata/share/kata-containers/<your-image-name>.image

Use a custom location¶

To leave the shipped artifacts untouched, extract into a side directory and point a custom runtime config at it:

Bash

sudo mkdir -p /opt/kata/share/kata-containers/custom/
sudo tar --zstd -xvf kata-static-kernel-nvidia-gpu.tar.zst \
  -C /opt/kata/share/kata-containers/custom/

TOML

[hypervisor.qemu]
image  = "/opt/kata/share/kata-containers/<your-image-name>.image"
kernel = "/opt/kata/share/kata-containers/custom/opt/kata/share/kata-containers/vmlinux-nvidia-gpu.container"
# Disable dm-verity so rebuilt test images boot (see Troubleshooting).
kernel_verity_params = ""

Important:

The kernel and rootfs must come from the same build. The rootfs ships the kernel modules under /lib/modules/<version>, which must match the running kernel's version and, if enforcement is on, its signing key. Deploy them as a pair.

After deploying, recreate the pod. The kernel, image, and kernel_verity_params are read at sandbox creation, so a running sandbox keeps the old values.

Troubleshooting¶

The rootfs image is empty¶

Symptom. The guest panics early:

Text Only

VFS: Unable to mount root fs on unknown-block(...)
erofs (device ...): cannot read erofs superblock

and the firmware falls back to PXE. A hexdump shows a valid partition table but zeros where the filesystem should be.

Cause. image_builder.sh writes the filesystem into the image through a loop partition device (dd ... of=${device}p1). In nested or unprivileged containers that node does not map to the backing file, so the write silently lands nowhere and its exit status is not checked. You get a valid, even dm-verity-signed, image over an empty partition, and the build reports success.

Diagnose. A healthy EROFS image has magic e2 e1 f5 e0 at offset 0x100400; an empty one is all zeros from 0x200 to EOF. A kata-static-rootfs-image-*.tar.zst of only tens of KB (566 MB of zeros compresses to almost nothing) is another giveaway.

Fix. Make the write independent of loop partition devices:

Build where loop partitions work, such as a --privileged container with -v /dev:/dev, or directly on a host; or
Use a loopless image build that writes the filesystem into the backing file at the partition offset (dd if=<fsimage> of=<image> bs=1M seek=<rootfs_start> conv=notrunc,fsync) and runs veritysetup format on plain files instead of partition devices.

dm-verity panic: "metadata block 0 is corrupted"¶

Symptom.

Text Only

device-mapper: verity: ...: metadata block 0 is corrupted
erofs (device dm-0): cannot read erofs superblock

Cause. The root hash on the kernel command line does not match the hash tree in the image. The runtime reads kernel_verity_params from configuration-*.toml, not from root_hash_<variant>.txt at boot. That value is baked into the config at shim build time, so rebuilding the image without rebuilding the config leaves them mismatched. Each rebuild also uses a new random salt, so the hash changes every time.

Fix.

Disable verity for testing: set kernel_verity_params = "" (empty string and removing the key are equivalent). The runtime then boots root=/dev/vda1 and ignores the hash partition.
Or keep it in sync: rebuild the image and shim/config together and deploy them as a set, so kernel_verity_params equals the build's root_hash_<variant>.txt.

Kernel panic: "Loading of module with unavailable key is rejected"¶

Symptom. The rootfs mounts and init runs, then:

Text Only

Loading of module with unavailable key is rejected
panic: /sbin/modprobe failed with status: exit status: 1

Cause. Module signature enforcement. Built with KBUILD_SIGN_PIN set, the kernel gets CONFIG_MODULE_SIG_FORCE=y and signs modules with a key derived from that pin, and CONFIG_SYSTEM_TRUSTED_KEYS is empty, so it trusts only its own key. "unavailable key" means the module is signed with a key the running kernel does not trust, i.e. the kernel and the in-rootfs modules came from different builds. This often comes from USE_CACHE=yes pulling a CI kernel while the modules were signed by another build.

Fix.

Build the kernel and rootfs together with USE_CACHE=no so they share one key, and deploy both.
Or build without enforcement (simplest for testing): a kernel built without KBUILD_SIGN_PIN has no MODULE_SIG_FORCE and loads modules regardless of key. Verify with:

Bash

tar --zstd -xO -f kata-static-kernel-nvidia-gpu.tar.zst \
  ./opt/kata/share/kata-containers/config-*-nvidia-gpu | grep MODULE_SIG_FORCE
# want: # CONFIG_MODULE_SIG_FORCE is not set

Note

Even with signing off, the kernel and modules must share the same version (vermagic, e.g. 6.18.28-nvidia-gpu) or modprobe fails. Same-build artifacts guarantee this.

Boots straight to UEFI/PXE with no kernel output¶

Symptom. The guest produces no kernel log lines at all and the firmware loops on boot-device selection and PXE:

Text Only

BdsDxe: failed to load Boot0002 "UEFI Misc Device" ...: Not Found
>>Start PXE over IPv4.
BdsDxe: No bootable option or device was found.

Cause. kernel = points at vmlinux (the raw ELF) instead of vmlinuz (the bzImage). Variants that boot through OVMF firmware (firmware = .../OVMF.fd in the config, -bios .../OVMF.fd in the qemu line) require a vmlinuz bzImage; OVMF cannot boot a raw vmlinux ELF. QEMU loads -kernel into fw_cfg without validating it, so the VM starts, OVMF fails to boot the kernel, and falls through to disk/PXE. The default GPU config uses vmlinuz-nvidia-gpu.container for this reason; a custom override that sets vmlinux instead hits this.

Fix. Point kernel = at the vmlinuz symlink:

TOML

kernel = "/opt/kata/share/kata-containers/vmlinuz-nvidia-gpu.container"

The kernel tarball ships both vmlinux-* and vmlinuz-* plus their .container symlinks, so this is a config-only change. Confirm the qemu command line then shows -kernel .../vmlinuz-....

Verifying an image is populated¶

Bash

python3 - /path/to/your.image <<'EOF'
import sys
f = open(sys.argv[1], 'rb'); f.seek(0x100400); d = f.read(16)
print("0x100400:", d.hex(' '))
print("EROFS magic present:", d[:4] == bytes.fromhex('e2e1f5e0'))
EOF

EROFS magic present: True means the filesystem is really in the image.