rootlessutil: remove dead -r/ from nsenter args by MayCXC · Pull Request #4837 · containerd/nerdctl

MayCXC · 2026-04-09T02:21:55Z

Summary

The -r/ flag in ParentMain's nsenter args has been a no-op since it was added. It was placed at args[0], which becomes argv[0] (the program name) when passed to syscall.Exec(arg0, args, env). nsenter consumes it as its own name and never parses it as a flag.
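The argv[0] point can be illustrated with a minimal Go sketch. This is not the nerdctl source: `optionsSeen` is a hypothetical helper that models the usual convention that a program treats argv[0] as its own name and parses options only from argv[1] onward. `syscall.Exec(argv0, argv, envv)` hands `argv` to the new process verbatim, so whatever sits at index 0 of the slice is never parsed as a flag.

```go
package main

import "fmt"

// optionsSeen is a hypothetical helper: it returns the part of argv that a
// conventional program would parse as options, i.e. everything after argv[0].
func optionsSeen(argv []string) []string {
	if len(argv) == 0 {
		return nil
	}
	return argv[1:]
}

func main() {
	// The three argv layouts discussed in this PR (flag lists shortened).
	layouts := []struct {
		name string
		argv []string
	}{
		{"before", []string{"-r/", "-w/home/user", "-m", "-U"}},
		{"latent", []string{"/usr/bin/nsenter", "-r/", "-w/home/user", "-m", "-U"}},
		{"after", []string{"/usr/bin/nsenter", "-w/home/user", "-m", "-U"}},
	}
	for _, l := range layouts {
		fmt.Printf("%-7s argv[0]=%q options=%q\n", l.name, l.argv[0], optionsSeen(l.argv))
	}
}
```

Only the "latent" layout exposes -r/ to option parsing; in the "before" layout it is hidden at argv[0].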

This is harmless today, but if someone corrects the argv ordering (e.g. prepending arg0 to the slice), -r/ would start working and break rootless container creation:

1. nsenter opens the root fd (/) before setns.
2. After entering the mount namespace, fchdir(root_fd) + chroot(".") anchors the process to the host root.
3. In rootless mode, the host's /var/lib/containerd is owned by real root (uid 0), which is unmapped in the user namespace (appears as nobody/65534).
4. Overlay mount lowerdir resolution fails with EACCES because the process cannot traverse the 0700 host directory.

This also fixes argv[0] to be arg0 (the nsenter binary path), matching the standard convention.

Verification

Strace comparison before and after, running nerdctl create in rootless mode:

Before (current code, -r/ accidentally in argv[0]):

execve("/usr/bin/nsenter", ["-r/", "-w/home/user", "--preserve-credentials", "-m", "-U", ...], ...)
setns(CLONE_NEWUSER) = 0
setns(CLONE_NEWNS) = 0
fchdir(3) = 0          # -r/ consumed as argv[0], no chroot
execve("nerdctl", ...)
mount("overlay", ...) = 0   # works because no chroot happened

If -r/ were at argv[1] (the latent bug):

setns(CLONE_NEWUSER) = 0
setns(CLONE_NEWNS) = 0
fchdir(3) = 0
chroot(".") = 0       # anchors to host root
fchdir(4) = 0
execve("nerdctl", ...)
mount("overlay", ...) = -1 EACCES   # host /var/lib/containerd inaccessible

After (this PR, -r/ removed, arg0 as argv[0]):

execve("/usr/bin/nsenter", ["/usr/bin/nsenter", "-w/home/user", "--preserve-credentials", "-m", "-U", ...], ...)
setns(CLONE_NEWUSER) = 0
setns(CLONE_NEWNS) = 0
execve("nerdctl", ...)
mount("overlay", ...) = 0   # paths resolve through mount namespace

utafrali

The fix is correct: -r/ was sitting at argv[0] and was consumed by nsenter as its own program name, never parsed as a flag, so its removal is a no-op today and a safety improvement if argv ordering is ever corrected. Setting args[0] = arg0 now properly follows Unix convention. The only minor gap is the loss of the busybox nsenter compatibility comment without any replacement explanation.

utafrali · 2026-04-09T02:24:07Z

-	args := []string{
-		"-r/", // root dir (busybox nsenter wants this to be explicitly specified),
-	}
+	args := []string{arg0}


The removed comment mentioned busybox nsenter compatibility with -r/. Now that the flag is intentionally absent, a short note here explaining why it is omitted would help future readers avoid re-adding it. For example:

// Note: -r/ (root dir) is intentionally omitted. In rootless mode, chrooting to
// the host root before setns would anchor the process to host paths that are
// inaccessible inside the user namespace, breaking overlay mounts.
args := []string{arg0}

@MayCXC Could you take a look at the comment above?

Added in fc77158. The comment explains why -r/ is intentionally absent:

// -r/ (root dir) is intentionally omitted. nsenter would open the host
// root fd before setns, then chroot to it after entering the mount
// namespace, anchoring the process to host paths. In rootless mode,
// host dirs owned by real uid 0 (e.g. /var/lib/containerd) are
// inaccessible inside the user namespace and overlay mounts would
// fail with EACCES.
args := []string{arg0}

The wording "opens root fd before setns, chroots to it after" matches what nsenter actually does: the fd is captured in the host mount namespace, and the chroot happens only after setns(CLONE_NEWNS) has entered the rootless mount namespace, which is why the anchor is wrong in rootless mode. The comment names EACCES directly, since that is the failure a future reader would see in strace after re-adding the flag.
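A code comment can be overlooked, so one possible complement is a small guard that a unit test could call on the assembled argument slice. The sketch below is hypothetical and not part of this PR: `checkNoRootFlag` and `errRootFlag` are invented names, and the check is a simplification that scans every element up to a literal `--` without modelling where nsenter's own options end and the wrapped command begins. It relies only on the fact that nsenter spells the option as `-r[dir]` or `--root[=dir]`.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// errRootFlag is a hypothetical sentinel error for this sketch.
var errRootFlag = errors.New("nsenter args must not contain -r/--root in rootless mode")

// checkNoRootFlag is a hypothetical guard: it rejects an argv slice in which
// nsenter's root-directory option appears at a position where it would be
// parsed (index 1 or later). args[0] is skipped because it is argv[0].
func checkNoRootFlag(args []string) error {
	if len(args) == 0 {
		return errors.New("args must start with argv[0]")
	}
	for _, a := range args[1:] {
		if a == "--" {
			break // simplification: treat everything after "--" as the command
		}
		if strings.HasPrefix(a, "-r") || a == "--root" || strings.HasPrefix(a, "--root=") {
			return fmt.Errorf("%w: found %q", errRootFlag, a)
		}
	}
	return nil
}

func main() {
	ok := []string{"/usr/bin/nsenter", "-w/home/user", "--preserve-credentials", "-m", "-U"}
	bad := []string{"/usr/bin/nsenter", "-r/", "-w/home/user", "-m", "-U"}
	fmt.Println(checkNoRootFlag(ok))
	fmt.Println(checkNoRootFlag(bad))
}
```

Because the guard ignores index 0, it would also have accepted the old layout, where -r/ sat at argv[0] and had no effect.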

The -r/ flag was placed at args[0], which becomes argv[0] (the program name) when passed to syscall.Exec. nsenter never parsed it as a flag, so it has been a no-op since it was added.

If -r/ were moved to a proper argv position, it would break rootless container creation. nsenter opens the root fd before setns, so chroot anchors path resolution to the host root. In rootless mode, the host /var/lib/containerd is owned by real root (unmapped in the user namespace), causing overlay lowerdir resolution to fail with EACCES during WithAdditionalGIDs.

Remove -r/ entirely rather than fixing its position.

Signed-off-by: Aaron <aaron@omniband.ca>

utafrali approved these changes Apr 9, 2026


AkihiroSuda added this to the v2.3.0 milestone Apr 9, 2026

MayCXC force-pushed the fix/rootless-nsenter-argv0 branch from f7fc48a to fc77158 on April 18, 2026 16:56

MayCXC requested a review from AkihiroSuda April 18, 2026 16:58

MayCXC wants to merge 1 commit into containerd:main from MayCXC:fix/rootless-nsenter-argv0
