🐳 Advanced Shell in Containers

🧠 Overview

Shell scripts inside containers behave differently than on a normal Linux host. Why? Because containers change:

PID hierarchy
signal delivery
process groups
environment initialization
filesystem layout
entrypoint semantics
zombie reaping
logging and stdout/stderr behavior

This module explains how to write robust, production‑grade shell scripts that run correctly inside Docker, Kubernetes, and containerized CI/CD environments.

🎓 Who this is for

DevOps/SRE building container images or entrypoints.
Engineers writing startup scripts, health checks, or lifecycle hooks.
Anyone debugging signal handling, zombie processes, or shutdown issues.
People deploying applications in Kubernetes, Nomad, ECS, or Docker Swarm.

🧩 Internals / Mechanics

🧩 PID 1 is special

Inside a container, the entrypoint becomes PID 1.

PID 1 has unique semantics:

ignores some signals by default
does not automatically reap zombie processes
does not forward signals unless explicitly implemented
is responsible for clean shutdown

This is the root cause of many container bugs.

🧩 Shell as PID 1 is dangerous

If your entrypoint is:

CMD ["sh", "-c", "run.sh"]

Then sh becomes PID 1 and:

does not reap children
may ignore SIGTERM
may not forward signals to the app
may leave zombies
may cause slow or broken shutdowns

🧩 Exec vs non‑exec entrypoints

Bad:

app "$@"

Good:

exec app "$@"

exec replaces the shell with the application → no extra shell process → correct signal handling.

🧩 Environment initialization

Containers often run with:

missing environment variables
empty PATH
no user dotfiles
minimal locale settings

Scripts must validate everything.

🔧 Techniques

🔧 Always use `exec` in entrypoints

#!/bin/sh
set -e
exec app "$@"

🔧 Use a minimal init system

Recommended:

tini
dumb-init

Example:

ENTRYPOINT ["/usr/bin/tini", "--"]
CMD ["run.sh"]

🔧 Validate environment variables before use

: "${CONFIG_PATH:?CONFIG_PATH must be set}"

🔧 Use predictable logging

log() { printf '[%s] %s\n' "$(date +%H:%M:%S)" "$*" >&2; }

🔧 Use `trap` for graceful shutdown

trap 'cleanup; exit 0' SIGTERM SIGINT

⚠️ Pitfalls

⚠️ Shell as PID 1 not reaping zombies

cmd &
# no wait → zombie

⚠️ SIGTERM not stopping the app

Kubernetes sends SIGTERM → shell ignores it → pod hangs.

⚠️ Using `sleep infinity` in entrypoints

Prevents proper shutdown.

⚠️ Relying on interactive features

Containers do not load:

.bashrc
.profile
.zshrc

⚠️ Using `tail -f` as the main process

Prevents signal propagation.

🚨 Real‑World Failures

🚨 Failure: Kubernetes pod refuses to terminate

Entrypoint:

#!/bin/sh
app &
wait

SIGTERM sent → shell ignores → pod stuck in Terminating.

Fix:

trap 'kill 0; exit 0' SIGTERM
exec app

🚨 Failure: Zombie processes accumulate in container

Long‑running script spawns children but never reaps them.

Fix:

Use tini or implement:

trap 'wait' CHLD

🚨 Failure: Application never receives SIGTERM

Entrypoint:

app "$@"

Shell remains PID 1 → app is child → SIGTERM goes to shell, not app.

Fix:

exec app "$@"

🛠️ Patterns

🛠️ Pattern: Use `exec` to replace the shell

Ensures correct signal handling.

🛠️ Pattern: Use a real init system

tini solves:

zombie reaping
signal forwarding
predictable shutdown

🛠️ Pattern: Validate environment early

Fail fast if required variables are missing.

🛠️ Pattern: Use health checks that do not fork excessively

Avoid heavy loops.

❌ Anti‑Patterns

❌ Anti‑pattern: Shell as a long‑running supervisor

Shell is not systemd.

❌ Anti‑pattern: Using `sleep infinity`

❌ Anti‑pattern: Using `tail -f` as PID 1

❌ Anti‑pattern: Ignoring SIGTERM

❌ Anti‑pattern: Running background jobs without cleanup

🔍 Debugging

🔍 Inspect PID tree inside container

1	`ps -o pid,ppid,stat,cmd`

🔍 Debug signals

strace -f -e trace=signal -p 1

🔍 Debug FD leaks

1	`ls -l /proc/1/fd`

🔍 Debug entrypoint behavior

Add:

set -x
echo "PID=$$"

⚙️ Performance

⚙️ Avoid heavy loops in entrypoints

Use compiled tools for heavy work.

⚙️ Avoid unnecessary forks

Use builtins where possible.

⚙️ Use streaming tools for large data

awk, sed, jq are optimized for streaming.

🧵 Process Control

🧵 PID 1 must forward signals

Otherwise:

Kubernetes cannot stop pods
Docker cannot stop containers
CI jobs hang

🧵 PID 1 must reap children

Otherwise zombies accumulate.

🛰️ CI/CD

🛰️ Containers in CI must fail fast

Use:

set -euo pipefail

🛰️ Validate environment before running commands

🛰️ Avoid background jobs unless necessary

🧠 Summary

Shell scripts inside containers must account for:

PID 1 semantics
signal forwarding
zombie reaping
environment validation
deterministic startup and shutdown
predictable logging
safe entrypoint design

Mastering these techniques ensures reliable, production‑grade container behavior.

🐳 Advanced Shell in Containers

🧠 Overview

🎓 Who this is for

🧩 Internals / Mechanics

🧩 PID 1 is special

🧩 Shell as PID 1 is dangerous

🧩 Exec vs non‑exec entrypoints

🧩 Environment initialization

🔧 Techniques

🔧 Always use exec in entrypoints

🔧 Use a minimal init system

🔧 Validate environment variables before use

🔧 Use predictable logging

🔧 Use trap for graceful shutdown

⚠️ Pitfalls

⚠️ Shell as PID 1 not reaping zombies

⚠️ SIGTERM not stopping the app

⚠️ Using sleep infinity in entrypoints

⚠️ Relying on interactive features

⚠️ Using tail -f as the main process

🚨 Real‑World Failures

🚨 Failure: Kubernetes pod refuses to terminate

🚨 Failure: Zombie processes accumulate in container

🚨 Failure: Application never receives SIGTERM

🛠️ Patterns

🛠️ Pattern: Use exec to replace the shell

🛠️ Pattern: Use a real init system

🛠️ Pattern: Validate environment early

🛠️ Pattern: Use health checks that do not fork excessively

❌ Anti‑Patterns

❌ Anti‑pattern: Shell as a long‑running supervisor

❌ Anti‑pattern: Using sleep infinity

❌ Anti‑pattern: Using tail -f as PID 1

❌ Anti‑pattern: Ignoring SIGTERM

❌ Anti‑pattern: Running background jobs without cleanup

🔍 Debugging

🔍 Inspect PID tree inside container

🔍 Debug signals

🔍 Debug FD leaks

🔍 Debug entrypoint behavior

⚙️ Performance

⚙️ Avoid heavy loops in entrypoints

⚙️ Avoid unnecessary forks

⚙️ Use streaming tools for large data

🧵 Process Control

🧵 PID 1 must forward signals

🧵 PID 1 must reap children

🛰️ CI/CD

🛰️ Containers in CI must fail fast

🛰️ Validate environment before running commands

🛰️ Avoid background jobs unless necessary

🧠 Summary

🔧 Always use `exec` in entrypoints

🔧 Use `trap` for graceful shutdown

⚠️ Using `sleep infinity` in entrypoints

⚠️ Using `tail -f` as the main process

🛠️ Pattern: Use `exec` to replace the shell

❌ Anti‑pattern: Using `sleep infinity`

❌ Anti‑pattern: Using `tail -f` as PID 1