
Mastering Docker Multi-Stage Builds for Production-Ready Images
What This Guide Covers (And Why Multi-Stage Builds Matter)
Docker multi-stage builds let you create lean, secure production images by building your application in one environment and shipping only what's necessary in another. If you've ever stared at a 2GB image for a simple Node.js app—or worried about shipping build tools and source code to production—this technique will change how you containerize software. You'll learn how to slash image sizes, reduce attack surfaces, and speed up deployments without adding complexity to your workflow.
What Is a Docker Multi-Stage Build?
A multi-stage build is a Dockerfile feature that uses multiple FROM statements—each starting a new build stage with its own base image—to separate the build environment from the final runtime environment.
Here's the thing: traditional Dockerfiles bundle everything together. Your compiler, test dependencies, dev tools, and source code all ship alongside your actual application. It's like buying a house and getting the construction crew's equipment thrown in for free—except nothing about that scenario is actually free. Larger images mean slower pulls, higher storage costs, and more components that could harbor vulnerabilities.
Multi-stage builds solve this by letting you compile, test, and package in a "builder" stage (which can be as bloated as needed), then copy only the compiled artifacts into a minimal "runtime" stage. The builder gets discarded. What ships to production is clean, small, and purpose-built.
A basic example makes this clearer:
```dockerfile
# Builder stage
FROM node:20-alpine AS builder
WORKDIR /app
COPY package*.json ./
RUN npm ci
COPY . .
RUN npm run build
# Drop devDependencies so only production packages ship
RUN npm prune --omit=dev

# Runtime stage
FROM node:20-alpine AS runtime
WORKDIR /app
COPY --from=builder /app/dist ./dist
COPY --from=builder /app/node_modules ./node_modules
COPY package.json ./
EXPOSE 3000
CMD ["node", "dist/main.js"]
```
The `--from=builder` syntax copies files across stages. The final image contains no TypeScript compiler, no devDependencies, no test files—just the built application and production dependencies.
How Much Smaller Can Multi-Stage Builds Make Your Images?
Significantly smaller—often 80-95% reductions depending on your stack. A typical React application built with create-react-app might produce a 1.2GB image using a single-stage Node.js base. The same application in a multi-stage build—using Nginx Alpine to serve static files—drops to roughly 25MB.
Worth noting: the savings compound. Smaller images pull faster during deployments, start quicker on container orchestrators like Kubernetes, and consume less registry storage. For teams running hundreds of deployments daily, this translates to measurable cost and time savings.
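The React case above can be sketched roughly as follows—a minimal two-stage Dockerfile, assuming a standard create-react-app layout that emits static files to build/ (image tags and paths are illustrative):

```dockerfile
# Builder: full Node toolchain, discarded after the build
FROM node:20-alpine AS builder
WORKDIR /app
COPY package*.json ./
RUN npm ci
COPY . .
RUN npm run build

# Runtime: Nginx serving only the static output
FROM nginx:1.25-alpine AS runtime
COPY --from=builder /app/build /usr/share/nginx/html
EXPOSE 80
```

The final image carries Nginx and the static bundle—no Node.js, no node_modules—which is where the drop from over a gigabyte to tens of megabytes comes from.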
Here's a real comparison using a Go microservice:
| Approach | Base Image | Final Size | Build Tools in Final Image? |
|---|---|---|---|
| Single-stage (Golang) | golang:1.21 | ~850MB | Yes |
| Multi-stage (Scratch) | scratch | ~12MB | No |
| Multi-stage (Distroless) | gcr.io/distroless/static | ~18MB | No |
Distroless images—maintained by Google—contain only your application and its runtime dependencies. No shell, no package manager, no extra attack surface. They're the gold standard for Go, Python, Java, and Node.js applications in production.
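A sketch of the scratch row in the table might look like this—assuming a pure-Go service with no cgo dependencies (module and paths are illustrative):

```dockerfile
# Builder: full Go toolchain (the ~850MB base)
FROM golang:1.21 AS builder
WORKDIR /app
COPY go.mod go.sum ./
RUN go mod download
COPY . .
# Statically linked binary so it can run on scratch (no libc)
RUN CGO_ENABLED=0 go build -o /server .

# Runtime: completely empty image, just the binary
FROM scratch
COPY --from=builder /server /server
ENTRYPOINT ["/server"]
```

Swapping `FROM scratch` for `gcr.io/distroless/static` in the runtime stage gives the distroless row: a few megabytes more, but with CA certificates and a proper /etc/passwd included.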
When Should You Use Multi-Stage Builds?
Use them for every production container—full stop. The exceptions (local development scratchpads, internal tooling that never leaves your laptop) are so rare they barely warrant mentioning.
That said, certain scenarios make multi-stage builds especially valuable:
- Compiled languages: Go, Rust, Java, and C++ benefit most dramatically from this approach. Build toolchains are massive. Nobody needs the Maven repository cached in a production JVM container.
- Frontend applications: Your build process needs Node.js. Your static files don't. Ship them via Nginx, Caddy, or a CDN instead.
- Machine learning models: Training frameworks like PyTorch or TensorFlow are gigabytes. Inference runtimes are megabytes. Separate them.
- Security-sensitive workloads: Compliance requirements (SOC 2, ISO 27001) often mandate minimal attack surfaces. Multi-stage builds help you meet those requirements without heroic efforts.
The catch? Multi-stage builds add slight complexity to your Dockerfile. You'll need to understand layer caching—Docker builds each stage independently, and stages can run in parallel if they don't depend on each other. Get the stage order wrong, and you'll rebuild the world every time.
Practical Patterns for Production
The Builder-Runtime Split
This is your bread-and-butter pattern. One stage builds, one stage runs. The builder uses a full-featured image (Ubuntu, Debian, language-specific official images). The runtime uses Alpine, Distroless, or scratch.
For Python applications, this often means:
- Builder: `python:3.11` with gcc, libffi-dev, and build tools installed
- Runtime: `python:3.11-slim` or `python:3.11-alpine`
Install build dependencies in the builder, compile wheels, then copy them to the runtime stage. The slim variants work for most applications. Alpine works if you don't need glibc compatibility (some Python packages with C extensions struggle here).
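One hedged sketch of that wheel-building flow, assuming a requirements.txt and a main.py entry point (both names are illustrative):

```dockerfile
# Builder: full image with compilers for C extensions
FROM python:3.11 AS builder
WORKDIR /app
COPY requirements.txt .
# Pre-build wheels so the runtime stage never needs gcc
RUN pip wheel --no-cache-dir --wheel-dir /wheels -r requirements.txt

# Runtime: slim image installs the prebuilt wheels offline
FROM python:3.11-slim AS runtime
WORKDIR /app
COPY --from=builder /wheels /wheels
RUN pip install --no-cache-dir --no-index --find-links=/wheels /wheels/* \
    && rm -rf /wheels
COPY . .
CMD ["python", "main.py"]
```

The `--no-index` flag forces pip to install only from the copied wheels, so the runtime stage needs neither compilers nor network access to the package index.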
The Test-Integrated Build
Running tests inside your build ensures what ships passed validation. Add a dedicated test stage between builder and runtime:
```dockerfile
FROM node:20-alpine AS builder
# ... build steps ...

FROM builder AS test
RUN npm run test

FROM node:20-alpine AS runtime
COPY --from=builder /app/dist ./dist
# ... runtime setup ...
```
Build with `docker build --target test` to run tests. Build with `docker build --target runtime` (or omit `--target` to use the last stage) to skip tests and produce your production image. CI pipelines can explicitly test then build, ensuring no untested code reaches production.
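In a CI pipeline, that two-step flow might look like this (stage names match the snippet above; the image name and tag are illustrative):

```shell
# Step 1: build up to the test stage; the build fails if tests fail
docker build --target test -t myapp:test .

# Step 2: build the production image; the test stage is not rebuilt
docker build --target runtime -t myapp:1.0.0 .
```

Because BuildKit only builds the stages a target actually depends on, the runtime build skips the test stage entirely while reusing the cached builder layers from step 1.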
The Dependency Cache Optimization
Docker layer caching works per-stage. Structure your Dockerfile so dependency installation happens early—before code that changes frequently. This prevents reinstalling node_modules or pip packages on every source edit.
Here's the thing about cache mounts: modern Docker supports RUN --mount=type=cache for even faster builds. Dependency directories (like npm's cache or Go's module cache) persist between builds without bloating your final image.
```dockerfile
FROM golang:1.21 AS builder
WORKDIR /app
COPY go.mod go.sum ./
RUN --mount=type=cache,target=/go/pkg/mod \
    go mod download
COPY . .
RUN go build -o app
```
This pattern requires BuildKit (enabled by default in Docker 23.0+). The module cache speeds builds without shipping to production.
Common Pitfalls (And How to Avoid Them)
Breaking layer caching with incorrect ordering. Copy your dependency manifests first, install, then copy source code. Reverse this order and every code change invalidates the dependency layer—slowing builds unnecessarily.
Using latest tags. Pin your base images to specific versions (`node:20.5.0-alpine`, not `node:alpine`). Reproducible builds matter. What works today shouldn't mysteriously break tomorrow because `latest` shifted.
Leaving secrets in intermediate layers. If you copy an .env file or private key into a builder stage, it exists in that layer—even if you don't copy it to the final stage. Use BuildKit secrets (--mount=type=secret) for sensitive data, or multi-stage builds where secrets never touch the filesystem.
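A minimal sketch of the BuildKit secret approach—here an authentication token for a private package index (the secret id, file name, and index URL are all illustrative assumptions):

```dockerfile
# syntax=docker/dockerfile:1
FROM python:3.11-slim AS builder
WORKDIR /app
COPY requirements.txt .
# The secret is mounted at /run/secrets/pip_token for this RUN
# step only; it is never written into any image layer
RUN --mount=type=secret,id=pip_token \
    PIP_INDEX_URL="https://user:$(cat /run/secrets/pip_token)@pypi.example.com/simple" \
    pip install --no-cache-dir -r requirements.txt
```

Build it with something like `docker build --secret id=pip_token,src=./pip_token.txt .`; unlike a COPY'd credential file, the token leaves no trace in the image history.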
Assuming Alpine is always smallest. Alpine uses musl libc instead of glibc. Some applications (notably certain Python data science libraries) perform poorly—or fail entirely. Test thoroughly. Distroless images (based on Debian) often provide better compatibility with marginally larger sizes.
> "Multi-stage builds aren't optimization—they're hygiene. Shipping build tools to production is like leaving your house keys in the front door. It might work fine for years. But why risk it?" — Docker Best Practices Guide
Putting It All Together: A Production-Ready Example
Here's a complete, real-world Dockerfile for a Python FastAPI application deployed on AWS ECS:
```dockerfile
# syntax=docker/dockerfile:1
FROM python:3.11-slim AS builder
WORKDIR /app
RUN apt-get update \
    && apt-get install -y --no-install-recommends gcc \
    && rm -rf /var/lib/apt/lists/*
COPY requirements.txt .
RUN pip install --user --no-cache-dir -r requirements.txt

# Runtime stage
FROM python:3.11-slim AS runtime
# Create non-root user for security
RUN groupadd -r appuser && useradd -r -g appuser appuser
WORKDIR /app
# Copy only installed packages from builder
COPY --from=builder /root/.local /home/appuser/.local
COPY --chown=appuser:appuser ./app ./app
# Ensure scripts in .local are usable
ENV PATH=/home/appuser/.local/bin:$PATH
ENV PYTHONDONTWRITEBYTECODE=1
ENV PYTHONUNBUFFERED=1
USER appuser
EXPOSE 8000
CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8000"]
```
This Dockerfile achieves several goals simultaneously:
- Minimal final image (no gcc, no build headers)
- Non-root execution (security best practice)
- No bytecode caching in the container
- Explicit, pinned Python version for reproducibility
Deploy this to Amazon Elastic Container Service with AWS Fargate, and you'll pay for exactly the compute you need—no excess bloat slowing cold starts or inflating storage bills. Container registries like Amazon ECR charge by the gigabyte; smaller images directly reduce infrastructure costs.
The transformation from a naive single-stage approach to this pattern takes maybe thirty minutes. The ongoing benefits—faster deployments, lower costs, reduced security exposure—compound indefinitely. Start using multi-stage builds today. Your future self (and your ops team) will appreciate the discipline.
