OCI-Extract

A CLI tool for extracting specific files from OCI/Docker images without mounting them or requiring root privileges.

Features

No Root Required: Extract files without privileged access or a container runtime
Efficient: Uses HTTP Range requests to fetch only necessary bytes
Format Support: Automatically detects and handles multiple image formats:
- Standard OCI/Docker layers (gzip)
- eStargz (seekable tar.gz with Table of Contents)
- SOCI (Seekable OCI with zTOC indices)
- zstd (zstandard compression)
- zstd:chunked (seekable zstd with TOC)
Remote-First: Works directly with remote registries without pulling entire images
File Listing: List all files in an image without downloading it

Installation

Download Binary

Download the latest release for your platform from the GitHub releases page.

Using mise

mise use --global github:amartani/oci-extract

Using Go Install

go install github.com/amartani/oci-extract@latest

Usage

Basic Extraction

Extract a single file from an image:

oci-extract extract alpine:latest /bin/sh -o ./sh

Extract Configuration Files

oci-extract extract nginx:latest /etc/nginx/nginx.conf -o ./nginx.conf

Verbose Output

See detailed information about the extraction process:

oci-extract extract ubuntu:latest /etc/passwd -o ./passwd --verbose

Force Specific Format

If you know the image format, you can skip auto-detection:

oci-extract extract myimage:latest /app/config.json --format estargz -o ./config.json

Extract from Private Registries

The tool uses Docker's credential helper by default:

# Authenticate with your registry first
docker login registry.example.com

# Then extract
oci-extract extract registry.example.com/myapp:v1.0 /app/binary -o ./binary
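
To illustrate what a Docker-style credential lookup involves, the sketch below resolves a registry against the contents of a `config.json`. The lookup order (per-registry `credHelpers`, then a global `credsStore`, then inline `auths`) is an assumption based on the usual Docker config layout, not a description of oci-extract's internals, and `resolve_credentials` is a hypothetical name.

```python
import base64
import json


def resolve_credentials(config_text: str, registry: str) -> dict:
    """Describe where credentials for `registry` would come from.

    Hypothetical helper that mirrors the common Docker config.json layout.
    """
    config = json.loads(config_text)

    # 1. A per-registry credential helper takes precedence.
    helper = config.get("credHelpers", {}).get(registry)
    if helper:
        return {"source": "helper", "program": f"docker-credential-{helper}"}

    # 2. Otherwise use the global credential store, if one is configured.
    store = config.get("credsStore")
    if store:
        return {"source": "helper", "program": f"docker-credential-{store}"}

    # 3. Finally, look for an inline base64("user:password") entry.
    entry = config.get("auths", {}).get(registry, {})
    if "auth" in entry:
        decoded = base64.b64decode(entry["auth"]).decode()
        user, _, password = decoded.partition(":")
        return {"source": "inline", "username": user, "password": password}

    return {"source": "anonymous"}


# Example config with made-up credentials.
example = json.dumps({
    "auths": {
        "registry.example.com": {
            "auth": base64.b64encode(b"alice:s3cret").decode()
        }
    },
    "credHelpers": {"gcr.io": "gcloud"},
})
print(resolve_credentials(example, "registry.example.com"))
print(resolve_credentials(example, "gcr.io"))
```

Running `docker login` is what populates one of these three locations, which is why the login step comes first.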

List Files in an Image

List all files in an image without downloading it:

# List all files
oci-extract list alpine:latest

# List with verbose output
oci-extract list nginx:latest --verbose

# Force a specific format
oci-extract list myimage:latest --format estargz

How It Works

Architecture

┌─────────────┐
│   CLI Tool  │
└──────┬──────┘
       │
       ├──────────────┐
       │              │
┌──────▼──────┐  ┌───▼────────┐
│  Registry   │  │  Format    │
│   Client    │  │  Detector  │
└──────┬──────┘  └───┬────────┘
       │             │
       │      ┌──────▼──────┐
       │      │ Orchestrator│
       │      └──────┬──────┘
       │             │
       │     ┌───────┼───────┬────────┐
       │     │       │       │        │
    ┌──▼─────▼─┐  ┌─▼────┐  ┌▼─────┐ │
    │ eStargz  │  │ SOCI │  │ zstd │ │
    │Extractor │  │ Extr.│  │ Extr.│ │
    └──────────┘  └──────┘  └──────┘ │
                                   ┌──▼─────┐
                                   │  Std   │
                                   │ Extr.  │
                                   └────────┘
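
The detector and orchestrator boxes in the diagram can be pictured as a small classification step over each layer descriptor in the manifest. The sketch below is illustrative only: the annotation keys and media-type suffixes are assumptions taken from the eStargz, zstd:chunked, and OCI image specifications, the returned labels are made up for this example, and `detect_layer_format` is a hypothetical name. SOCI indices live outside the layer descriptor, so their presence is passed in as a flag.

```python
# Assumed annotation keys; verify against the relevant specs before use.
ESTARGZ_TOC_ANNOTATION = "containerd.io/snapshot/stargz/toc.digest"
ZSTD_CHUNKED_ANNOTATION = "io.github.containers.zstd-chunked.manifest-checksum"


def detect_layer_format(layer: dict, has_soci_index: bool = False) -> str:
    """Classify one layer descriptor from an image manifest."""
    media_type = layer.get("mediaType", "")
    annotations = layer.get("annotations") or {}

    if media_type.endswith("zstd"):
        # A TOC annotation marks a seekable zstd layer.
        if ZSTD_CHUNKED_ANNOTATION in annotations:
            return "zstd:chunked"
        return "zstd"
    if ESTARGZ_TOC_ANNOTATION in annotations:
        return "estargz"
    if has_soci_index:
        return "soci"
    # Plain gzip layer: no seekable metadata is available.
    return "standard"


layers = [
    {"mediaType": "application/vnd.oci.image.layer.v1.tar+gzip"},
    {
        "mediaType": "application/vnd.oci.image.layer.v1.tar+gzip",
        "annotations": {ESTARGZ_TOC_ANNOTATION: "sha256:0000"},
    },
    {"mediaType": "application/vnd.oci.image.layer.v1.tar+zstd"},
]
for layer in layers:
    print(detect_layer_format(layer))
```

Checking the seekable formats first means the streaming fallback is only chosen when no cheaper path exists.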

The "No-Mount" Approach

Instead of mounting the image, oci-extract:

Authenticates with the OCI registry
Fetches the manifest to discover available layers
Detects the layer format (eStargz, SOCI, zstd:chunked, zstd, or standard gzip)
Fetches metadata (TOC/zTOC) using small HTTP Range requests
Locates the file in the metadata to find its exact byte offsets
Downloads only the required compressed chunks
Decompresses the data and writes the file to disk
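
The metadata and chunk downloads in the steps above rely on HTTP Range requests. The following is a minimal sketch of how such requests can be formed, assuming a blob URL that the caller has already resolved. The function names are hypothetical and authentication is omitted.

```python
import urllib.request


def byte_range(start: int, length: int) -> str:
    """Range header value for `length` bytes starting at offset `start`."""
    if start < 0 or length <= 0:
        raise ValueError("start must be >= 0 and length must be > 0")
    # HTTP byte ranges are inclusive on both ends.
    return f"bytes={start}-{start + length - 1}"


def suffix_range(length: int) -> str:
    """Range header value for the final `length` bytes of a resource."""
    if length <= 0:
        raise ValueError("length must be > 0")
    return f"bytes=-{length}"


def fetch_range(url: str, range_value: str) -> bytes:
    """Fetch part of a blob from `url` (a caller-supplied blob location)."""
    request = urllib.request.Request(url, headers={"Range": range_value})
    with urllib.request.urlopen(request) as response:
        # 206 Partial Content means the server honoured the range.
        if response.status != 206:
            raise RuntimeError(f"range not honoured (HTTP {response.status})")
        return response.read()


print(byte_range(4096, 1024))  # a chunk in the middle of a layer
print(suffix_range(51))        # a fixed-size footer at the end of a layer
```

A suffix range is convenient for footers because it needs no prior knowledge of the blob size.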

Format-Specific Behavior

eStargz

Reads the footer (the last 51 bytes; 47 in the legacy stargz format) to locate the TOC
Fetches the TOC to get file offsets
Downloads only the specific chunk containing the file
Decompresses on-the-fly with gzip
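
The footer lookup can be sketched as follows. This assumes the 51-byte layout described in the eStargz specification (the legacy stargz footer is 47 bytes): an empty gzip member whose Extra field holds an `SG` subfield containing the TOC offset as 16 hex digits followed by `STARGZ`. `build_footer` exists only to produce test input, and both function names are hypothetical.

```python
import struct

FOOTER_SIZE = 51  # assumed eStargz footer size


def build_footer(toc_offset: int) -> bytes:
    """Build a demonstration footer that carries `toc_offset`."""
    payload = f"{toc_offset:016x}STARGZ".encode("ascii")          # 22 bytes
    subfield = b"SG" + struct.pack("<H", len(payload)) + payload  # 26 bytes
    # Magic, deflate method, FLG=FEXTRA, zero mtime, XFL, OS=unknown.
    header = b"\x1f\x8b\x08\x04" + b"\x00" * 4 + b"\x00\xff"
    extra = struct.pack("<H", len(subfield)) + subfield
    empty_deflate = b"\x01\x00\x00\xff\xff"  # one empty stored block
    trailer = struct.pack("<II", 0, 0)       # CRC32 and size of no data
    return header + extra + empty_deflate + trailer


def parse_footer(footer: bytes) -> int:
    """Return the TOC offset stored in an eStargz-style footer."""
    if len(footer) != FOOTER_SIZE:
        raise ValueError(f"expected {FOOTER_SIZE} bytes, got {len(footer)}")
    if footer[:3] != b"\x1f\x8b\x08" or not footer[3] & 0x04:
        raise ValueError("not a gzip member with an Extra field")
    (xlen,) = struct.unpack_from("<H", footer, 10)
    extra = footer[12:12 + xlen]

    # Walk the Extra subfields: 2-byte ID, 2-byte length, then data.
    pos = 0
    while pos + 4 <= len(extra):
        sub_id = extra[pos:pos + 2]
        (sub_len,) = struct.unpack_from("<H", extra, pos + 2)
        data = extra[pos + 4:pos + 4 + sub_len]
        if sub_id == b"SG" and data.endswith(b"STARGZ"):
            return int(data[:16].decode("ascii"), 16)
        pos += 4 + sub_len
    raise ValueError("no STARGZ subfield found")


footer = build_footer(123456)
print(len(footer), parse_footer(footer))
```

Once the offset is known, the TOC itself is the byte range from that offset up to the start of the footer.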

SOCI

Queries the Referrers API or tag-based index
Downloads the zTOC (compression info) for relevant layers
Maps file paths to compressed byte ranges
Fetches and decompresses specific ranges
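
The two index lookup paths can be illustrated by the URLs and tags involved. This sketch follows the OCI Distribution specification as I understand it: a `/v2/<name>/referrers/<digest>` endpoint, with a fallback tag formed from the digest when the endpoint is unavailable. The function names are hypothetical, and which artifact type the tool filters on is not stated in this README.

```python
from urllib.parse import urlencode


def referrers_url(registry: str, repository: str, digest: str,
                  artifact_type: str = "") -> str:
    """URL of the Referrers API listing for an image digest."""
    url = f"https://{registry}/v2/{repository}/referrers/{digest}"
    if artifact_type:
        # Optional server-side filter on the referrer's artifact type.
        url += "?" + urlencode({"artifactType": artifact_type})
    return url


def fallback_tag(digest: str) -> str:
    """Tag used to find referrers when the Referrers API is unsupported."""
    algorithm, sep, encoded = digest.partition(":")
    if not sep or not algorithm or not encoded:
        raise ValueError(f"malformed digest: {digest!r}")
    # The spec truncates the algorithm to 32 and the hash to 64 characters.
    return f"{algorithm[:32]}-{encoded[:64]}"


digest = "sha256:" + "ab" * 32  # placeholder digest for illustration
print(referrers_url("registry.example.com", "myapp", digest))
print(fallback_tag(digest))
```

Either path yields an index manifest, from which the zTOC blobs for individual layers can then be fetched.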

zstd:chunked

Similar to eStargz but uses zstd compression
Reads TOC to locate file chunks
Downloads only necessary compressed chunks
Better compression ratio than gzip
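
For TOC-based formats, locating a file amounts to a lookup that yields a compressed byte range. The sketch below uses a deliberately simplified TOC whose entries carry `name`, `type`, `offset`, and `endOffset`. Those field names are an assumption modelled on the zstd:chunked manifest, the real TOC has more fields, and `locate_in_toc` is a hypothetical name.

```python
import json


def locate_in_toc(toc_json: str, path: str):
    """Return (start, end) compressed offsets for `path`, end exclusive.

    Returns None when the path is not a regular file in this layer.
    """
    wanted = path.lstrip("/")
    for entry in json.loads(toc_json)["entries"]:
        if entry.get("type") == "reg" and entry["name"].lstrip("/") == wanted:
            return entry["offset"], entry["endOffset"]
    return None


# A made-up TOC with two entries.
toc = json.dumps({
    "version": 1,
    "entries": [
        {"type": "dir", "name": "etc/"},
        {"type": "reg", "name": "etc/passwd", "offset": 2048, "endOffset": 2560},
    ],
})

located = locate_in_toc(toc, "/etc/passwd")
if located is not None:
    start, end = located
    # HTTP ranges are inclusive, so the last byte is end - 1.
    print(f"bytes={start}-{end - 1}")
```

Only the bytes in that range need to be downloaded and decompressed.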

zstd

Standard tar archive with zstd compression
Requires streaming the entire layer (like standard gzip)
Compresses better than gzip, so layers are smaller
Falls back to full layer extraction

Standard Layers

Falls back to streaming decompression (less efficient)
Still avoids pulling the entire image into local storage
Works with gzip-compressed tar archives
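
The streaming fallback can be sketched with Python's `tarfile` module in stream mode. This is an illustration of the technique, not the tool's implementation. It handles a single layer only and ignores OCI whiteout files and layer ordering, and `extract_from_stream` is a hypothetical name.

```python
import io
import posixpath
import tarfile


def extract_from_stream(stream, target: str):
    """Read a gzip-compressed tar stream and return the bytes of `target`.

    Returns None if the file is not present in this layer.
    """
    wanted = posixpath.normpath("/" + target.lstrip("/"))
    # "r|gz" reads strictly front to back, so it also works on a
    # non-seekable source such as an HTTP response body.
    with tarfile.open(fileobj=stream, mode="r|gz") as archive:
        for member in archive:
            # Layer entries may be stored as "etc/passwd" or "./etc/passwd".
            name = posixpath.normpath("/" + member.name.lstrip("/"))
            if member.isfile() and name == wanted:
                # Returning here stops reading the rest of the stream.
                return archive.extractfile(member).read()
    return None


# Build a tiny layer in memory to exercise the function.
data = b"root:x:0:0:root:/root:/bin/sh\n"
buffer = io.BytesIO()
with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
    info = tarfile.TarInfo("etc/passwd")
    info.size = len(data)
    archive.addfile(info, io.BytesIO(data))
buffer.seek(0)

print(extract_from_stream(buffer, "/etc/passwd"))
```

Because the archive is read sequentially, the amount of data consumed depends on where the file sits within the layer.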

Performance Comparison

File Extraction

For extracting a 10 KB file from a 500 MB image:

| Method | Downloaded | Time |
| --- | --- | --- |
| docker pull + cp | 500 MB | ~2 min |
| oci-extract (eStargz) | ~50 KB | ~2 sec |
| oci-extract (zstd:chunked) | ~50 KB | ~2 sec |
| oci-extract (SOCI) | ~100 KB | ~3 sec |
| oci-extract (zstd) | ~15 MB* | ~12 sec |
| oci-extract (Standard) | ~20 MB* | ~15 sec |

*Standard and zstd formats require downloading the entire layer containing the file
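
To put the download figures in proportion, the snippet below computes how much less data each method transfers than a full 500 MB pull. It uses the approximate values from the comparison above and assumes 1 MB = 1000 KB.

```python
FULL_PULL_KB = 500 * 1000  # 500 MB expressed in KB

# Approximate download sizes in KB, taken from the comparison above.
downloaded_kb = {
    "eStargz": 50,
    "zstd:chunked": 50,
    "SOCI": 100,
    "zstd": 15 * 1000,
    "Standard": 20 * 1000,
}


def reduction_factor(kb: float) -> float:
    """How many times smaller the download is than a full pull."""
    return FULL_PULL_KB / kb


for method, kb in downloaded_kb.items():
    print(f"{method}: about {reduction_factor(kb):,.0f}x less data")
```

The seekable formats cut transfer by three to four orders of magnitude, while the streaming formats save roughly 25x to 33x in this scenario.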

File Listing

For listing all files in a typical image:

| Format | Downloaded | Time |
| --- | --- | --- |
| eStargz | ~50-100 KB (TOC) | ~2-3 sec |
| zstd:chunked | ~50-100 KB (TOC) | ~2-3 sec |
| SOCI | ~100-200 KB (zTOC + index) | ~3-4 sec |
| zstd | Full layer size | ~8-25 sec |
| Standard | Full layer size | ~10-30 sec |

Limitations

Standard and zstd (non-seekable) formats require downloading the entire layer containing the target file
SOCI support requires the image to have SOCI indices generated beforehand
zstd:chunked requires images to be converted with nerdctl or compatible tools
Some registries may not support HTTP Range requests (though most do)
Large files in highly compressed layers may still require significant downloads

Contributing

Contributions are welcome! See CONTRIBUTING.md for detailed development setup, workflow, and guidelines.

License

MIT License

Acknowledgments

Built on top of:

google/go-containerregistry - OCI registry client
containerd/stargz-snapshotter - eStargz support
awslabs/soci-snapshotter - SOCI support
klauspost/compress - zstd compression
spf13/cobra - CLI framework
