Skip to content
Mbox Viewer

Glossary

Email & MBOX glossary

Key terms for email archives, formats, protocols and the anatomy of a message — each with its own page, explained in plain language.

File formats

Protocols & services

Message structure

Header

The structured metadata block at the beginning of an email message, containing fields like From, To, Subject, Date, and numerous technical fields that describe how the message was composed, routed, and encoded.

MIME

Multipurpose Internet Mail Extensions — the standard that defines how email messages encode non-ASCII text, HTML bodies, attachments, and other binary content within the plain-text structure of email.

Multipart boundary

A MIME message structure that combines multiple content parts — such as plain text, HTML, and attachments — in a single message, each separated by a unique boundary string.

Message-ID

A globally unique identifier assigned to each email message, specified in the Message-ID header. It is used to track messages, build conversation threads, and detect duplicates when merging archives.

In-Reply-To / References

Email headers (In-Reply-To and References) that link a reply to the message it responds to, enabling mail clients and archive tools to group related messages into conversation threads.

Envelope

The delivery metadata used by SMTP servers to route an email message — specifically the envelope sender (MAIL FROM) and envelope recipients (RCPT TO) — which may differ from the visible From and To headers.

Attachment

A file — such as a PDF, image, or spreadsheet — embedded in an email message and encoded as a MIME part, separate from the message body, intended for the recipient to save or open.

Inline image CID

An image embedded directly into an HTML email body using a Content-ID (cid:) reference, rather than attached as a separate downloadable file. The image data is stored as a MIME part within the same message.

Encoding & charsets

Concepts & features

Threading (JWZ)

The process of grouping related email messages into conversations by following In-Reply-To and References header links, typically using the JWZ algorithm that supports up to four levels of nesting.

Deduplication

The process of detecting and removing duplicate email messages from an archive, typically by comparing Message-ID values, to avoid redundancy when merging multiple MBOX files.

Streaming parser

A parsing technique that reads a file incrementally in small chunks rather than loading the entire file into memory at once, enabling tools to open and index very large MBOX files — tens or hundreds of gigabytes — with low memory usage.

Binary index

A compact index file that Mbox Viewer writes alongside an MBOX archive after the first parse, storing message byte offsets and metadata to enable near-instant reopens without re-scanning the entire file.

HTML email

An email message whose body is formatted with HTML and CSS, allowing rich typography, layout, colors, and images. Most modern email is HTML, but privacy and security considerations require careful rendering.

Universal Binary

A macOS application bundle that contains native compiled code for both Apple Silicon (ARM64) and Intel (x86_64) architectures, running natively on either processor without emulation.

App Sandbox

App Sandbox is a macOS security feature that restricts an application's access to system resources, files, and network connections, limiting the potential impact if the app or its dependencies have a vulnerability.