Edge-first. Event-sourced. Content-addressed. Every layer designed so humans and AI agents collaborate on files through one unified API.
System Overview
Client, edge, storage
Every request follows the same path. Any client - browser, AI agent, CLI - hits the nearest edge node for auth and routing. The edge reads and writes to two storage layers: a metadata index and a content-addressed blob store.
S3-compatible blob store, content-addressed by SHA-256
SHA-256 · presigned URLs · zero egress
Meta Pane
Paths and content live apart
File identity is decoupled from file content. A path points to a content hash, like how a filename points to an inode. This makes moves, renames, and dedup instant.
files

| path              | addr        | size   | tx |
|-------------------|-------------|--------|----|
| report.pdf        | a1b2c3d4... | 200 KB | 1  |
| data/config.json  | d4e5f6a7... | 512 B  | 2  |
| backup/report.pdf | a1b2c3d4... | 200 KB | 6  |

blobs

| addr        | ref_count |
|-------------|-----------|
| a1b2c3d4... | 2         |
| d4e5f6a7... | 1         |
Move / Rename O(1)
Update the path column. The blob stays in place. Zero bytes copied, zero storage operations. Instant regardless of file size.
Deduplication automatic
Two files with identical content share one blob. The addr column matches, ref_count increments. No extra storage consumed.
Delete ref counted
Remove the file entry, decrement ref_count. The blob is only garbage-collected when no files reference it anymore.
Integrity built in
The content hash IS the address. If the data doesn't match the hash, corruption is self-evident. No separate checksums needed.
Request Lifecycle
Zero data through the API
Clients compute SHA-256 locally, then upload directly to the content-addressed location via presigned URLs. Downloads redirect to presigned URLs. File bytes never touch the edge worker.
Upload Flow
01
Compute hash & initiate
Client computes SHA-256 of the file locally, then sends path, content type, and hash. Edge checks if the blob already exists (instant dedup). If not, generates a presigned PUT URL to the content-addressed location.
POST /files/uploads {path: "report.pdf", content_hash: "a1b2c3..."}
dedup check
02
Direct upload to content-addressed location
Client PUTs file bytes directly to the presigned URL. The blob lands at its final content-addressed path. No data flows through the API server.
PUT {presigned_url} blobs/{actor}/a1/b2/a1b2c3...
direct to storage
03
Confirm & index
Client confirms completion. Edge verifies the blob exists at the content-addressed location via HEAD (no data pull), then writes the file entry + event to the Meta Pane and updates blob ref count.
POST /files/uploads/complete {path: "report.pdf"}
fast
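Step 01 from the client's side can be sketched in a few lines. The payload fields mirror the example request above; this is an illustrative helper, not a definitive client:

```python
import hashlib

def initiate_payload(path: str, content: bytes) -> dict:
    # The client hashes locally, then sends metadata only. The edge can
    # answer the dedup check from this hash without seeing any file bytes.
    return {"path": path, "content_hash": hashlib.sha256(content).hexdigest()}

payload = initiate_payload("report.pdf", b"hello")
```

If the edge reports the blob already exists, the client can skip step 02 entirely and go straight to the confirm call.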
Download Flow
01
Request file
Client requests a file by path. Edge validates auth, looks up the file in the Meta Pane.
GET /files/report.pdf -H "Authorization: Bearer $TOKEN"
fast
02
Presigned redirect
Edge generates a time-limited presigned GET URL and returns a 302 redirect. The client follows it automatically.
Multipart Upload
Files too large for a single PUT are split into parts. Each part uploads directly to object storage. The edge never touches the file bytes.
01
Compute hash & initiate
Client computes SHA-256 of the complete file, then requests a multipart upload. Edge creates the upload at the content-addressed location and returns presigned URLs for each part.
POST /files/uploads/multipart {path: "dataset.parquet", part_count: 5, content_hash: "a1b2c3..."}
up to 10,000 parts
02
Upload parts directly
Client PUTs each part directly to its presigned URL. Parts upload in parallel to object storage. No data flows through the API server.
PUT {part_url_1} --data-binary @part1
PUT {part_url_2} --data-binary @part2 (parallel)
direct to storage
03
Complete & index
Client sends part ETags to finalize assembly. Edge verifies the assembled blob via HEAD (no data pull), then records file entry + event in the Meta Pane.
POST /files/uploads/multipart/complete {parts: [{part_number: 1, etag: "..."}, ...]}
metadata only
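The client-side half of this flow, hashing the whole file and slicing it into parts, can be sketched as follows. The 4 MiB part size is an assumption for illustration, not a documented limit:

```python
import hashlib
import math

PART_SIZE = 4 * 1024 * 1024  # illustrative; real part-size limits are store-specific

def split_for_multipart(blob: bytes, part_size: int = PART_SIZE):
    # Step 01 computes the whole-file hash up front; steps 02-03 treat parts
    # as independent byte slices that can upload in parallel.
    content_hash = hashlib.sha256(blob).hexdigest()
    parts = [blob[i:i + part_size] for i in range(0, len(blob), part_size)]
    return content_hash, parts

blob = bytes(10 * 1024 * 1024)                 # 10 MiB of zeros
content_hash, parts = split_for_multipart(blob)
assert len(parts) == math.ceil(len(blob) / PART_SIZE)
assert b"".join(parts) == blob                 # reassembly is lossless
```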
Range Read
Download any byte range
Downloads redirect to presigned object storage URLs, which natively support HTTP Range headers. Partial downloads never touch the edge worker.
01
Request file
Client requests a file by path. Edge validates auth, looks up metadata, and returns a 302 redirect to a presigned object storage URL.
GET /files/report.pdf → 302 Location: {presigned_url}
redirect
02
Range request to storage
Client follows the redirect and sends a standard HTTP Range header directly to object storage. The storage layer handles partial content natively, returning HTTP 206.
GET {presigned_url} -H "Range: bytes=0-1023" → 206 Partial Content
direct from storage
Resume downloads
Interrupted transfers can resume from the last byte received.
Stream sections
Read specific byte ranges without downloading the entire file.
Zero worker load
All byte serving happens at the storage layer, not the edge worker.
Standard HTTP
Uses standard Range headers. Works with any HTTP client.
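A rough sketch of what the storage layer does with a single `Range` header, assuming only the simple `bytes=a-b`, `bytes=a-`, and `bytes=-n` forms:

```python
import re
from typing import Optional, Tuple

def serve_range(blob: bytes, range_header: Optional[str]) -> Tuple[int, bytes, Optional[str]]:
    """Return (status, body, content_range) for a single-range request."""
    if not range_header:
        return 200, blob, None
    m = re.fullmatch(r"bytes=(\d*)-(\d*)", range_header)
    if m is None:
        return 200, blob, None   # this sketch ignores malformed/multi ranges
    size = len(blob)
    start_s, end_s = m.group(1), m.group(2)
    if start_s == "":                          # suffix form "bytes=-n": last n bytes
        start, end = size - int(end_s), size - 1
    else:                                      # "bytes=a-b" or open-ended "bytes=a-"
        start = int(start_s)
        end = min(int(end_s), size - 1) if end_s else size - 1
    return 206, blob[start:end + 1], f"bytes {start}-{end}/{size}"
```

Resume is just the open-ended form: a client that already has 500 bytes sends `bytes=500-` and receives only the remainder.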
Content Addressing
Same content, one blob
Files are stored by their SHA-256 hash. Upload the same file twice, only one copy is stored. Rename a file, zero bytes copied.
| FILE PATH         | BLOB ADDRESS                         |
|-------------------|--------------------------------------|
| report.pdf        | blobs/alice/a1/b2/a1b2c3d4e5f6789... |
| data/config.json  | blobs/alice/d4/e5/d4e5f6a7b8c90ab... |
| backup/report.pdf | blobs/alice/a1/b2/a1b2c3d4e5f6789... |
Automatic dedup
Same content = same blob. Always.
Free integrity
The hash IS the identifier. Corruption is self-evident.
Zero-cost move
Only the Meta Pane path changes. No blob copies.
Collision risk ~2^-128
Effectively zero. Would take ~10^21 years at 10B files/sec.
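The sharded key layout shown above can be captured in a small helper. `blob_key` is a hypothetical name, and the two-level, two-hex-character sharding is inferred from the example paths:

```python
def blob_key(actor: str, content_hash: str) -> str:
    # Two shard directories from the hash prefix, then the full hash as the
    # object name, e.g. blobs/alice/a1/b2/a1b2c3d4...
    return f"blobs/{actor}/{content_hash[0:2]}/{content_hash[2:4]}/{content_hash}"
```

Because the key is a pure function of actor and content hash, any client that knows the hash can compute the final location before a single byte is uploaded.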
Event Architecture
Every mutation is an event
Not just logging. The event log is the primary source of truth for what happened, when, and by whom. Every version, every action, fully replayable.
| TX | ACTION | PATH             | ADDR        | TIME     |
|----|--------|------------------|-------------|----------|
| 1  | put    | readme.md        | a1b2c3d4... | 09:14:22 |
| 2  | put    | data/config.json | d4e5f6a7... | 09:14:23 |
| 3  | move   | docs/readme.md   | a1b2c3d4... | 09:15:01 |
| 4  | delete | old/draft.txt    | -           | 09:15:44 |
| 5  | put    | data/config.json | f7a8b9c0... | 09:16:12 |
Smart Versioning
Every write to the same path creates a new event with a new content hash. Previous versions remain in the log. Reconstruct the full history of any file by reading its events.
Replayable
Start from tx 0, apply events in order, and reconstruct the complete state of any actor's storage at any point in time. Perfect for disaster recovery and debugging.
Auditable
Every event records who did what, when. No action goes untracked. For AI agents operating autonomously, this provides complete accountability and traceability.
Realtime Change Tracking
Agents poll for events since their last known transaction number. No need to list all files and diff. Just ask "what changed since tx 42?" and get exactly the new events.
APPEND-ONLY
Immutable log
Events are never updated or deleted. Full history preserved.
MONOTONIC
Dense sequence
Transaction numbers have no gaps. Ordered by insertion time.
PER-ACTOR
Isolated counters
Each actor has its own tx sequence. No cross-actor contention.
VERSIONED
Every write tracked
Multiple puts to the same path create a version chain in the log.
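Replay and incremental sync can be sketched over events shaped like the log above. One assumption: the move event here carries both the old and new path, since the table shows only the destination; the hashes are illustrative:

```python
EVENTS = [
    {"tx": 1, "action": "put",    "path": "readme.md",        "addr": "a1b2c3d4"},
    {"tx": 2, "action": "put",    "path": "data/config.json", "addr": "d4e5f6a7"},
    {"tx": 3, "action": "move",   "path": "docs/readme.md",   "from": "readme.md"},
    {"tx": 4, "action": "delete", "path": "old/draft.txt"},
    {"tx": 5, "action": "put",    "path": "data/config.json", "addr": "f7a8b9c0"},
]

def replay(events, since_tx=0):
    """Apply events in tx order; pass since_tx > 0 to skip already-seen events."""
    state = {}
    for e in events:
        if e["tx"] <= since_tx:
            continue
        if e["action"] == "put":
            state[e["path"]] = e["addr"]       # a new put to a path is a new version
        elif e["action"] == "move":
            state[e["path"]] = state.pop(e["from"], None)
        elif e["action"] == "delete":
            state.pop(e["path"], None)         # blob may predate this log window
    return state
```

An agent syncing since tx 3 only processes events 4 and 5; a full replay from tx 0 reconstructs the entire file tree.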
Dual Identity Model
Humans and agents are equals
Both authenticate differently but get the same API surface. No "service accounts" or "bot modes", just actors.
Human
Magic link authentication
1. Enter email address
2. Click magic link in inbox
3. Session cookie set automatically
4. Use browser, API, or CLI
AI Agent
Ed25519 challenge-response
1. Register public key via /auth/register
2. Request challenge nonce via /auth/challenge
3. Sign nonce with private key
4. Exchange signature for bearer token
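A minimal sketch of the challenge-response handshake, using the pyca/cryptography package as a stand-in (an assumption; the service's actual verification code is not shown here):

```python
import os
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey

# Agent side: keypair generated once; the public key is what gets registered.
agent_key = Ed25519PrivateKey.generate()
public_key = agent_key.public_key()

# Server side: issue a single-use random nonce (step 2).
nonce = os.urandom(32)

# Agent side: sign the nonce with the private key (step 3).
signature = agent_key.sign(nonce)

# Server side: verify before minting a bearer token (step 4).
# verify() raises cryptography.exceptions.InvalidSignature on failure.
public_key.verify(signature, nonce)
```

Because the nonce is random and single-use, a captured signature cannot be replayed, and the private key never leaves the agent.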
Same API, same capabilities
| CAPABILITY      | HUMAN | AGENT |
|-----------------|-------|-------|
| Upload files    | Yes   | Yes   |
| Download files  | Yes   | Yes   |
| Share files     | Yes   | Yes   |
| Search files    | Yes   | Yes   |
| Connect via MCP | Yes   | Yes   |
Edge-First
Global by default
No origin server. No single region. The entire application runs at the edge, distributed across every continent. Your request never crosses the world to reach a data center.
GLOBAL EDGE NETWORK
North America
South America
Europe
Africa
Asia
Oceania
Every request served from the nearest node
HOW A REQUEST TRAVELS
Your client ──nearest edge──> Edge node ──presigned URL──> Object store
(any location)                (Auth + Meta Pane + Presign)  (Direct transfer)
| Traditional (single origin)            | Storage (global edge)               |
|----------------------------------------|-------------------------------------|
| All requests route to one region       | Requests hit the nearest edge node  |
| File bytes proxy through the API       | Direct transfers via presigned URLs |
| Cold starts from container spin-up     | Near-instant cold starts at the edge|
| Latency scales with distance to origin | Consistent performance everywhere   |
For AI agents
Agents in cloud functions have their own cold starts. Adding network round-trips to a distant origin makes tool calls slow. Edge keeps the API fast wherever the agent runs.
For humans
The web dashboard, file browser, and share links all load from the nearest edge. No matter where your team is located, the experience is the same.
For collaboration
When a human in Tokyo and an agent in Virginia share the same storage, both get fast responses. No single region becomes a bottleneck for the team.
MCP Integration
AI-native, not AI-bolted
MCP tools map directly to storage operations. Same engine as REST. A file uploaded via API is instantly visible to Claude or ChatGPT.
AI Client (Claude, ChatGPT, custom) ──> MCP Server (OAuth + 8 tools) ──> Storage Engine (Meta Pane + Object Store)
| MCP TOOL       | REST EQUIVALENT      | DESCRIPTION                    |
|----------------|----------------------|--------------------------------|
| storage_read   | GET /files/{path}    | Read file contents             |
| storage_write  | POST /files/uploads  | Write or overwrite a file      |
| storage_list   | GET /files?prefix=   | List files in a folder         |
| storage_search | GET /files/search    | Search files by name           |
| storage_share  | POST /files/share    | Create a temporary public link |
| storage_move   | POST /files/move     | Move or rename a file          |
| storage_delete | DELETE /files/{path} | Delete a file                  |
| storage_stats  | GET /files/stats     | Get storage usage              |
Why AI Agents Love This
Designed for autonomous operation
Every architectural decision was made with AI agents in mind, not as an afterthought but as a primary use case.
01
No SDK required
Plain HTTP with JSON. Any agent runtime can call Storage with zero dependencies.
02
Deterministic responses
Consistent JSON schemas documented via OpenAPI. No HTML parsing, no brittle scraping.
03
MCP native
8 tools map directly to storage operations. Connect once, then read, write, search, and share from any MCP-compatible AI.
04
Incremental sync
Event sourcing means agents poll for changes since their last known tx. No full directory scans required.
05
Automatic deduplication
Content addressing means agents don't waste storage re-uploading the same file. SHA-256 handles it.
06
Edge-fast responses
Fast enough for tool calls inside LLM inference loops. Edge runtime eliminates round-trips to a distant origin.
07
First-class identity
Agents are actors, not hacks on user accounts. Ed25519 key auth designed for programmatic access from day one.
08
Scoped permissions
API keys with path-prefix restrictions. Give an agent access to data/ but not secrets/. 90-day TTL auto-rotation.
09
Full audit trail
Every agent action is logged with actor, resource, and timestamp. Debug agent behavior with complete history.
10
Zero egress fees
Agents can read files as often as they need without cost anxiety. No bandwidth metering, ever.
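Item 08's path-prefix scoping can be sketched as a hypothetical `is_allowed` check. The segment-aware matching is a design assumption, chosen so a key scoped to `data/` cannot also read `database/`:

```python
from typing import List

def is_allowed(scopes: List[str], path: str) -> bool:
    # A scope like "data/" grants the whole subtree under it. Matching is
    # segment-aware: plain startswith("data") would wrongly admit "database/".
    for scope in scopes:
        prefix = scope if scope.endswith("/") else scope + "/"
        if path == scope.rstrip("/") or path.startswith(prefix):
            return True
    return False
```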
Technology Stack
Chosen for simplicity
Every component is the simplest technology that solves the problem well. No over-engineering.
COMPUTE
Edge Workers
Distributed globally across every continent. Near-instant cold starts. No containers, no VMs, no orchestration.
API SPEC
OpenAPI 3.1
Auto-generated API documentation from route definitions. Every endpoint typed, validated, and documented.
METADATA
Meta Pane
Fast reads, strong consistency, zero configuration. Stores file index, events, sessions, and blob references.
STORAGE
Object Store
S3-compatible, globally distributed, zero egress fees. Content-addressed blobs with presigned URL access.
AUTH
Ed25519 + Magic Links
Public-key challenge-response for agents. Email magic links for humans. No passwords to manage or leak.
PROTOCOL
REST + MCP
Plain HTTP for universal access. Model Context Protocol for AI-native integration. Same engine, two interfaces.
VALIDATION
Schema Validation
Runtime type safety for all request and response schemas. Invalid inputs rejected before reaching business logic.
SECURITY
OAuth 2.0 + PKCE
Standard flow for third-party apps and MCP clients. Dynamic client registration. Scoped API keys with TTL.
OBSERVABILITY
Audit Logging
Every action logged with actor, resource, and timestamp. Event-sourced transaction log doubles as audit trail.
# Storage Architecture

Purpose-built for AI agents and humans to collaborate on files.
Storage is an edge-first file storage platform. Every design decision, from content addressing to event sourcing to the dual identity model, serves both human users and AI agents through one unified API.
## System Overview
- Meta Pane: structured metadata store. File index, events, sessions, blob refs.
- Blob Store: S3-compatible Object Storage. File content, content-addressed by SHA-256.
- Protocol: REST + MCP. Unified access for humans and AI agents.
## Meta Pane
The Meta Pane stores all structured data: file entries, events, sessions, blob references, and transaction counters. It separates file identity from file content, similar to how UNIX inodes separate a filename from disk blocks.
### Data Model
### Why This Enables Fast Operations

- Move / Rename: Update the path column. Zero blob copies. O(1).
- Deduplication: Same addr means same blob. Upload once, reference many times.
- Delete: Decrement ref_count. Only delete the blob when it reaches zero.
- Integrity: The hash IS the address. Corruption is self-evident.

## Request Lifecycle

### Upload Flow
- Client-side SHA-256 enables instant deduplication before upload
- File bytes upload directly to the content-addressed blob location
- On confirm, edge verifies via HEAD (no data pull), then records metadata
- Zero file data flows through the edge worker

### Multipart Upload
1. Client computes SHA-256 of complete file locally
2. Client ──POST /files/uploads/multipart {content_hash, part_count}──> Edge
3. Edge ──returns presigned URLs for each part──> Client
4. Client ──PUT parts in parallel──> Object Storage (direct)
5. Client ──POST /files/uploads/multipart/complete {parts}──> Edge (HEAD verify + index)
- Up to 10,000 parts per upload
- Parts upload in parallel directly to object storage
- Same zero-proxy guarantee as single uploads
- Same client-side SHA-256 dedup check

### Download Flow
- Zero bandwidth through API server
- No egress fees from Object Storage

### Range Read
Downloads redirect to presigned object storage URLs, which natively support HTTP Range headers. Partial downloads never touch the edge worker.
- Resume interrupted downloads from last byte
- Read specific byte ranges without downloading the full file
- All byte serving happens at the storage layer

## Content Addressing
Every file is stored by the SHA-256 hash of its content.
### Properties

- Automatic deduplication: same content = same blob, always
- Free integrity verification: the hash IS the identifier
- Zero-cost rename and move: only the Meta Pane path changes
- Collision risk ~2^-128: effectively zero
Upload the same file twice, one blob stored. Rename a file, zero bytes copied.
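A minimal sketch of why this holds: when the store is keyed by the SHA-256 of the bytes, writing identical content twice can only ever occupy one slot. The names here are illustrative:

```python
import hashlib

store = {}   # content address -> bytes; stands in for the blob store

def put_blob(content: bytes) -> str:
    addr = hashlib.sha256(content).hexdigest()
    store.setdefault(addr, content)   # no-op if the blob already exists
    return addr

a = put_blob(b"same bytes")
b = put_blob(b"same bytes")   # "uploaded" again, e.g. under a different path
```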
## Event Architecture
Every mutation in the system produces an immutable event. This is not just logging. It is the primary source of truth for what happened, when, and by whom.
### Event Log
### Properties

- Append-only: events are never updated or deleted (until GC compaction)
- Per-actor isolation: each actor has its own transaction counter
- Dense sequence: tx numbers have no gaps, ordered by insertion time
- Monotonic: tx counter never resets

### Smart Versioning
Every put to the same path creates a new event with a new addr. The previous version is still in the event log. You can reconstruct the full history of any file by reading events for that path.
### Replayable
Given the event log, you can reconstruct the complete state of any actor's storage at any point in time. Start from tx 0, apply events in order, and you get the exact file tree.
### Auditable
Every event records who did what, when. No action goes untracked. For AI agents operating autonomously, this provides complete accountability.
### Realtime Change Tracking
Agents can poll for events since their last known tx number. No need to list all files and diff. Just ask "what changed since tx 42?" and get exactly the new events.
### Why This Matters for AI Agents

- Incremental sync without full directory scans
- No race conditions between concurrent agent operations
- Full audit trail of every action an agent took
- Deterministic replay for debugging agent behavior
- Version history for every file, built in

## Dual Identity Model
Storage treats humans and AI agents as equal participants. Both are "actors" with the same API surface and permissions model.
### Human Authentication
1. Request magic link via email
2. Click link to activate session
3. Session cookie set automatically
4. Use browser, API, or CLI
### Agent Authentication
1. Register Ed25519 public key
2. POST /auth/challenge to receive nonce
3. Sign nonce with private key
4. POST /auth/verify to get bearer token
### Same API, Different Auth
| Capability      | Human | Agent |
|-----------------|-------|-------|
| Upload files    | Yes   | Yes   |
| Download files  | Yes   | Yes   |
| Share files     | Yes   | Yes   |
| Search files    | Yes   | Yes   |
| Connect via MCP | Yes   | Yes   |
No special "service account" or "bot mode". An agent is just another actor.
## Edge-First Design
The entire application runs at the edge, as close to the client as possible. No origin server. No single region. V8 isolates distributed across every continent.
### How a Request Travels
Your client ──nearest edge──> Edge node ──presigned URL──> Object store
(any location) (Auth + Meta Pane + Presign) (Direct transfer)
### Traditional vs Edge
| Traditional (single origin)            | Storage (global edge)               |
|----------------------------------------|-------------------------------------|
| All requests route to one region       | Requests hit the nearest edge node  |
| File bytes proxy through the API       | Direct transfers via presigned URLs |
| Cold starts from container spin-up     | Near-instant cold starts            |
| Latency scales with distance to origin | Consistent performance everywhere   |
### Why Edge Matters

- For AI agents: Agents in cloud functions have their own cold starts. Adding network round-trips to a distant origin makes tool calls slow. Edge keeps the API fast wherever the agent runs.
- For humans: The web dashboard, file browser, and share links all load from the nearest edge. No matter where your team is located, the experience is the same.
- For collaboration: When a human in Tokyo and an agent in Virginia share the same storage, both get fast responses. No single region becomes a bottleneck for the team.

## MCP: Native AI Integration
Storage implements the Model Context Protocol as a first-class interface, not an adapter bolted on top.
### Tools
| Tool           | Maps to                       | Description                    |
|----------------|-------------------------------|--------------------------------|
| storage_read   | GET /files/{path}             | Read file contents             |
| storage_write  | POST /files/uploads + confirm | Write or overwrite a file      |
| storage_list   | GET /files?prefix=            | List files in a folder         |
| storage_search | GET /files/search?q=          | Search files by name           |
| storage_share  | POST /files/share             | Create a temporary public link |
| storage_move   | POST /files/move              | Move or rename a file          |
| storage_delete | DELETE /files/{path}          | Delete a file                  |
| storage_stats  | GET /files/stats              | Get storage usage              |
MCP tools and REST API share the same storage engine. A file uploaded via REST is immediately visible via MCP, and vice versa.
## Technology Stack
| Component    | Technology                 | Why                                          |
|--------------|----------------------------|----------------------------------------------|
| Runtime      | Edge Workers               | Near-instant cold start, global distribution |
| API Spec     | OpenAPI 3.1                | Auto-generated docs from route definitions   |
| Meta Pane    | Structured metadata store  | Fast reads, strong consistency, zero config  |
| Blob Storage | S3-compatible Object Store | Durable, zero egress fees, presigned URLs    |
| Auth         | Ed25519 + Magic Links      | No passwords, no shared secrets              |
| API Keys     | Scoped, prefixed (sk_*)    | Path restrictions, 90-day TTL                |
| Protocol     | REST + MCP                 | Universal access, AI-native                  |
| Validation   | Schema validation          | Runtime type safety, request validation      |
| Security     | OAuth 2.0 + PKCE           | Standard flow for third-party apps and MCP   |
## Why This Architecture Suits AI Agents

1. No SDK required: plain HTTP. Any agent runtime can call it.
2. Deterministic responses: JSON with consistent schemas documented via OpenAPI. No HTML parsing.
3. MCP native: tools map directly to storage operations.
4. Event sourcing: agents can sync incrementally, not poll everything.
5. Content addressing: deduplication means agents don't waste storage re-uploading.
6. Edge-first: fast enough for tool calls inside LLM inference loops. Near-instant cold starts.
7. Dual identity: agents are first-class actors, not hacks on top of user accounts.
8. Scoped keys: agents get exactly the permissions they need, nothing more.
9. Audit trail: every agent action is logged and traceable.
10. Zero egress: agents can read files as often as they need without cost anxiety.

## Links

- Developer Guide
- API Reference
- CLI Documentation
- Pricing