Architecture

Insight is built as a Tauri 2.0 desktop application with a Rust backend and Svelte frontend.

Core Stack

Component	Library	Purpose
App framework	Tauri 2.0	Desktop app (Rust backend, web frontend)
UI	Svelte 5	Frontend
Styling	Tailwind 4	Utility-first CSS
LLM inference	mistralrs	Local model loading and inference (GGUF format)
Model download	hf-hub	Fetch models from HuggingFace
P2P / Sync	iroh	Connections, NAT traversal, sync
Content storage	iroh-blobs	Content-addressed file storage
Metadata sync	iroh-docs	CRDT key-value store for metadata
Real-time	iroh-gossip	Pub/sub for live updates
Search	milli	Full-text + vector search (used by agent)
PDF processing	lopdf	Text extraction

Agent Architecture

User Query
    ↓
Local LLM (via mistralrs)
    ↓
Tool Calling Loop
    ↓
Synthesized Answer (with citations)

The agent has tools for searching and reading documents. It iteratively gathers evidence to answer questions, citing sources along the way. There is no direct user-facing search—all document retrieval happens through the agent.

Data Model

Collections as Namespaces

Each collection is an iroh-docs namespace. Sharing a collection means sharing namespace access.

Namespace: 7f3a8b2c... ("Climate Research")
│
├── files/abc123     → blob with metadata JSON
├── files/def456     → blob with metadata JSON
└── _collection      → blob with collection settings

Document Metadata

Document metadata is stored as a blob and referenced by an entry in the namespace:

{
	"name": "paper.pdf",
	"pdf_hash": "blake3-hash-of-pdf",
	"text_hash": "blake3-hash-of-extracted-text",
	"tags": ["research", "climate"],
	"created_at": "2024-01-15T10:30:00Z"
}

Content-Addressed Storage

All file content (PDFs, extracted text) is stored in iroh-blobs using content-addressing:

Files are identified by their BLAKE3 hash
Duplicate files are automatically deduplicated
Content can be verified for integrity

Embedding Sync

Embeddings are stored in iroh-docs under embeddings/{doc_id}/{model_id}. This design:

Avoids redundant computation — generating embeddings is expensive, so peers share them
Preserves model flexibility — different peers can use different embedding models
Enables offline use — embeddings sync with documents, ready for immediate use

When a peer receives a document, it checks for existing embeddings matching its configured model. If found, they’re used directly. If not (different model or new document), embeddings are generated locally and stored for other peers to use.

Data Flow

Local Import

User adds PDF to collection
Extract text via lopdf
Store PDF blob → get pdf_hash
Store text blob → get text_hash
Create metadata entry in iroh-docs
Index text + generate embeddings in milli

On Sync

When a new metadata entry arrives from a peer:

Fetch text blob using text_hash
Index text + generate embeddings in milli
PDF blob fetched on-demand (when user opens document)

What Syncs vs What’s Local

Data	Syncs	Stored in
PDF files	Yes	iroh-blobs
Extracted text	Yes	iroh-blobs
File metadata	Yes	iroh-docs
Collection info	Yes	iroh-docs
Embeddings	Yes (keyed by model)	iroh-docs
Search index	No (derived)	milli (for agent)
LLM models	No	~/.cache/huggingface/hub

Local Storage

~/.local/share/insight/
├── iroh/               # iroh data (blobs, docs)
└── search/             # milli index

~/.cache/huggingface/hub/
└── models--*/          # Downloaded models (LLM + embedding)

On Windows, app data is under %LOCALAPPDATA%\insight\ and models under %USERPROFILE%\.cache\huggingface\hub\.

Keyboard shortcuts

Insight