Architecture

SystemArchitecture

SAMI is a multi-layered AI system built on precise planning and stage-gated consensus. Every stage must persuade the council before code is written.

ResponseStreaming

StagesStage-gated

ConsensusVeto-based

LAYER_01_INPUT
STATUS_ACTIVE

CONSENSUS_LAYER
MODE_PARALLEL

System Overview Specification

Client Layer

Code Editor

Industry-standard editor

Multi-Agent Chat

Live discussion view

Terminal

Integrated shell

Diff View

Live Streaming

Multi-Agent Orchestration

SAMI Agent Flow

Agent Workflows

Supervisor

Multi-model council

Consensus

Voting Protocol

Code Intelligence

AST Parser

Semantic understanding

Symbol Index

Functions/Classes

Semantic Chunks

Smart Splitting

Hybrid Context Engine

SAMI Vector Core

Vector context

SAMI Search Engine

Keyword Search

Rank Fusion

Multi-retriever fusion

Reranker

Cross-encoder scoring

Model Execution Layer

Frontier reasoning

Top-tier reasoning models

Long-context drafting

Extended-context generation models

Vision-enabled

Multimodal vision models

Open-weights

Open-weights via managed providers

Code specialist

Reasoning-optimised code models

Core Modules

The core pillars of the SAMI architecture

Hybrid Context Engine

Hybrid project retrieval for source-aware context

Code Intelligence

AST-based semantic code understanding

Multi-Agent Orchestration

Multi-model council: every model reviews with equal weight and votes

Model Gateway & Managed Models

Cloud API Routing + Managed Model Catalog

Smart Indexing

AST-based Chunking + Symbol Metadata

Security & Privacy Layer

Managed infrastructure with zero cloud persistence

The Council

Every model you pick is an equal-vote Guardian for the whole run. Lead and Writer are duties that move between Guardians stage by stage — never a higher rank. A no never ends the run: it loops back until the council reaches a genuine yes.

Equal council, rotating duties

Every selected model is an equal-vote Guardian for the whole run. Lead and Writer are duties that move between Guardians as the pipeline advances — never a higher rank.

04MaverickStep 4 / 7

The code stage — the Writer drafts the implementation while the council critiques.

Guardian A
Yes
Reads, researches, files change requests.
Guardian B
Yes
Reads, researches, files change requests.
Guardian C
Yes
Reads, researches, files change requests.
Guardian D
Yes
Leadduty this stage
Coordinates discussion, announces the outcome.
Guardian E
Yes
Writerduty this stage
Writes code and runs commands this stage.

Guardian. Every selected model. Equal vote, every stage, the whole run.

Lead

Lead. A per-stage duty: coordinates the discussion and announces the outcome — votes with equal weight. It moves to a different Guardian every stage.

Writer

Writer. A per-stage duty: the only model that writes code or runs commands, at the build stages. It also rotates between Guardians.

Binary vote. Each Guardian votes Yes or No — there is no third option. A No never ends the run: it loops back to deliberation, the council brings counter-arguments, and it resolves into a genuine Yes.

Any Guardian can pause the Writer mid-work

Supervision is not a one-time gate at the end of a stage. While the Writer is producing or changing code, any Guardian that spots a problem can raise an objection before the next action is taken. The objection routes straight back to deliberation — the council brings counter-arguments and alternatives, and the work only resumes once the concern resolves into a genuine yes.

Hybrid-Search Pipeline

From raw query to hyper-optimized AI context

Query Analysis Firewall

Intent detection classifies: Structural? Conceptual? Hybrid?

Vector Search Engine

Semantic retrieval for source-aware context

Keyword Search Engine

Exact-symbol retrieval for names, APIs, and file paths

Rank Fusion

Rank fusion, scoring, and a compact result set for the council

Optimized Context Block

Max 8K tokens. Structural + semantic information merged and compressed.

Final Output

Context Flows Into the Council

The optimized context block is not the finish line — it is the start. From there, context flows into every stage: a shared baseline every stage reads, a per-stage slice pulled on top, and cross-stage recall so any model can look back at what any other Guardian said earlier in the run.

Context engine

Hybrid retrieval blends semantic, keyword and code intelligence into one project-context stream the council can draw on.

Semantic· meaning-based searchKeyword· exact-term searchCode intelligence· workspace grounding

per-stage slice
01Connector
per-stage slice
02Strategist
per-stage slice
03Architectrecalled later
per-stage slice
04Maverick
per-stage slice
05Optimizer
per-stage slice
06Guardianlooks back ↩
per-stage slice
07Debugger

Shared baseline contextevery stage reads this, the whole run

Cross-stage recall — any model can look back at earlier council context

Shared baseline context — every stage reads it
Per-stage context slice — pulled on top into each stage
Cross-stage recall — look back at earlier council context

The Iterative Workflow

A loop with human-in-the-loop validation — step by step until the finished product.

The 7-stage council pipeline

One council, seven stages. Every model is an equal-vote Guardian the whole run; the Lead and Writer duties move between them stage by stage.

Actually runs the project; any failure goes back to the council and is worked to a genuine pass before finishing.

Duty this stageLeadC

Read
Docs + Web
Write / Edit
Run

Capabilities

Read
Docs + Web
Write / Edit
Run

Read & research belong to every Guardian. Write & Run belong to the Writer only, and only at Maverick (04) and Optimizer (05).

Per-stage duties

LeadA per-stage duty: coordinates the discussion and announces the outcome — votes with equal weight. It moves to a different Guardian each stage.
WriterA per-stage duty: the only model that writes code or runs commands, at the build stages.

01 Connector — Clarification Q&A Loop

Interactive question-and-answer loop between the council and the user — before any artifact is produced

Flow: The council collects open questions (missing information, scope ambiguities) and presents them to the user in a single batch. The user answers inline until every Guardian agrees the task is fully understood — the council itself decides when the scope is clear, with no fixed round limit (only your run budget or an explicit cancel bounds it). Only then does Stage 02 begin.

✓

Scope clarified — ready for planning

02 Strategist — Architecture Planning

Create the master plan, choose patterns, align on architecture

Action: The agents shift to planning mode. Based on context, they design folder structures and generic interfaces, resulting in a Master Plan.

1.Database Schema Definition

2.API Contracts & Types

03 Architect — Structural Logic

Translate the master plan into interfaces, schemas, contracts, and module boundaries

Building the actual scaffolding. Interfaces, classes, and empty functions are drafted to enforce strict type safety across the entire system.

interface UserData

→

class AuthController

04 Maverick — Code Draft

One model writes the binding code draft; all others review and raise change requests

Only one model (Writer) writes the binding code draft. All other Guardians critique the direction, sketch alternatives in prose (no code), and raise change requests to the Writer. One model (Lead) coordinates the flow. Contributions arrive in arrival order — no round-robin.

Brief

Draft

Change Requests

05 Optimizer — Challenge & Revision

All Guardians challenge the draft; the strongest optimization path is adopted

Action: All reviewing agents challenge the draft with performance and correctness feedback, the coordinating agent selects the final optimization path, and the authoring agent records the accepted revision.

O(N^2) Array.map->O(1) Hash Map

06 Guardian — Red Teaming

Adversarial review: actively surface vulnerabilities, edge cases, and unsafe assumptions

Mandatory Consensus

Veto Checkpoint Required: Guardians actively attempt to break the code (SQLi, race conditions, stale state). Voting is binary — yes or no. A model with any doubt votes no and must explain why; there is no separate concern or abstain vote.

How a NO resolves

A NO is never dropped or rubber-stamped. The Guardian that votes no must state why; the council then deliberates — surfacing counter-arguments and alternatives — until the objection resolves into a genuine yes.

Only run-budget exhaustion or the user cancelling the run advances a stage with an unresolved no, and in that case the final artifact carries an explicit unresolved-objection note visible in the War Room.

✓

VETOES RESOLVED → Ready for Final Audit

07 Debugger — Execution-Verify Gate

A council that doesn't just agree — it runs the code

Most AI tools vote on plausibility — they read the code and decide if it looks right. SAMI's Execution-Verify Gate makes the council vote on proven behaviour: it actually runs the project's build, tests, lint, and typecheck and feeds the real result back to the council before any verdict is reached.

Passing run — proven, not assumed. The council votes on a confirmed result.

Failing run — the real failure is surfaced to the council. If the council blocks on it, the same never-abort resolution applies: the council deliberates — bringing counter-arguments or alternatives — until the objection resolves into a genuine yes; only run-budget exhaustion or cancelling the run advances with an explicit note. The run is never silently stopped.

🔄

Iterative Refinement

Validation failed? - back to Stage 4 (Draft)
All review stages complete? - Ready for delivery

You Set the Autonomy

Three autonomy levels decide how much you approve along the way. They change the checkpoints you see — not the council. Multi-model supervision is always on at every level.

Guided

You confirm at each stage gate. Nothing advances to the next stage without your go-ahead.

You approve every stage

Balanced

The council advances on its own through routine stages and pauses for your call on the decisions that matter.

You approve key checkpoints

Autonomous

The council runs the full pipeline end to end and surfaces the result. You stay in control and can step in at any time.

You review the outcome

The council never switches off. Autonomy only governs how often the work pauses for your approval. Every stage is still reviewed and voted on by the full council, and a no still loops back to a genuine yes — even at the most autonomous level.

Technology Stack

Built with best-in-class frameworks and tools.

IDE Core

Cross-platform desktop runtime

Code Editor

Integrated Terminal

Editor API

Vector Search

SAMI Vector Core

Managed Embeddings

Reranker

Rank Fusion

Code Intel

AST parser

Symbol Index

Semantic Chunks

Orchestration

SAMI Agent Flow

Supervisor Pattern

Consensus

LLM Routing

SAMI Model Router

Managed Gateway

Secure Storage

Backend

REST API

Async Runtime

Managed Database

SAMI Search Engine

Privacy & Security

Managed models. Ephemeral processing. No permanent cloud storage. Your data stays under your control.

Managed Embeddings

Code indexing runs through SAMI's managed infrastructure. Your data is processed ephemerally and never stored permanently.

Managed Provider Vault

Provider credentials are encrypted and isolated inside SAMI's managed infrastructure.

Managed Model Access

For maximum consistency: Use only the models SAMI offers in your plan, with routing and billing handled centrally.

Download SAMI

The multi-model council is ready for your project — get started with the desktop app.

Download SAMI Documentation

System Overview Specification

Client Layer

Code Editor

Industry-standard editor

Multi-Agent Chat

Live discussion view

Terminal

Integrated shell

Diff View

Live Streaming

Multi-Agent Orchestration

SAMI Agent Flow

Agent Workflows

Supervisor

Multi-model council

Consensus

Voting Protocol

Code Intelligence

AST Parser

Semantic understanding

Symbol Index

Functions/Classes

Semantic Chunks

Smart Splitting

Hybrid Context Engine

SAMI Vector Core

Vector context

SAMI Search Engine

Keyword Search

Rank Fusion

Multi-retriever fusion

Reranker

Cross-encoder scoring

Model Execution Layer

Frontier reasoning

Top-tier reasoning models

Long-context drafting

Extended-context generation models

Vision-enabled

Multimodal vision models

Open-weights

Open-weights via managed providers

Code specialist

Reasoning-optimised code models

Privacy & Security

Managed models. Ephemeral processing. No permanent cloud storage. Your data stays under your control.

Managed Embeddings

Code indexing runs through SAMI's managed infrastructure. Your data is processed ephemerally and never stored permanently.

Managed Provider Vault

Provider credentials are encrypted and isolated inside SAMI's managed infrastructure.

Managed Model Access

For maximum consistency: Use only the models SAMI offers in your plan, with routing and billing handled centrally.