First release — 2026

The AI agent that
understands your system
before it touches it.

The agent and the knowledge base, working as one. — SYMBIOS AGENT is the first engineering agent built around a live World Model — a structured internal representation of your system's topology, history, and risk — running autonomously, around the clock. Built for ML and infra teams with complex, dependency-heavy systems.

See how it works Compare to alternatives

symbios — loop status

# symbios-agent · autonomous loop — running

health_checkevery 60s

task_discoveryevery 300s

self_planningevery 1800s

# world model state

cascade_riskLOW

evidencefresh

recommendationproceed

# awaiting next pass...

Who it's for

Built for a specific problem.

This is not a general-purpose AI assistant. It's infrastructure.

Right fit

✓ML / infra teams with recurring incidents
✓Complex platforms with critical dependencies
✓Engineers tired of manually triggering AI agents
✓Teams running self-hosted models (Ollama, vLLM)

Not the right fit

—Small projects without ops complexity
—Teams without their own infrastructure
—One-off code generation needs

In action

02:17. A service goes unhealthy.

No ticket. No wake-up call. Here's what happens next.

02:17 — Health pass detects unhealthy inference service

Cascade risk assessed: two downstream modules at High risk.

02:19 — World Model traversal triggers abort decision

A pending task touching affected modules is removed from queue.

02:20 — YAML playbook executed, audit written to JSONL

Remediation logged. Evidence freshness updated. No human involved.

09:00 — Your team reads the incident report

Full trace: what happened, what was decided, what was done.

Architecture

Three layers.
One system that runs itself.

Most engineering tools wait to be asked. SYMBIOS AGENT discovers work, assesses risk, and executes — on its own schedule, without a human in the loop.

01 · LOOP

Autonomous Daemon

A persistent process running three independent passes: health checks, task discovery, and self-directed planning. Restarts automatically on failure.

60s health checks with YAML remediation playbooks
300s task discovery from pending_tasks queue
1800s self-planning from E2E results + KB freshness
Exponential back-off restart: 5s → 300s, up to 10 retries
Watchdog: triggers restart if loop silent >120s

02 · COGNITION

World Model

A live internal representation of your system's structure. The agent queries it before any action to predict consequences and decide whether to proceed.

Directed dependency graph built from modules.json
4-level cascade risk via topological traversal
7-day incident window for risk scoring
Evidence freshness threshold: 24h
Confidence score 0.0–1.0 from quality pass rate

03 · MEMORY

Knowledge Base

A structured, typed, persistent store. Every run writes back to it. The agent's ground truth for what your system is, what happened, and what was decided.

Vector DB for semantic retrieval
JSONL audit trails for every action taken
Long-form reasoning logs (reflections.jsonl)
Retrieval benchmarks + semantic drift tracking
Self-generated work proposals in kb/self_plans/

60s

Health Pass

Polls all service endpoints, runs playbooks on failure, writes JSONL audit trail.

300s

Task Discovery

Scans pending task queue, checks world-model blockers, queues valid work.

1800s

Self-Planning

Reads E2E results and KB freshness, generates its own work proposals without prompting.

10s

Control Pulse

Loop heartbeat. Receives stop signal. Watchdog probes every 30s for liveness.

World Model

It knows what breaks
if you touch this.

Before executing any task, the agent traverses the dependency graph, checks incident history, and issues a go / hold / abort decision — without being asked.

What is a World Model? — A live map of your system, maintained by the agent. — A World Model is an internal representation of the environment that enables an agent to predict the consequences of actions before taking them. Rather than reacting to observations, the agent reasons from a maintained model of system state, causal relationships, and risk context. SYMBIOS AGENT implements this as a live dependency graph with confidence scoring, incident-weighted risk, and freshness-gated evidence — symbolic rather than neural, but structurally equivalent: it models what exists, how things are connected, and what happens if something changes.

Criticalunhealthy services > 0
AND incidents ≥ 2→ abort

Highincidents ≥ 3
OR unhealthy services ≥ 2→ manual review

Mediumincidents ≥ 2
OR unhealthy services ≥ 1→ extra validation

Lowno incidents,
all services healthy→ proceed

Risk is computed via topological traversal of the full dependency graph — not just immediate dependents.
Evidence freshness threshold: 24h. Returns graceful fallback if stale. Confidence: 0.0–1.0.

Knowledge Base

Structured memory
that grows with every run.

Not a conversation log. A typed, indexed record of your system's modules, incidents, evaluations, test history, and the agent's own reasoning — all queryable, all versioned.

kb/index/ — structured indexes

modules.jsonModule catalog + dep graph

incidents.jsonIndexed failure history

project_state.jsonLive service snapshot

world_model_phase0_latest.jsonQuality / latency evidence

e2e_pipeline_runs.jsonlEnd-to-end test history

eval_scoreboard.jsonModel evaluation results

semantic_drift_*.jsonlDrift detection runs

kb/agent_state/ — live state

runloop.jsonLive loop state

health_incidents.jsonlJSONL audit trail

remediation_actions.jsonlRemediation history

kb/ — agent reasoning

pending_tasks/Task intake queue

self_plans/Agent-generated work proposals

reflections.jsonlLong-form reasoning logs

vector_db/Semantic search index

Operational intelligence, not conversation memory.

Most tools that claim memory store what you told them in chat. SYMBIOS AGENT indexes what your system is — its topology, its failure patterns, its evaluation trajectory — and uses that to decide what to do next, without asking.

The KB is the agent's ground truth. Every health check, every task completion, every self-generated plan writes back into it. Evidence freshness is tracked. Confidence degrades with staleness. The agent won't act on stale information.

Drop a task file in kb/pending_tasks/ and the agent picks it up within 300 seconds — checks the world model for blockers, assesses cascade risk, and starts work. No conversation required.

Local Model Prompt Engineering

Local models perform differently.
The system adapts its prompts accordingly.

Running a local model isn't just swapping an API endpoint. Instruction-tuned models have different sensitivity to system prompt structure, verbosity, and framing. SYMBIOS AGENT ships a dedicated prompt policy engine that resolves the right profile per model, per intent — automatically.

Profile

4 Prompt Profiles

Each profile shapes system prompt structure, verbosity, and instruction framing to match the task type and model behavior.

balanced — default, general-purpose framing
code-strict — minimal prose, precise code instructions
reasoning-deep — chain-of-thought scaffolding, extended context
concise — compressed prompts for fast, lightweight models

Routing

Intent-Based Lane Classifier

Before routing to a local model, the request is classified into a lane. The lane determines which model and which prompt profile are applied.

reasoning lane — complex multi-step tasks
coding lane — code generation, review, debugging
balanced lane — general queries and planning
Profile saved as user default via persistent chat preferences

Pipeline

End-to-End Policy Resolution

Every local model call goes through a centralized policy resolver that determines the effective profile, mode, and routed model — with full audit trail.

UI selector → local_prompt_profile_hint
Settings endpoint persists preference
Policy engine resolves effective profile + source
System prompt built with explicit "Local model profile" section

Policy resolution pipeline

UI Profile Selector

→

chat_preferences

→

prompt_policy.py

→

lane classifier

→

ollama_routes.py

→

optimized system prompt

Fallback: BALANCED profile applied automatically if no preference set. Regression-tested in test_prompt_profile_propagation.py.

REST API

10 endpoints.
Full world-state access over HTTP.

Query the dependency graph, assess cascade risk before a deploy, read incident history, control the daemon — all from your existing toolchain.

Method	Endpoint	Description
GET	/api/v1/world/health	Service up check
GET	/api/v1/world/state	Full project snapshot
GET	/api/v1/world/dependencies/{module_id}	Reverse dependency graph for a module
GET	/api/v1/world/cascade?modules=a,b,c	Cascade risk for a set of modules
GET	/api/v1/world/incidents/{module_id}?days=7	Incident history, 7-day default window
GET	/api/v1/world/loop/status	Daemon state: running / stopped / failed
POST	/api/v1/world/loop/start	Start daemon (async)
POST	/api/v1/world/loop/stop	Graceful stop
GET	/api/v1/world/loop/metrics	All loop counters — tasks, incidents, plans, uptime
GET	/api/v1/world/plans	Self-generated work queue

Skill Library

37 skills across six domains.
Including ML infrastructure — natively.

No other autonomous agent ships with native GPU and ML infrastructure skills. Deploy inference servers, switch GPU modes, optimize cache — without manual orchestration.

Development · 8

code-review
debug-investigate
test-generate
refactor-plan
dependency-audit
validate-edit
regression-check
smoke-test

ML / GPU Infrastructure · 7

gpu-monitor
gpu-mode-switch
benchmark-gpu
deploy-vllm
vllm-health
kv-cache-optimize
ollama-manage

DevOps / System · 8

service-manager
docker-compose-gen
env-setup
disk-health
network-check
linux-diagnostic
netplan-config
backup-snapshot

Project Management · 5

task-intake
kb-sync
incident-report
progress-report
decision-record

Research / Analysis · 5

web-research
paper-analyze
compare-tools
changelog-digest
youtube-report

Audio / Media · 4

speech-pipeline
speech-stt
speech-tts
comfyui-workflow

Comparison

What existing tools don't do.

The capabilities below are unique to SYMBIOS AGENT. Reactive agents — IDE assistants, cloud coding agents, memory bolt-ons — share none of them in combination. To our knowledge at time of publication, based on publicly documented architectures and capabilities.

Capability	SYMBIOS AGENT	Memory tools	IDE agents	Cloud agents
Live dependency graph (World Model)	✓	—	—	—
Cascade risk before every action	✓	—	—	—
Autonomous 3-pass loop (no trigger)	✓	—	—	—
Self-generated work proposals	✓	—	—	—
Native GPU / ML infra skills	✓	—	—	partial
Persistent structured KB	✓	conversation only	static files	—
JSONL audit trail + reasoning logs	✓	—	—	—
Self-hosted, $0 inference cost	✓	self-host option	—	enterprise tier
Watchdog + exponential back-off restart	✓	—	—	—
REST API for external toolchain	✓	✓	—	✓

By the numbers

Concrete, not claimed.

Built-in skills

Across development, ML infra, DevOps, project management, research, and media processing.

REST API endpoints

Full world-state access — dependency graph, cascade risk, loop control, metrics.

Cascade risk levels

Critical / High / Medium / Low, computed via topological traversal before every action.

Autonomous passes

Health (60s), task discovery (300s), self-planning (1800s). Zero human triggers needed.

Pricing

Simple tiers.
No inference costs on local models.

Run on your own hardware with Ollama or vLLM — $0 per token. You pay for the system, not the usage. Infrastructure and energy costs remain yours.

Monthly Annual SAVE 20%

Community

Patreon
Access

1 user

● Live now

1 user seat
Full autonomous daemon
World Model + KB
37 skills
REST API (10 endpoints)
Community support (Patreon)

Join on Patreon

Solo

$29

/ month · 1 seat

● Live now

1 user seat
Full autonomous daemon
World Model + KB
37 skills
REST API (10 endpoints)
Async support
Configuration guidance
No team management
No custom skill dev

Get Started

— In development

Request Access

Team

$99

/ month · 5 seats

◌ Early access — founding member pricing locked in

Up to 5 seats
Full autonomous daemon
World Model + KB
37 skills
REST API + async support
Onboarding + playbook review
No custom skill dev
No dedicated support

— In development

Request Access

Studio

$299

/ month · up to 15 licences · 3 teams

◌ Early access — we onboard your team personally

Up to 15 licences / 3 teams
Full autonomous daemon
World Model + KB
37 skills
REST API + async support
Custom skill development
Dedicated async support
Personal onboarding for each team

The system runs entirely on your infrastructure. We don't access your environment. Support is async — documentation, configuration guidance, and custom feature development delivered remotely.

Request Access

Join the waitlist.
Founding member pricing locked in.

Team and Studio tiers are in early access. Tell us about your use case — we onboard in order of fit and readiness.

Name & Email

Team size

Primary use case

Timeline

What happens after you submit

We review your use case

We read every submission. We prioritize teams with concrete ops pain over explorers.

We reach out within 48h

Short async exchange to confirm fit and answer any technical questions about your infra.

You install, we deliver

You run the installer on your infrastructure. We deliver config, playbooks, and documentation. We never access your systems.

Founding member pricing locked

Your rate is locked for as long as you stay subscribed — regardless of future price increases.

Early access

Stop assigning tickets to AI.
Let it find the work itself.

SYMBIOS AGENT is in limited release. We're onboarding engineers and teams ready to run a system that doesn't wait to be asked.

Request Access

The AI agent thatunderstands your systembefore it touches it.

Built for a specific problem.

02:17. A service goes unhealthy.

Three layers.One system that runs itself.

Autonomous Daemon

World Model

Knowledge Base

It knows what breaksif you touch this.

Structured memorythat grows with every run.

Operational intelligence, not conversation memory.

Local models perform differently.The system adapts its prompts accordingly.

4 Prompt Profiles

Intent-Based Lane Classifier

End-to-End Policy Resolution

10 endpoints.Full world-state access over HTTP.

37 skills across six domains.Including ML infrastructure — natively.

What existing tools don't do.

Concrete, not claimed.

Simple tiers.No inference costs on local models.

Join the waitlist.Founding member pricing locked in.

What happens after you submit

Stop assigning tickets to AI.Let it find the work itself.

The AI agent that
understands your system
before it touches it.

Three layers.
One system that runs itself.

It knows what breaks
if you touch this.

Structured memory
that grows with every run.

Local models perform differently.
The system adapts its prompts accordingly.

10 endpoints.
Full world-state access over HTTP.

37 skills across six domains.
Including ML infrastructure — natively.

Simple tiers.
No inference costs on local models.

Join the waitlist.
Founding member pricing locked in.

Stop assigning tickets to AI.
Let it find the work itself.