First release — 2026

The AI agent that
understands your system
before it touches it.

The agent and the knowledge base, working as one. — SYMBIOS AGENT is the first engineering agent built around a live World Model — a structured internal representation of your system's topology, history, and risk — running autonomously, around the clock. Built for ML and infra teams with complex, dependency-heavy systems.

symbios — loop status
# symbios-agent · autonomous loop — running
health_checkevery 60s
task_discoveryevery 300s
self_planningevery 1800s
# world model state
cascade_riskLOW
evidencefresh
recommendationproceed
# awaiting next pass...
Who it's for

Built for a specific problem.

This is not a general-purpose AI assistant. It's infrastructure.

Right fit
  • ML / infra teams with recurring incidents
  • Complex platforms with critical dependencies
  • Engineers tired of manually triggering AI agents
  • Teams running self-hosted models (Ollama, vLLM)
Not the right fit
  • Small projects without ops complexity
  • Teams without their own infrastructure
  • One-off code generation needs
In action

02:17. A service goes unhealthy.

No ticket. No wake-up call. Here's what happens next.

02:17 — Health pass detects unhealthy inference service
Cascade risk assessed: two downstream modules at High risk.
02:19 — World Model traversal triggers abort decision
A pending task touching affected modules is removed from queue.
02:20 — YAML playbook executed, audit written to JSONL
Remediation logged. Evidence freshness updated. No human involved.
09:00 — Your team reads the incident report
Full trace: what happened, what was decided, what was done.
Architecture

Three layers.
One system that runs itself.

Most engineering tools wait to be asked. SYMBIOS AGENT discovers work, assesses risk, and executes — on its own schedule, without a human in the loop.

01 · LOOP

Autonomous Daemon

A persistent process running three independent passes: health checks, task discovery, and self-directed planning. Restarts automatically on failure.

  • 60s health checks with YAML remediation playbooks
  • 300s task discovery from pending_tasks queue
  • 1800s self-planning from E2E results + KB freshness
  • Exponential back-off restart: 5s → 300s, up to 10 retries
  • Watchdog: triggers restart if loop silent >120s
02 · COGNITION

World Model

A live internal representation of your system's structure. The agent queries it before any action to predict consequences and decide whether to proceed.

  • Directed dependency graph built from modules.json
  • 4-level cascade risk via topological traversal
  • 7-day incident window for risk scoring
  • Evidence freshness threshold: 24h
  • Confidence score 0.0–1.0 from quality pass rate
03 · MEMORY

Knowledge Base

A structured, typed, persistent store. Every run writes back to it. The agent's ground truth for what your system is, what happened, and what was decided.

  • Vector DB for semantic retrieval
  • JSONL audit trails for every action taken
  • Long-form reasoning logs (reflections.jsonl)
  • Retrieval benchmarks + semantic drift tracking
  • Self-generated work proposals in kb/self_plans/
60s
Health Pass
Polls all service endpoints, runs playbooks on failure, writes JSONL audit trail.
300s
Task Discovery
Scans pending task queue, checks world-model blockers, queues valid work.
1800s
Self-Planning
Reads E2E results and KB freshness, generates its own work proposals without prompting.
10s
Control Pulse
Loop heartbeat. Receives stop signal. Watchdog probes every 30s for liveness.
World Model

It knows what breaks
if you touch this.

Before executing any task, the agent traverses the dependency graph, checks incident history, and issues a go / hold / abort decision — without being asked.

What is a World Model? — A live map of your system, maintained by the agent. — A World Model is an internal representation of the environment that enables an agent to predict the consequences of actions before taking them. Rather than reacting to observations, the agent reasons from a maintained model of system state, causal relationships, and risk context. SYMBIOS AGENT implements this as a live dependency graph with confidence scoring, incident-weighted risk, and freshness-gated evidence — symbolic rather than neural, but structurally equivalent: it models what exists, how things are connected, and what happens if something changes.
Criticalunhealthy services > 0
AND incidents ≥ 2
→ abort
Highincidents ≥ 3
OR unhealthy services ≥ 2
→ manual review
Mediumincidents ≥ 2
OR unhealthy services ≥ 1
→ extra validation
Lowno incidents,
all services healthy
→ proceed

Risk is computed via topological traversal of the full dependency graph — not just immediate dependents.
Evidence freshness threshold: 24h. Returns graceful fallback if stale. Confidence: 0.0–1.0.

Knowledge Base

Structured memory
that grows with every run.

Not a conversation log. A typed, indexed record of your system's modules, incidents, evaluations, test history, and the agent's own reasoning — all queryable, all versioned.

kb/index/ — structured indexes
modules.jsonModule catalog + dep graph
incidents.jsonIndexed failure history
project_state.jsonLive service snapshot
world_model_phase0_latest.jsonQuality / latency evidence
e2e_pipeline_runs.jsonlEnd-to-end test history
eval_scoreboard.jsonModel evaluation results
semantic_drift_*.jsonlDrift detection runs
kb/agent_state/ — live state
runloop.jsonLive loop state
health_incidents.jsonlJSONL audit trail
remediation_actions.jsonlRemediation history
kb/ — agent reasoning
pending_tasks/Task intake queue
self_plans/Agent-generated work proposals
reflections.jsonlLong-form reasoning logs
vector_db/Semantic search index

Operational intelligence, not conversation memory.

Most tools that claim memory store what you told them in chat. SYMBIOS AGENT indexes what your system is — its topology, its failure patterns, its evaluation trajectory — and uses that to decide what to do next, without asking.

The KB is the agent's ground truth. Every health check, every task completion, every self-generated plan writes back into it. Evidence freshness is tracked. Confidence degrades with staleness. The agent won't act on stale information.

Drop a task file in kb/pending_tasks/ and the agent picks it up within 300 seconds — checks the world model for blockers, assesses cascade risk, and starts work. No conversation required.

Local Model Prompt Engineering

Local models perform differently.
The system adapts its prompts accordingly.

Running a local model isn't just swapping an API endpoint. Instruction-tuned models have different sensitivity to system prompt structure, verbosity, and framing. SYMBIOS AGENT ships a dedicated prompt policy engine that resolves the right profile per model, per intent — automatically.

Profile

4 Prompt Profiles

Each profile shapes system prompt structure, verbosity, and instruction framing to match the task type and model behavior.

  • balanced — default, general-purpose framing
  • code-strict — minimal prose, precise code instructions
  • reasoning-deep — chain-of-thought scaffolding, extended context
  • concise — compressed prompts for fast, lightweight models
Routing

Intent-Based Lane Classifier

Before routing to a local model, the request is classified into a lane. The lane determines which model and which prompt profile are applied.

  • reasoning lane — complex multi-step tasks
  • coding lane — code generation, review, debugging
  • balanced lane — general queries and planning
  • Profile saved as user default via persistent chat preferences
Pipeline

End-to-End Policy Resolution

Every local model call goes through a centralized policy resolver that determines the effective profile, mode, and routed model — with full audit trail.

  • UI selector → local_prompt_profile_hint
  • Settings endpoint persists preference
  • Policy engine resolves effective profile + source
  • System prompt built with explicit "Local model profile" section
Policy resolution pipeline
UI Profile Selector
chat_preferences
prompt_policy.py
lane classifier
ollama_routes.py
optimized system prompt
Fallback: BALANCED profile applied automatically if no preference set. Regression-tested in test_prompt_profile_propagation.py.
REST API

10 endpoints.
Full world-state access over HTTP.

Query the dependency graph, assess cascade risk before a deploy, read incident history, control the daemon — all from your existing toolchain.

MethodEndpointDescription
GET/api/v1/world/healthService up check
GET/api/v1/world/stateFull project snapshot
GET/api/v1/world/dependencies/{module_id}Reverse dependency graph for a module
GET/api/v1/world/cascade?modules=a,b,cCascade risk for a set of modules
GET/api/v1/world/incidents/{module_id}?days=7Incident history, 7-day default window
GET/api/v1/world/loop/statusDaemon state: running / stopped / failed
POST/api/v1/world/loop/startStart daemon (async)
POST/api/v1/world/loop/stopGraceful stop
GET/api/v1/world/loop/metricsAll loop counters — tasks, incidents, plans, uptime
GET/api/v1/world/plansSelf-generated work queue
Skill Library

37 skills across six domains.
Including ML infrastructure — natively.

No other autonomous agent ships with native GPU and ML infrastructure skills. Deploy inference servers, switch GPU modes, optimize cache — without manual orchestration.

Development · 8
  • code-review
  • debug-investigate
  • test-generate
  • refactor-plan
  • dependency-audit
  • validate-edit
  • regression-check
  • smoke-test
ML / GPU Infrastructure · 7
  • gpu-monitor
  • gpu-mode-switch
  • benchmark-gpu
  • deploy-vllm
  • vllm-health
  • kv-cache-optimize
  • ollama-manage
DevOps / System · 8
  • service-manager
  • docker-compose-gen
  • env-setup
  • disk-health
  • network-check
  • linux-diagnostic
  • netplan-config
  • backup-snapshot
Project Management · 5
  • task-intake
  • kb-sync
  • incident-report
  • progress-report
  • decision-record
Research / Analysis · 5
  • web-research
  • paper-analyze
  • compare-tools
  • changelog-digest
  • youtube-report
Audio / Media · 4
  • speech-pipeline
  • speech-stt
  • speech-tts
  • comfyui-workflow
Comparison

What existing tools don't do.

The capabilities below are unique to SYMBIOS AGENT. Reactive agents — IDE assistants, cloud coding agents, memory bolt-ons — share none of them in combination. To our knowledge at time of publication, based on publicly documented architectures and capabilities.

Capability SYMBIOS AGENT Memory tools IDE agents Cloud agents
Live dependency graph (World Model)
Cascade risk before every action
Autonomous 3-pass loop (no trigger)
Self-generated work proposals
Native GPU / ML infra skillspartial
Persistent structured KBconversation onlystatic files
JSONL audit trail + reasoning logs
Self-hosted, $0 inference costself-host optionenterprise tier
Watchdog + exponential back-off restart
REST API for external toolchain
By the numbers

Concrete, not claimed.

37
Built-in skills
Across development, ML infra, DevOps, project management, research, and media processing.
10
REST API endpoints
Full world-state access — dependency graph, cascade risk, loop control, metrics.
4
Cascade risk levels
Critical / High / Medium / Low, computed via topological traversal before every action.
3
Autonomous passes
Health (60s), task discovery (300s), self-planning (1800s). Zero human triggers needed.
Pricing

Simple tiers.
No inference costs on local models.

Run on your own hardware with Ollama or vLLM — $0 per token. You pay for the system, not the usage. Infrastructure and energy costs remain yours.

Monthly Annual SAVE 20%
Community
Patreon
Access
1 user
● Live now
  • 1 user seat
  • Full autonomous daemon
  • World Model + KB
  • 37 skills
  • REST API (10 endpoints)
  • Community support (Patreon)
Join on Patreon
Solo
$29
/ month · 1 seat
● Live now
  • 1 user seat
  • Full autonomous daemon
  • World Model + KB
  • 37 skills
  • REST API (10 endpoints)
  • Async support
  • Configuration guidance
  • No team management
  • No custom skill dev
Get Started
— In development
Request Access
Team
$99
/ month · 5 seats
◌ Early access — founding member pricing locked in
  • Up to 5 seats
  • Full autonomous daemon
  • World Model + KB
  • 37 skills
  • REST API + async support
  • Onboarding + playbook review
  • No custom skill dev
  • No dedicated support
— In development
Request Access
Studio
$299
/ month · up to 15 licences · 3 teams
◌ Early access — we onboard your team personally
  • Up to 15 licences / 3 teams
  • Full autonomous daemon
  • World Model + KB
  • 37 skills
  • REST API + async support
  • Custom skill development
  • Dedicated async support
  • Personal onboarding for each team

The system runs entirely on your infrastructure. We don't access your environment. Support is async — documentation, configuration guidance, and custom feature development delivered remotely.

Request Access

Join the waitlist.
Founding member pricing locked in.

Team and Studio tiers are in early access. Tell us about your use case — we onboard in order of fit and readiness.

What happens after you submit

01
We review your use case

We read every submission. We prioritize teams with concrete ops pain over explorers.

02
We reach out within 48h

Short async exchange to confirm fit and answer any technical questions about your infra.

03
You install, we deliver

You run the installer on your infrastructure. We deliver config, playbooks, and documentation. We never access your systems.

04
Founding member pricing locked

Your rate is locked for as long as you stay subscribed — regardless of future price increases.

Early access

Stop assigning tickets to AI.
Let it find the work itself.

SYMBIOS AGENT is in limited release. We're onboarding engineers and teams ready to run a system that doesn't wait to be asked.

Request Access