SoAZCloud — AI Infrastructure One-Pager

Production Agents

Conversational Chat Agent — External
Serves a large external customer base from one instance. Three-layer data isolation. 89% token cost reduction achieved through prompt caching and model selection.
Technical Reference Agent — External
4,052 manual sections, 3,241 verified facts. FTS5 search. Zero-fabrication policy.
Support Operations — Internal
Automated ticket triage with GitHub code context. 1,600+ memories from months of operation.
Security Operations — Internal
CrowdStrike monitoring, SOC 2 automation, daily compliance verification.
Data Acquisition / Provisioning / Sales / Account Mgmt — Internal
Scraping, onboarding automation, sales enablement, and account support — multiple agents replacing headcount across internal teams.
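
As an illustration of the FTS5 search behind the Technical Reference Agent, a lookup might resemble the sketch below. It assumes Python's sqlite3 module was built with FTS5 support. The table name "manual", its columns, and the sample rows are hypothetical and are not taken from the production system. An empty result is treated as "no source found", which is one way an agent could enforce a zero-fabrication policy.

```python
import sqlite3


def build_index(sections):
    """Create an in-memory FTS5 index from (title, body) pairs.

    The table name `manual` and its columns are hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE manual USING fts5(title, body)")
    conn.executemany("INSERT INTO manual (title, body) VALUES (?, ?)", sections)
    return conn


def search(conn, query, limit=3):
    """Return matching section titles, best match first.

    An empty list means no source was found, so the caller should
    decline to answer rather than improvise.
    """
    rows = conn.execute(
        "SELECT title FROM manual WHERE manual MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    ).fetchall()
    return [title for (title,) in rows]
```

The query string is passed to FTS5 as-is, so raw user input with quotes or operators would need escaping before it reaches search().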

Shared Semantic Memory

A fleet of agents shares a persistent memory system built on PostgreSQL + pgvector with local embeddings (768-dim, zero API cost). When one agent solves a problem, every other agent benefits. Memory types: architectural decisions, discovered patterns, and verified references. Accessible via API and MCP server for IDE integration.
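
The write-once, recall-anywhere behavior described above can be sketched in a few lines. This is a minimal in-memory stand-in, not the production store. It ranks memories by cosine distance, the same metric pgvector exposes through its `<=>` operator. The class name, the memory type labels, and the 3-dimensional toy vectors (in place of 768-dimensional embeddings) are assumptions made for illustration.

```python
import math
from dataclasses import dataclass
from typing import Sequence

# Hypothetical labels for the three memory types named in the text.
MEMORY_TYPES = {"decision", "pattern", "reference"}


@dataclass
class Memory:
    agent: str
    kind: str
    text: str
    embedding: Sequence[float]


def cosine_distance(a, b):
    """1 - cosine similarity; smaller means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / norm


class SharedMemory:
    """In-memory stand-in for a shared PostgreSQL + pgvector table."""

    def __init__(self):
        self._rows = []

    def remember(self, agent, kind, text, embedding):
        if kind not in MEMORY_TYPES:
            raise ValueError(f"unknown memory type: {kind}")
        self._rows.append(Memory(agent, kind, text, list(embedding)))

    def recall(self, query_embedding, k=3):
        """Return the k nearest memories, regardless of which agent wrote them."""
        return sorted(
            self._rows,
            key=lambda m: cosine_distance(m.embedding, query_embedding),
        )[:k]
```

Because recall() does not filter by agent, a memory written by one agent is immediately visible to every other agent that queries the store.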

Capabilities

Multi-Agent Orchestration

Unified gateway across Slack, HTTP, WebSocket, cron

Multi-Tenant Architecture

Many customers from one instance with full data isolation

Cost Engineering

89% token cost reduction through prompt caching and model selection

Security Hardening

40 capabilities dropped, environment-variable filtering, sandboxed execution

Self-Healing Ops

Nightly updates with smoke tests, auto-fix, rollback

Hybrid Cloud + Edge

EC2 + edge nodes via encrypted mesh networking
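
The Self-Healing Ops flow listed above (update, smoke test, auto-fix, rollback) could be sketched as the control loop below. The function name, the four callables, and the retry limit are hypothetical; the actual update tooling is not described in this document.

```python
from typing import Callable


def nightly_update(
    apply_update: Callable[[], None],
    smoke_test: Callable[[], bool],
    auto_fix: Callable[[], None],
    rollback: Callable[[], None],
    max_fix_attempts: int = 1,
) -> str:
    """Apply an update and keep it only if the smoke test passes.

    Returns "updated", "fixed", or "rolled_back".
    """
    apply_update()
    if smoke_test():
        return "updated"

    # The smoke test failed: try an automated fix before giving up.
    for _ in range(max_fix_attempts):
        auto_fix()
        if smoke_test():
            return "fixed"

    # Still failing: restore the previous known-good state.
    rollback()
    return "rolled_back"
```

Passing the steps in as callables keeps the control flow testable without touching real infrastructure.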

Architecture Philosophy

One agent per function: Each agent has a persistent identity, memory, tools, and security boundaries, and learns over time.
Workspace files as behavioral contracts: Agent behavior lives in versioned markdown, not application code. Hot-reloadable.
Local embeddings, not API calls: 300MB model on-instance. Zero per-query cost, lower latency, no external dependency.
Docker sandboxing as security boundary: Session-scoped containers. Credentials are injected, never stored in agent-readable files.
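
To make "hot-reloadable" workspace files concrete, one possible approach is to re-read a markdown file whenever its modification time changes. The sketch below assumes that approach; the class name and the example file name AGENT.md are hypothetical, and the real reload mechanism may differ.

```python
from pathlib import Path


class WorkspaceContract:
    """Serves the text of a markdown behavior file, reloading it on change."""

    def __init__(self, path):
        self.path = Path(path)
        self._mtime_ns = None  # modification time of the cached copy
        self._text = ""

    def text(self):
        """Return the current file contents, re-reading only if the file changed."""
        mtime_ns = self.path.stat().st_mtime_ns
        if mtime_ns != self._mtime_ns:
            self._text = self.path.read_text(encoding="utf-8")
            self._mtime_ns = mtime_ns
        return self._text
```

An agent loop could call WorkspaceContract("AGENT.md").text() at the start of each session, so an edit to the versioned file takes effect without a redeploy.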