Manual
System monitors only — no automated actions.
- Human role
- All actions performed manually.
- Real-world example
- A blinking server light. The admin reads logs and reacts.
The L0–L5 scale inherits the TM Forum AN baseline and extends it with AI-specific dimensions: cost circuit breakers, LLM output validation, model fallback chains, tool-use scoping, eval feedback loop.
System monitors only — no automated actions.
Executes pre-configured deterministic subtasks (cron, pipeline step). Failures emit raw logs.
Closed-loop ops in narrow domain with static rules. Health endpoint, classified errors, post-run status to ops dashboard.
Real-time sensing via structured observability (traces, eval scores). Heartbeat, post-deploy smoke gate, validated credentials, explicit exception hierarchy. Adapts via dynamic policies.
Cross-domain self-detection, self-diagnosis, self-recovery. Retry-with-classification, circuit breakers, fallback chains, state checkpointing, recovery audit trail, known-fix mapping, post-recovery verification, hard cost circuit breakers for LLM calls.
Closed-loop across services. Proactive self-improvement: A/B testing prompts and models, dynamic model routing, cost governance with auto-downgrade, eval-driven prompt refinement, knowledge graph evolution.
Five dimensions absent from the original TM Forum baseline. Mandatory from level L2+.
Real incident 2026-04-29: $437 OpenAI bill in 8 hours from a runaway loop. Hard cost CB is non-negotiable.
Prompt injection (OWASP LLM01) and schema drift in LLM output. Zod/JSON schema at L3, corrective retry at L4.
Provider outage / deprecation / rate limit demands runtime model swap (Claude → GPT → local). Single-model agents fail the L4 gate.
Agentic file-write and shell-exec are a new attack surface (OWASP LLM06 Excessive Agency). High-risk tools are sandboxed at L4.
Self-optimization without labeled eval has no signal. Every decision gets a downstream signal; the meta-loop tunes prompts.