ADR-0017: M4 Autonomous Development Loop — Concept Capture

Status

proposed — This is a vision capture, not an implementation commitment. Status will move to accepted when M4 sprint is scheduled.

Context

The current development workflow requires the founder to act as a manual courier between three agent roles:

Strategist (Claude chat / co-founder role) — formulates tasks, writes prompts, reviews results
Developer (Claude Code on VPS) — implements code, runs tests, commits
QA / Testers (Perplexity, Cowork, other LLM agents) — test the live product in browser, find bugs, produce reports

The founder manually copies prompts from strategist to developer, build results from developer to QA, bug reports from QA back to strategist, and fix prompts back to developer. This loop repeats until the quality target is met.

Estimated 60-70% of dev session time is spent on this mechanical courier work — the same class of problem that M3 Deliberation Chamber solved for strategic questions, but applied to the entire development cycle.

This vision was articulated on 2026-04-17 during a session where M3 Chamber UI was completed, and the pattern became clear: if Chamber can automate multi-LLM deliberation for strategic questions, the same orchestration pattern can automate the strategist→developer→QA loop.

Decision

Capture the M4 Autonomous Development Loop as a strategic roadmap document for future implementation. The concept introduces four agent types:

Health Agents — continuous quality monitoring per module (M1, M2, M3)
SOS Agents — emergency automated diagnosis and escalation when something breaks
QA Agents — multi-LLM browser-based testing replacing manual QA handoffs
Courier Agent (Orchestrator) — replaces the founder’s courier role, managing the full strategist→developer→QA loop

The founder’s role shifts from courier to decision-maker, vision-setter, and quality arbiter of last resort — intervening only per CriticalityPolicy Level 3.

Implementation is deferred to post-M2 completion. This ADR captures the vision so it is preserved across sessions and available to future sprints.

Alternatives Considered

(a) Keep manual workflow. The founder continues as courier indefinitely. Rejected because it caps development velocity at founder availability and wastes 60-70% of session time on mechanical work.

(b) Partial automation — QA only. Automate only the QA step (multi-LLM browser testing) while keeping manual strategist↔developer handoff. This is simpler to build and delivers the most immediate time savings (QA is the most tedious handoff). However, it leaves the strategist→developer handoff manual and doesn’t address the full courier problem. Could serve as a stepping stone toward full M4.

(c) Full autonomous loop (captured). Automate the entire strategist→developer→QA cycle with a Courier Agent orchestrator and specialized Health/SOS/QA agents. Most ambitious, highest payoff, highest implementation cost. This is the vision captured in this ADR.

Consequences

Positive:

Vision is preserved in manifest — future sessions have full context without founder re-explaining
Clear articulation of agent types (Health, SOS, QA, Courier) provides implementation targets
Relationship to existing modules (M1, M2, M3) and governance (Constitution, CriticalityPolicy) is documented
Open questions are explicitly captured for resolution before implementation begins

Negative / Risks:

Implementation is deferred — no immediate workflow improvement
Vision may evolve as M2 build reveals new patterns or constraints
Cost model for multi-LLM QA loops is unknown — could be expensive at scale
Security model for Courier Agent (VPS access, git, Streamlit) needs design

Neutral:

Does not affect current M2 sprint or any in-progress work
Does not create technical debt — this is a concept document only

References

Autonomous-Dev-Loop — full roadmap document with detailed vision
Constitution — Law 1 (Founder Interests), Law 5 (Human Veto), Law 9 (Safe defaults)
CriticalityPolicy — escalation levels determining founder intervention
ADR-0014-m3-deliberation-chamber — M3 Chamber design (pattern M4 extends)
ADR-0015-m3-chamber-implementation — M3 implementation record
ADR-0016-chamber-v2-vision — Chamber v2 criticality levels (M4 relies on)
ADR-0013-m2-team-implementation-navigator — M2, prerequisite for M4
NorthStar — 80% autonomous closure target aligns with M4 vision

Synth Nova Manifest

Explorer

ADR-0017: M4 Autonomous Development Loop — Concept Capture

ADR-0017: M4 Autonomous Development Loop — Concept Capture

Status

Context

Decision

Alternatives Considered

Consequences

References

Graph View

Table of Contents

Backlinks