Files

Adolfo Delorenzo ac54d819f8 docs(02): add research and validation strategy

2026-03-23 14:16:42 -06:00

phase, slug, status, nyquist_compliant, wave_0_complete, created

phase	slug	status	nyquist_compliant	wave_0_complete	created
2	agent-features	draft	false	false	2026-03-23

Phase 2 — Validation Strategy

Per-phase validation contract for feedback sampling during execution.

Test Infrastructure

Property	Value
Framework	pytest 8.x + pytest-asyncio (existing from Phase 1)
Config file	`pyproject.toml` (existing `[tool.pytest.ini_options]`)
Quick run command	`pytest tests/unit -x -q`
Full suite command	`pytest tests/ -x`
Estimated runtime	~45 seconds

Task ID	Plan	Wave	Requirement	Test Type	Automated Command	File Exists	Status
02-01-xx	01	1	AGNT-02	unit	`pytest tests/unit/test_memory_short_term.py -x`	❌ W0	⬜ pending
02-01-xx	01	1	AGNT-03	integration	`pytest tests/integration/test_memory_long_term.py -x`	❌ W0	⬜ pending
02-02-xx	02	1	AGNT-04	unit	`pytest tests/unit/test_tool_registry.py tests/unit/test_tool_executor.py -x`	❌ W0	⬜ pending
02-02-xx	02	1	AGNT-06	integration	`pytest tests/integration/test_audit.py -x`	❌ W0	⬜ pending
02-03-xx	03	2	CHAN-03	unit	`pytest tests/unit/test_whatsapp_verify.py tests/unit/test_whatsapp_normalize.py -x`	❌ W0	⬜ pending
02-03-xx	03	2	CHAN-04	unit	`pytest tests/unit/test_whatsapp_scoping.py -x`	❌ W0	⬜ pending
02-04-xx	04	2	AGNT-05	unit+integ	`pytest tests/unit/test_escalation.py tests/integration/test_escalation.py -x`	❌ W0	⬜ pending

Status: ⬜ pending · ✅ green · ❌ red · ⚠️ flaky

tests/unit/test_memory_short_term.py — AGNT-02: Redis sliding window
tests/integration/test_memory_long_term.py — AGNT-03: pgvector retrieval + tenant isolation
tests/unit/test_tool_registry.py — AGNT-04: tool registry lookup
tests/unit/test_tool_executor.py — AGNT-04: schema validation + confirmation
tests/integration/test_audit.py — AGNT-06: audit immutability
tests/unit/test_escalation.py — AGNT-05: transcript packaging
tests/integration/test_escalation.py — AGNT-05: DM delivery
tests/unit/test_whatsapp_verify.py — CHAN-03: webhook signature verification
tests/unit/test_whatsapp_normalize.py — CHAN-03: message normalization
tests/unit/test_whatsapp_scoping.py — CHAN-04: business-function gate
tests/conftest.py — extend with pgvector fixtures, mock MinIO (moto)
Install: uv add --dev moto (S3/MinIO mocking)

Behavior	Requirement	Why Manual	Test Instructions
WhatsApp message delivery via live Cloud API	CHAN-03	Requires Meta-verified WABA + phone number	Send test message to configured WhatsApp number, verify reply
Media (image) round-trip via WhatsApp	CHAN-03	Requires live WhatsApp + multimodal LLM	Send photo to agent, verify interpretation; request doc from agent

Approval: pending