konstruct

Author	SHA1	Message	Date
Adolfo Delorenzo	a64634ff90	feat(10-02): mount KB and calendar routers, update tool registry and prompt builder - Mount kb_router and calendar_auth_router on gateway (Phase 10 agent capabilities) - Update calendar_lookup tool schema with action/event_summary/event_start/event_end params - Add tool result formatting instruction to build_system_prompt when tools assigned (CAP-06) - Add kb_router and calendar_auth_router to shared/api/__init__.py exports - Confirm CAP-04 (http_request) and CAP-07 (audit logging) already working	2026-03-26 09:10:01 -06:00
Adolfo Delorenzo	7d3a393758	feat(08-03): push notification backend — DB model, migration, API router, VAPID setup - Add PushSubscription ORM model with unique(user_id, endpoint) constraint - Add Alembic migration 012 for push_subscriptions table - Add push router (subscribe, unsubscribe, send) in shared/api/push.py - Mount push router in gateway/main.py - Add pywebpush to gateway dependencies for server-side VAPID delivery - Wire push trigger into WebSocket handler (fires when client disconnects mid-stream) - Add VAPID keys to .env / .env.example - Add push/install i18n keys in en/es/pt message files	2026-03-25 21:26:51 -06:00
Adolfo Delorenzo	17f6d7cb4b	fix: streaming timeout + WebSocket close guard - Streaming httpx client uses 300s read timeout (cloud LLMs can take 30-60s for first token). Was using 120s general timeout. - Guard all WebSocket sends with try/except for client disconnect. Prevents "Cannot send once close message has been sent" crash. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 18:39:32 -06:00
Adolfo Delorenzo	dd80e2b822	perf: bypass Celery for web chat — stream LLM directly from WebSocket Eliminates 5-10s of overhead by calling the LLM pool's streaming endpoint directly from the WebSocket handler instead of going through Celery queue → worker → asyncio.run() → Redis pub-sub → WebSocket. New flow: WebSocket → agent lookup → memory → LLM stream → WebSocket Old flow: WebSocket → Celery → worker → DB → memory → LLM → Redis → WebSocket Memory still saved (Redis sliding window + fire-and-forget embedding). Slack/WhatsApp still use Celery (async webhook pattern). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 18:32:16 -06:00
Adolfo Delorenzo	61b8762bac	feat(streaming): update WebSocket handler to forward streaming chunks to browser - Pub-sub loop now handles 'chunk' and 'done' message types (not just 'response') - 'chunk' messages are forwarded immediately via websocket.send_json - 'done' message breaks the loop and triggers DB persistence of full response - Sends final 'done' JSON to browser to signal stream completion - Legacy 'response' type no longer emitted from orchestrator (now unified as 'done')	2026-03-25 17:57:08 -06:00
Adolfo Delorenzo	b6c8da8cca	fix: increase WebSocket pub-sub timeout from 60s to 180s LLM responses can take >60s (especially with local models). The WebSocket listener was timing out before the response arrived, causing agent replies to appear in logs but not in the chat UI. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 17:48:22 -06:00
Adolfo Delorenzo	84d2e775ad	fix: register RLS hook on gateway — agent creation was failing with policy violation The gateway never called configure_rls_hook(engine), so SET LOCAL app.current_tenant was never set for any DB operation through the portal API endpoints. All tenant-scoped writes (agent creation, etc.) failed with "new row violates row-level security policy." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:40:08 -06:00
Adolfo Delorenzo	56c11a0f1a	feat(06-01): WebSocket endpoint, chat REST API, orchestrator wiring, gateway mounting - Create gateway/channels/web.py with normalize_web_event() and /chat/ws/{conversation_id} WebSocket endpoint (auth via first JSON message, typing indicator, Redis pub-sub response) - Create shared/api/chat.py with GET/POST/DELETE /api/portal/chat/conversations* REST API with require_tenant_member RBAC enforcement and RLS context var setup - Add chat_router to shared/api/__init__.py exports - Mount chat_router and web_chat_router in gateway/main.py (Phase 6 Web Chat routers) - All 19 unit tests pass; full 313-test suite green	2026-03-25 10:26:54 -06:00
Adolfo Delorenzo	f9ce3d650f	feat(05-01): template list/detail/deploy API + RBAC + integration tests - Create shared/api/templates.py with templates_router - GET /api/portal/templates: list active templates (any authenticated user) - GET /api/portal/templates/{id}: get template detail (any authenticated user) - POST /api/portal/templates/{id}/deploy: create Agent snapshot (tenant_admin only) - customer_operator returns 403 on deploy (RBAC enforced) - Export templates_router from shared/api/__init__.py - Mount templates_router in gateway/main.py (Phase 5 section) - 11 integration tests pass (list, detail, deploy, RBAC, 404, snapshot independence)	2026-03-24 20:32:30 -06:00
Adolfo Delorenzo	d59f85cd87	feat(04-rbac-01): RBAC guards + invite token + email + invitation API - rbac.py: PortalCaller dataclass + get_portal_caller dependency (header-based) - rbac.py: require_platform_admin (403 for non-platform_admin) - rbac.py: require_tenant_admin (platform_admin bypasses; customer_admin checks UserTenantRole; operator always rejected) - rbac.py: require_tenant_member (platform_admin bypasses; all roles checked against UserTenantRole) - invite_token.py: generate_invite_token (HMAC-SHA256, base64url, 48h TTL) - invite_token.py: validate_invite_token (timing-safe compare_digest, TTL check) - invite_token.py: token_to_hash (SHA-256 for DB storage) - email.py: send_invite_email (sync smtplib, skips if smtp_host empty) - invitations.py: POST /api/portal/invitations (create, requires tenant admin) - invitations.py: POST /api/portal/invitations/accept (accept invitation) - invitations.py: POST /api/portal/invitations/{id}/resend (regenerate token) - invitations.py: GET /api/portal/invitations (list pending) - portal.py: AuthVerifyResponse now returns role+tenant_ids+active_tenant_id - portal.py: auth/register gated behind require_platform_admin - tasks.py: send_invite_email_task Celery task (fire-and-forget) - gateway/main.py: invitations_router mounted	2026-03-24 13:52:45 -06:00
Adolfo Delorenzo	0e0ea5fb66	fix: runtime deployment fixes for Docker Compose stack - Add .gitignore for __pycache__, node_modules, .playwright-mcp - Add CLAUDE.md project instructions - docker-compose: remove host port exposure for internal services, remove Ollama container (use host), add CORS origin, bake NEXT_PUBLIC_API_URL at build time, run alembic migrations on gateway startup, add CPU-only torch pre-install - gateway: add CORS middleware, graceful Slack degradation without bot token, fix None guard on slack_handler - gateway pyproject: add aiohttp dependency for slack-bolt async - llm-pool pyproject: install litellm from GitHub (removed from PyPI), enable hatch direct references - portal: enable standalone output in next.config.ts - Remove orphaned migration 003_phase2_audit_kb.py (renamed to 004) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-24 12:26:34 -06:00
Adolfo Delorenzo	c47cc2f5bf	feat(03-05): mount Phase 3 API routers on gateway FastAPI app - Import all 6 Phase 3 routers from shared.api (portal, billing, channels, llm_keys, usage, webhook) - Add include_router() calls after existing whatsapp_router - Update module docstring to document portal API endpoints	2026-03-24 00:53:32 -06:00
Adolfo Delorenzo	9dd7c481a3	feat(02-05): Slack file_share extraction and channel-aware outbound routing - Add gateway/channels/slack_media.py with is_file_share_event, media_type_from_mime, build_slack_storage_key, build_attachment_from_slack_file, download_and_store_slack_file - Add _send_response() helper to orchestrator/tasks.py for channel-aware dispatch (Slack -> chat.update, WhatsApp -> send_whatsapp_message) - Add send_whatsapp_message import to orchestrator/tasks.py for WhatsApp outbound - Add boto3>=1.35.0 to gateway dependencies for MinIO S3 client - Add 23 unit tests in test_slack_media.py (TDD)	2026-03-23 15:06:45 -06:00
Adolfo Delorenzo	6fea34db28	feat(02-03): WhatsApp adapter with business-function scoping and router registration - Register whatsapp_router in gateway main.py (GET + POST /whatsapp/webhook) - Implement is_clearly_off_topic() tier 1 keyword scoping gate - Implement build_off_topic_reply() canned redirect message builder - Full webhook handler: verify -> normalize -> tenant -> rate limit -> dedup -> scope -> media -> dispatch - Outbound delivery via send_whatsapp_message() and send_whatsapp_media() - Media download from Meta API and storage in MinIO with tenant-prefixed keys - 14 new passing scoping tests	2026-03-23 14:43:04 -06:00
Adolfo Delorenzo	370a860622	feat(02-03): add MediaAttachment model, WhatsApp normalizer, and signature verification - Add MediaType(StrEnum) and MediaAttachment(BaseModel) to shared/models/message.py - Add media: list[MediaAttachment] field to MessageContent - Add whatsapp_app_secret, whatsapp_verify_token, and MinIO settings to shared/config.py - Add normalize_whatsapp_event() to gateway/normalize.py (text, image, document support) - Create whatsapp.py adapter with verify_whatsapp_signature() and verify_hub_challenge() - 30 new passing tests (signature verification + normalizer)	2026-03-23 14:41:48 -06:00
Adolfo Delorenzo	6f30705e1a	feat(01-03): Channel Gateway (Slack adapter) and Message Router - gateway/normalize.py: normalize_slack_event -> KonstructMessage (strips bot mention) - gateway/channels/slack.py: register_slack_handlers for app_mention + DM events - rate limit check -> ephemeral rejection on exceeded - idempotency dedup (Slack retry protection) - placeholder 'Thinking...' message posted in-thread before Celery dispatch - auto-follow engaged threads with 30-minute TTL - HTTP 200 returned immediately; all LLM work dispatched to Celery - gateway/main.py: FastAPI on port 8001, /slack/events + /health - router/tenant.py: resolve_tenant workspace_id -> tenant_id (RLS-bypass query) - router/ratelimit.py: check_rate_limit Redis token bucket, RateLimitExceeded exception - router/idempotency.py: is_duplicate + mark_processed (SET NX, 24h TTL) - router/context.py: load_agent_for_tenant with RLS ContextVar setup - orchestrator/tasks.py: handle_message now extracts placeholder_ts/channel_id, calls _update_slack_placeholder via chat.update after LLM response - docker-compose.yml: gateway service on port 8001 - pyproject.toml: added redis, konstruct-router, konstruct-orchestrator deps	2026-03-23 10:27:59 -06:00
Adolfo Delorenzo	5714acf741	feat(01-foundation-01): monorepo scaffolding, Docker Compose, and shared data models - pyproject.toml: uv workspace with 5 member packages (shared, gateway, router, orchestrator, llm-pool) - docker-compose.yml: PostgreSQL 16 + Redis 7 + Ollama services on konstruct-net - .env.example: all required env vars documented, konstruct_app role (not superuser) - scripts/init-db.sh: creates konstruct_app role at DB init time - packages/shared/shared/config.py: Pydantic Settings loading all env vars - packages/shared/shared/models/message.py: KonstructMessage, ChannelType, SenderInfo, MessageContent - packages/shared/shared/models/tenant.py: Tenant, Agent, ChannelConnection SQLAlchemy 2.0 models - packages/shared/shared/models/auth.py: PortalUser model for admin portal auth - packages/shared/shared/db.py: async SQLAlchemy engine, session factory, get_session dependency - packages/shared/shared/rls.py: current_tenant_id ContextVar and configure_rls_hook with parameterized SET LOCAL - packages/shared/shared/redis_keys.py: tenant-namespaced key constructors (rate_limit, idempotency, session, engaged_thread)	2026-03-23 09:49:28 -06:00

17 Commits
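
Commit d59f85cd87 describes invite_token.py only in outline: generate_invite_token (HMAC-SHA256, base64url, 48h TTL), validate_invite_token (timing-safe compare_digest, TTL check), and token_to_hash (SHA-256 for DB storage). The following is a minimal sketch of how such a scheme could look using only the Python standard library. The function names come from the commit message; the parameters, the "invitation_id:issued_at" payload layout, the "payload.signature" token format, and the `now` argument are assumptions made for illustration and may not match the repository's actual code.

```python
import base64
import hashlib
import hmac
import time
from typing import Optional

INVITE_TTL_SECONDS = 48 * 60 * 60  # 48h TTL, as stated in the commit message


def _b64url(data: bytes) -> str:
    """Encode bytes as unpadded base64url text."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    """Decode unpadded base64url text, restoring the stripped padding."""
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def generate_invite_token(invitation_id: str, secret: bytes,
                          now: Optional[float] = None) -> str:
    """Sign "invitation_id:issued_at" with HMAC-SHA256 (assumed payload layout)."""
    issued_at = int(time.time() if now is None else now)
    payload = f"{invitation_id}:{issued_at}".encode()
    signature = hmac.new(secret, payload, hashlib.sha256).digest()
    return f"{_b64url(payload)}.{_b64url(signature)}"


def validate_invite_token(token: str, secret: bytes,
                          now: Optional[float] = None) -> Optional[str]:
    """Return the invitation id if the token is authentic and unexpired, else None."""
    try:
        payload_part, signature_part = token.split(".")
        payload = _b64url_decode(payload_part)
        signature = _b64url_decode(signature_part)
        invitation_id, issued_at_text = payload.decode().rsplit(":", 1)
        issued_at = int(issued_at_text)
    except ValueError:
        # Covers malformed structure, bad base64, bad UTF-8, and a bad timestamp.
        return None
    expected = hmac.new(secret, payload, hashlib.sha256).digest()
    # compare_digest runs in time independent of where the inputs differ.
    if not hmac.compare_digest(signature, expected):
        return None
    current = time.time() if now is None else now
    if current - issued_at > INVITE_TTL_SECONDS:
        return None
    return invitation_id


def token_to_hash(token: str) -> str:
    """SHA-256 hex digest, so the raw token never needs to be stored in the DB."""
    return hashlib.sha256(token.encode()).hexdigest()
```

In this sketch the signature is checked before the TTL, so a forged token is rejected without its timestamp ever being trusted. Storing only the output of token_to_hash means a leaked database row cannot be replayed as a valid invite link.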