konstruct

Author	SHA1	Message	Date
Adolfo Delorenzo	6c1086046f	fix: add LLM_POOL_URL to gateway env — was using localhost instead of llm-pool Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 18:35:43 -06:00
Adolfo Delorenzo	ebf6e76174	feat: make Ollama model configurable via OLLAMA_MODEL env var - Add OLLAMA_MODEL setting to shared config (default: qwen3:32b) - LLM router reads from settings instead of hardcoded model name - Create .env file with all configurable settings documented - docker-compose passes OLLAMA_MODEL to llm-pool container To change the model: edit OLLAMA_MODEL in .env and restart llm-pool. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 13:22:18 -06:00
Adolfo Delorenzo	0e0ea5fb66	fix: runtime deployment fixes for Docker Compose stack - Add .gitignore for __pycache__, node_modules, .playwright-mcp - Add CLAUDE.md project instructions - docker-compose: remove host port exposure for internal services, remove Ollama container (use host), add CORS origin, bake NEXT_PUBLIC_API_URL at build time, run alembic migrations on gateway startup, add CPU-only torch pre-install - gateway: add CORS middleware, graceful Slack degradation without bot token, fix None guard on slack_handler - gateway pyproject: add aiohttp dependency for slack-bolt async - llm-pool pyproject: install litellm from GitHub (removed from PyPI), enable hatch direct references - portal: enable standalone output in next.config.ts - Remove orphaned migration 003_phase2_audit_kb.py (renamed to 004) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-24 12:26:34 -06:00
Adolfo Delorenzo	28a5ee996e	feat(02-01): add two-layer memory system — Redis sliding window + pgvector long-term - ConversationEmbedding ORM model with Vector(384) column (pgvector) - memory_short_key, escalation_status_key, pending_tool_confirm_key in redis_keys.py - orchestrator/memory/short_term.py: RPUSH/LTRIM sliding window (get_recent_messages, append_message) - orchestrator/memory/long_term.py: pgvector HNSW cosine search (retrieve_relevant, store_embedding) - Migration 002: conversation_embeddings table, HNSW index, RLS with FORCE, SELECT/INSERT only - 10 unit tests (fakeredis), 6 integration tests (pgvector) — all passing - Auto-fix [Rule 3]: postgres image updated to pgvector/pgvector:pg16 (extension required)	2026-03-23 14:41:57 -06:00
Adolfo Delorenzo	6f30705e1a	feat(01-03): Channel Gateway (Slack adapter) and Message Router - gateway/normalize.py: normalize_slack_event -> KonstructMessage (strips bot mention) - gateway/channels/slack.py: register_slack_handlers for app_mention + DM events - rate limit check -> ephemeral rejection on exceeded - idempotency dedup (Slack retry protection) - placeholder 'Thinking...' message posted in-thread before Celery dispatch - auto-follow engaged threads with 30-minute TTL - HTTP 200 returned immediately; all LLM work dispatched to Celery - gateway/main.py: FastAPI on port 8001, /slack/events + /health - router/tenant.py: resolve_tenant workspace_id -> tenant_id (RLS-bypass query) - router/ratelimit.py: check_rate_limit Redis token bucket, RateLimitExceeded exception - router/idempotency.py: is_duplicate + mark_processed (SET NX, 24h TTL) - router/context.py: load_agent_for_tenant with RLS ContextVar setup - orchestrator/tasks.py: handle_message now extracts placeholder_ts/channel_id, calls _update_slack_placeholder via chat.update after LLM response - docker-compose.yml: gateway service on port 8001 - pyproject.toml: added redis, konstruct-router, konstruct-orchestrator deps	2026-03-23 10:27:59 -06:00
Adolfo Delorenzo	cec7180fb0	feat(01-04): Next.js 16 admin portal with Auth.js v5, tenant CRUD, and Agent Designer - Initialize Next.js 16 project in packages/portal/ with TypeScript, Tailwind 4, shadcn/ui - Auth.js v5 with Credentials provider calling FastAPI /auth/verify endpoint - proxy.ts (Next.js 16 replacement for middleware.ts) protects all routes - Login page with React Hook Form + zod validation (standard-schema resolver for zod v4 compat) - Agent Designer: prominent dedicated module with Identity, Personality, Configuration, Capabilities, Escalation, and Status sections; employee-centric language throughout - Tenant CRUD: list, create (slug auto-gen), view/edit, delete with confirmation - TanStack Query hooks for all API operations with proper cache invalidation - Route group (dashboard) provides shared Nav sidebar + QueryClientProvider - Update docker-compose.yml to add portal service on port 3000 - Deviations: middleware.ts renamed to proxy.ts in Next.js 16; zodResolver replaced with standardSchemaResolver for zod v4 + @hookform/resolvers v5 compatibility	2026-03-23 10:19:40 -06:00
Adolfo Delorenzo	ee2f88e13b	feat(01-02): LLM Backend Pool — LiteLLM Router with Ollama + Anthropic + OpenAI fallback - Create llm_pool/router.py: LiteLLM Router with fast (Ollama) and quality (Anthropic/OpenAI) model groups - Configure fallback chain: quality providers fail -> fast group - Pin LiteLLM to ==1.82.5 (avoid September 2025 OOM regression in later releases) - Create llm_pool/main.py: FastAPI service on port 8004 with /complete and /health endpoints - Add providers/__init__.py: reserved for future per-provider customization - Update docker-compose.yml: add llm-pool and celery-worker service stubs	2026-03-23 10:03:05 -06:00
Adolfo Delorenzo	5714acf741	feat(01-foundation-01): monorepo scaffolding, Docker Compose, and shared data models - pyproject.toml: uv workspace with 5 member packages (shared, gateway, router, orchestrator, llm-pool) - docker-compose.yml: PostgreSQL 16 + Redis 7 + Ollama services on konstruct-net - .env.example: all required env vars documented, konstruct_app role (not superuser) - scripts/init-db.sh: creates konstruct_app role at DB init time - packages/shared/shared/config.py: Pydantic Settings loading all env vars - packages/shared/shared/models/message.py: KonstructMessage, ChannelType, SenderInfo, MessageContent - packages/shared/shared/models/tenant.py: Tenant, Agent, ChannelConnection SQLAlchemy 2.0 models - packages/shared/shared/models/auth.py: PortalUser model for admin portal auth - packages/shared/shared/db.py: async SQLAlchemy engine, session factory, get_session dependency - packages/shared/shared/rls.py: current_tenant_id ContextVar and configure_rls_hook with parameterized SET LOCAL - packages/shared/shared/redis_keys.py: tenant-namespaced key constructors (rate_limit, idempotency, session, engaged_thread)	2026-03-23 09:49:28 -06:00
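Commit 28a5ee996e describes the short-term memory layer as an RPUSH/LTRIM sliding window exposed through get_recent_messages and append_message. A minimal sketch of that pattern follows. The function signatures, the window size of 20, and the InMemoryListStore stand-in are assumptions made for illustration and are not taken from the repository. The stand-in only mimics the rpush, ltrim, and lrange calls of a Redis client so the example runs without a server.

```python
from typing import Dict, List


class InMemoryListStore:
    """Hypothetical stand-in that mimics Redis list commands for this sketch."""

    def __init__(self) -> None:
        self._lists: Dict[str, List[str]] = {}

    @staticmethod
    def _slice(items: List[str], start: int, end: int) -> List[str]:
        # Redis list ranges are inclusive and accept negative offsets from the tail.
        n = len(items)
        if start < 0:
            start = max(n + start, 0)
        if end < 0:
            end = n + end
        return items[start:end + 1]

    def rpush(self, key: str, *values: str) -> int:
        self._lists.setdefault(key, []).extend(values)
        return len(self._lists[key])

    def ltrim(self, key: str, start: int, end: int) -> bool:
        self._lists[key] = self._slice(self._lists.get(key, []), start, end)
        return True

    def lrange(self, key: str, start: int, end: int) -> List[str]:
        return self._slice(self._lists.get(key, []), start, end)


def append_message(client, key: str, message: str, window: int = 20) -> None:
    """Append to the tail, then keep only the newest `window` entries."""
    client.rpush(key, message)
    client.ltrim(key, -window, -1)


def get_recent_messages(client, key: str, n: int = 20) -> List[str]:
    """Return up to `n` most recent messages, oldest first."""
    return client.lrange(key, -n, -1)


if __name__ == "__main__":
    store = InMemoryListStore()
    for i in range(5):
        append_message(store, "demo:memory", f"m{i}", window=3)
    print(get_recent_messages(store, "demo:memory", 3))
```

Trimming on every write keeps the list bounded without a separate cleanup job. With a real Redis client the two writes would usually be sent in a pipeline so the push and the trim travel together.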

8 Commits
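
Commit 6f30705e1a mentions a Redis token bucket behind check_rate_limit and a RateLimitExceeded exception. The sketch below shows the token bucket idea only. It keeps its state in a local dict instead of Redis, and the class name TokenBucketLimiter, the capacity, the refill rate, and the injectable clock are all hypothetical choices for illustration, not the repository's implementation.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, Tuple


class RateLimitExceeded(Exception):
    """Raised when a tenant has less than one token left."""


@dataclass
class TokenBucketLimiter:
    capacity: float = 5.0            # hypothetical burst size
    refill_per_second: float = 1.0   # hypothetical refill rate
    clock: Callable[[], float] = time.monotonic
    # tenant_id -> (tokens remaining, timestamp of last update)
    _state: Dict[str, Tuple[float, float]] = field(default_factory=dict)

    def check_rate_limit(self, tenant_id: str) -> float:
        """Consume one token and return the remainder, or raise if empty."""
        now = self.clock()
        tokens, last = self._state.get(tenant_id, (self.capacity, now))
        # Refill in proportion to elapsed time, capped at the bucket capacity.
        tokens = min(self.capacity, tokens + (now - last) * self.refill_per_second)
        if tokens < 1.0:
            self._state[tenant_id] = (tokens, now)
            raise RateLimitExceeded(tenant_id)
        tokens -= 1.0
        self._state[tenant_id] = (tokens, now)
        return tokens


if __name__ == "__main__":
    limiter = TokenBucketLimiter(capacity=2.0, refill_per_second=1.0)
    for attempt in range(3):
        try:
            print("allowed, tokens left:", limiter.check_rate_limit("tenant-a"))
        except RateLimitExceeded:
            print("rejected")
```

A Redis-backed version would need the read, refill, and decrement to happen atomically, for example inside a Lua script, so that concurrent gateway workers cannot overspend the same bucket.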