konstruct/packages/llm-pool/llm_pool/__init__.py at 370a8606229f69639572896bd822178723ea22cc - konstruct - Adolfo Delorenzo's - Git Server

adelorenzo/konstruct

Files

Adolfo Delorenzo ee2f88e13b feat(01-02): LLM Backend Pool — LiteLLM Router with Ollama + Anthropic + OpenAI fallback

- Create llm_pool/router.py: LiteLLM Router with fast (Ollama) and quality (Anthropic/OpenAI) model groups
- Configure fallback chain: quality providers fail -> fast group
- Pin LiteLLM to ==1.82.5 (avoid September 2025 OOM regression in later releases)
- Create llm_pool/main.py: FastAPI service on port 8004 with /complete and /health endpoints
- Add providers/__init__.py: reserved for future per-provider customization
- Update docker-compose.yml: add llm-pool and celery-worker service stubs

2026-03-23 10:03:05 -06:00

11 lines

280 B

Python

Raw Blame History

 """
 konstruct-llm-pool — LiteLLM Router service.
 Exposes a FastAPI HTTP API for LLM completions with multi-provider routing
 and automatic fallback. Import the FastAPI app from main.py.
 """
 from llm_pool.router import complete, llm_router
 __all__ = ["complete", "llm_router"]