Update frontend branding from 'Voice Language Translator' to 'Talk2Me'

- Updated page title in index.html - Updated main heading in index.html - Updated PWA manifest name - Updated service worker comment 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
Add production WSGI server - Flask dev server unsuitable for production load
2025-06-03 08:59:00 -06:00 · 2025-06-03 08:49:32 -06:00 · 2025-06-03 08:37:13 -06:00 · 2025-06-03 08:11:26 -06:00 · 2025-06-03 00:58:14 -06:00 · 2025-06-03 00:50:45 -06:00
63 changed files with 16800 additions and 598 deletions
--- a/.dockerignore
+++ b/.dockerignore
@@ -0,0 +1,71 @@
+# Git
+.git
+.gitignore
+
+# Python
+__pycache__
+*.pyc
+*.pyo
+*.pyd
+.Python
+venv/
+env/
+.venv
+pip-log.txt
+pip-delete-this-directory.txt
+.tox/
+.coverage
+.coverage.*
+.cache
+*.egg-info/
+.pytest_cache/
+
+# Node
+node_modules/
+npm-debug.log*
+yarn-debug.log*
+yarn-error.log*
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+
+# OS
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+
+# Project specific
+logs/
+*.log
+.env
+.env.*
+!.env.production
+*.db
+*.sqlite
+/tmp
+/temp
+test_*.py
+tests/
+
+# Documentation
+*.md
+!README.md
+docs/
+
+# CI/CD
+.github/
+.gitlab-ci.yml
+.travis.yml
+
+# Development files
+deploy.sh
+Makefile
+docker-compose.override.yml
--- a/.env.example
+++ b/.env.example
@@ -0,0 +1,22 @@
+# Example environment configuration for Talk2Me
+# Copy this file to .env and update with your actual values
+
+# Flask Configuration
+SECRET_KEY=your-secret-key-here-change-this
+
+# Upload Configuration
+UPLOAD_FOLDER=/path/to/secure/upload/folder
+
+# TTS Server Configuration
+TTS_SERVER_URL=http://localhost:5050/v1/audio/speech
+TTS_API_KEY=your-tts-api-key-here
+
+# CORS Configuration (for production)
+CORS_ORIGINS=https://yourdomain.com,https://app.yourdomain.com
+ADMIN_CORS_ORIGINS=https://admin.yourdomain.com
+
+# Admin Token (for admin endpoints)
+ADMIN_TOKEN=your-secure-admin-token-here
+
+# Optional: GPU Configuration
+# CUDA_VISIBLE_DEVICES=0
--- a/.gitignore
+++ b/.gitignore
@@ -1 +1,69 @@
+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
 venv/
+env/
+ENV/
+.venv
+.env
+
+# Flask
+instance/
+.webassets-cache
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+
+# OS
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+
+# Node.js
+node_modules/
+npm-debug.log*
+yarn-debug.log*
+yarn-error.log*
+
+# TypeScript
+static/js/dist/
+*.tsbuildinfo
+
+# Temporary files
+*.log
+*.tmp
+temp/
+tmp/
+
+# Audio files (for testing)
+*.mp3
+*.wav
+*.ogg
+
+# Local environment
+.env.local
+.env.*.local
+.env.production
+.env.development
+.env.staging
+
+# VAPID keys
+vapid_private.pem
+vapid_public.pem
+
+# Secrets management
+.secrets.json
+.master_key
+secrets.db
+*.key
--- a/CONNECTION_RETRY.md
+++ b/CONNECTION_RETRY.md
@@ -0,0 +1,173 @@
+# Connection Retry Logic Documentation
+
+This document explains the connection retry and network interruption handling features in Talk2Me.
+
+## Overview
+
+Talk2Me implements robust connection retry logic to handle network interruptions gracefully. When a connection is lost or a request fails due to network issues, the application automatically queues requests and retries them when the connection is restored.
+
+## Features
+
+### 1. Automatic Connection Monitoring
+- Monitors browser online/offline events
+- Periodic health checks to the server (every 5 seconds when offline)
+- Visual connection status indicator
+- Automatic detection when returning from sleep/hibernation
+
+### 2. Request Queuing
+- Failed requests are automatically queued during network interruptions
+- Requests maintain their priority and are processed in order
+- Queue persists across connection failures
+- Visual indication of queued requests
+
+### 3. Exponential Backoff Retry
+- Failed requests are retried with exponential backoff
+- Initial retry delay: 1 second
+- Maximum retry delay: 30 seconds
+- Backoff multiplier: 2x
+- Maximum retries: 3 attempts
+
+### 4. Connection Status UI
+- Real-time connection status indicator (bottom-right corner)
+- Offline banner with retry button
+- Queue status showing pending requests by type
+- Temporary status messages for important events
+
+## User Experience
+
+### When Connection is Lost
+
+1. **Visual Indicators**:
+   - Connection status shows "Offline" or "Connection error"
+   - Red banner appears at top of screen
+   - Queued request count is displayed
+
+2. **Request Handling**:
+   - New requests are automatically queued
+   - User sees "Connection error - queued" message
+   - Requests will be sent when connection returns
+
+3. **Manual Retry**:
+   - Users can click "Retry" button in offline banner
+   - Forces immediate connection check
+
+### When Connection is Restored
+
+1. **Automatic Recovery**:
+   - Connection status changes to "Connecting..."
+   - Queued requests are processed automatically
+   - Success message shown briefly
+
+2. **Request Processing**:
+   - Queued requests maintain their order
+   - Higher priority requests (transcription) processed first
+   - Progress indicators show processing status
+
+## Configuration
+
+The connection retry logic can be configured programmatically:
+
+```javascript
+// In app.ts or initialization code
+connectionManager.configure({
+    maxRetries: 3,           // Maximum retry attempts
+    initialDelay: 1000,      // Initial retry delay (ms)
+    maxDelay: 30000,         // Maximum retry delay (ms)
+    backoffMultiplier: 2,    // Exponential backoff multiplier
+    timeout: 10000,          // Request timeout (ms)
+    onlineCheckInterval: 5000 // Health check interval (ms)
+});
+```
+
+## Request Priority
+
+Requests are prioritized as follows:
+1. **Transcription** (Priority: 8) - Highest priority
+2. **Translation** (Priority: 5) - Normal priority
+3. **TTS/Audio** (Priority: 3) - Lower priority
+
+## Error Types
+
+### Retryable Errors
+- Network errors
+- Connection timeouts
+- Server errors (5xx)
+- CORS errors (in some cases)
+
+### Non-Retryable Errors
+- Client errors (4xx)
+- Authentication errors
+- Rate limit errors
+- Invalid request errors
+
+## Best Practices
+
+1. **For Users**:
+   - Wait for queued requests to complete before closing the app
+   - Use the manual retry button if automatic recovery fails
+   - Check the connection status indicator for current state
+
+2. **For Developers**:
+   - All fetch requests should go through RequestQueueManager
+   - Use appropriate request priorities
+   - Handle both online and offline scenarios in UI
+   - Provide clear feedback about connection status
+
+## Technical Implementation
+
+### Key Components
+
+1. **ConnectionManager** (`connectionManager.ts`):
+   - Monitors connection state
+   - Implements retry logic with exponential backoff
+   - Provides connection state subscriptions
+
+2. **RequestQueueManager** (`requestQueue.ts`):
+   - Queues failed requests
+   - Integrates with ConnectionManager
+   - Handles request prioritization
+
+3. **ConnectionUI** (`connectionUI.ts`):
+   - Displays connection status
+   - Shows offline banner
+   - Updates queue information
+
+### Integration Example
+
+```typescript
+// Automatic integration through RequestQueueManager
+const queue = RequestQueueManager.getInstance();
+const data = await queue.enqueue<ResponseType>(
+    'translate',  // Request type
+    async () => {
+        // Your fetch request
+        const response = await fetch('/api/translate', options);
+        return response.json();
+    },
+    5  // Priority (1-10, higher = more important)
+);
+```
+
+## Troubleshooting
+
+### Connection Not Detected
+- Check browser permissions for network status
+- Ensure health endpoint (/health) is accessible
+- Verify no firewall/proxy blocking
+
+### Requests Not Retrying
+- Check browser console for errors
+- Verify request type is retryable
+- Check if max retries exceeded
+
+### Queue Not Processing
+- Manually trigger retry with button
+- Check if requests are timing out
+- Verify server is responding
+
+## Future Enhancements
+
+- Persistent queue storage (survive page refresh)
+- Configurable retry strategies per request type
+- Network speed detection and adaptation
+- Progressive web app offline mode
--- a/CORS_CONFIG.md
+++ b/CORS_CONFIG.md
@@ -0,0 +1,152 @@
+# CORS Configuration Guide
+
+This document explains how to configure Cross-Origin Resource Sharing (CORS) for the Talk2Me application.
+
+## Overview
+
+CORS is configured using Flask-CORS to enable secure cross-origin usage of the API endpoints. This allows the Talk2Me application to be embedded in other websites or accessed from different domains while maintaining security.
+
+## Environment Variables
+
+### `CORS_ORIGINS`
+
+Controls which domains are allowed to access the API endpoints.
+
+- **Default**: `*` (allows all origins - use only for development)
+- **Production Example**: `https://yourdomain.com,https://app.yourdomain.com`
+- **Format**: Comma-separated list of allowed origins
+
+```bash
+# Development (allows all origins)
+export CORS_ORIGINS="*"
+
+# Production (restrict to specific domains)
+export CORS_ORIGINS="https://talk2me.example.com,https://app.example.com"
+```
+
+### `ADMIN_CORS_ORIGINS`
+
+Controls which domains can access admin endpoints (more restrictive).
+
+- **Default**: `http://localhost:*` (allows all localhost ports)
+- **Production Example**: `https://admin.yourdomain.com`
+- **Format**: Comma-separated list of allowed admin origins
+
+```bash
+# Development
+export ADMIN_CORS_ORIGINS="http://localhost:*"
+
+# Production
+export ADMIN_CORS_ORIGINS="https://admin.talk2me.example.com"
+```
+
+## Configuration Details
+
+The CORS configuration includes:
+
+- **Allowed Methods**: GET, POST, OPTIONS
+- **Allowed Headers**: Content-Type, Authorization, X-Requested-With, X-Admin-Token
+- **Exposed Headers**: Content-Range, X-Content-Range
+- **Credentials Support**: Enabled (supports cookies and authorization headers)
+- **Max Age**: 3600 seconds (preflight requests cached for 1 hour)
+
+## Endpoints
+
+All endpoints have CORS enabled with the following configuration:
+
+### Regular API Endpoints
+- `/api/*`
+- `/transcribe`
+- `/translate`
+- `/translate/stream`
+- `/speak`
+- `/get_audio/*`
+- `/check_tts_server`
+- `/update_tts_config`
+- `/health/*`
+
+### Admin Endpoints (More Restrictive)
+- `/admin/*` - Uses `ADMIN_CORS_ORIGINS` instead of general `CORS_ORIGINS`
+
+## Security Best Practices
+
+1. **Never use `*` in production** - Always specify exact allowed origins
+2. **Use HTTPS** - Always use HTTPS URLs in production CORS origins
+3. **Separate admin origins** - Keep admin endpoints on a separate, more restrictive origin list
+4. **Review regularly** - Periodically review and update allowed origins
+
+## Example Configurations
+
+### Local Development
+```bash
+export CORS_ORIGINS="*"
+export ADMIN_CORS_ORIGINS="http://localhost:*"
+```
+
+### Staging Environment
+```bash
+export CORS_ORIGINS="https://staging.talk2me.com,https://staging-app.talk2me.com"
+export ADMIN_CORS_ORIGINS="https://staging-admin.talk2me.com"
+```
+
+### Production Environment
+```bash
+export CORS_ORIGINS="https://talk2me.com,https://app.talk2me.com"
+export ADMIN_CORS_ORIGINS="https://admin.talk2me.com"
+```
+
+### Mobile App Integration
+```bash
+# Include mobile app schemes if needed
+export CORS_ORIGINS="https://talk2me.com,https://app.talk2me.com,capacitor://localhost,ionic://localhost"
+```
+
+## Testing CORS Configuration
+
+You can test CORS configuration using curl:
+
+```bash
+# Test preflight request
+curl -X OPTIONS https://your-api.com/api/transcribe \
+  -H "Origin: https://allowed-origin.com" \
+  -H "Access-Control-Request-Method: POST" \
+  -H "Access-Control-Request-Headers: Content-Type" \
+  -v
+
+# Test actual request
+curl -X POST https://your-api.com/api/transcribe \
+  -H "Origin: https://allowed-origin.com" \
+  -H "Content-Type: application/json" \
+  -d '{"test": "data"}' \
+  -v
+```
+
+## Troubleshooting
+
+### CORS Errors in Browser Console
+
+If you see CORS errors:
+
+1. Check that the origin is included in `CORS_ORIGINS`
+2. Ensure the URL protocol matches (http vs https)
+3. Check for trailing slashes in origins
+4. Verify environment variables are set correctly
+
+### Common Issues
+
+1. **"No 'Access-Control-Allow-Origin' header"**
+   - Origin not in allowed list
+   - Check `CORS_ORIGINS` environment variable
+
+2. **"CORS policy: The request client is not a secure context"**
+   - Using HTTP instead of HTTPS
+   - Update to use HTTPS in production
+
+3. **"CORS policy: Credentials flag is true, but Access-Control-Allow-Credentials is not 'true'"**
+   - This should not occur with current configuration
+   - Check that `supports_credentials` is True in CORS config
+
+## Additional Resources
+
+- [MDN CORS Documentation](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS)
+- [Flask-CORS Documentation](https://flask-cors.readthedocs.io/)
--- a/46
+++ b/46
@@ -0,0 +1,46 @@
+# Production Dockerfile for Talk2Me
+FROM python:3.10-slim
+
+# Install system dependencies
+RUN apt-get update && apt-get install -y \
+    build-essential \
+    curl \
+    ffmpeg \
+    git \
+    && rm -rf /var/lib/apt/lists/*
+
+# Create non-root user
+RUN useradd -m -u 1000 talk2me
+
+# Set working directory
+WORKDIR /app
+
+# Copy requirements first for better caching
+COPY requirements.txt requirements-prod.txt ./
+RUN pip install --no-cache-dir -r requirements-prod.txt
+
+# Copy application code
+COPY --chown=talk2me:talk2me . .
+
+# Create necessary directories
+RUN mkdir -p logs /tmp/talk2me_uploads && \
+    chown -R talk2me:talk2me logs /tmp/talk2me_uploads
+
+# Switch to non-root user
+USER talk2me
+
+# Set environment variables
+ENV FLASK_ENV=production \
+    PYTHONUNBUFFERED=1 \
+    UPLOAD_FOLDER=/tmp/talk2me_uploads \
+    LOGS_DIR=/app/logs
+
+# Health check
+HEALTHCHECK --interval=30s --timeout=10s --start-period=40s --retries=3 \
+    CMD curl -f http://localhost:5005/health || exit 1
+
+# Expose port
+EXPOSE 5005
+
+# Run with gunicorn
+CMD ["gunicorn", "--config", "gunicorn_config.py", "wsgi:application"]
--- a/ERROR_LOGGING.md
+++ b/ERROR_LOGGING.md
@@ -0,0 +1,460 @@
+# Error Logging Documentation
+
+This document describes the comprehensive error logging system implemented in Talk2Me for debugging production issues.
+
+## Overview
+
+Talk2Me implements a structured logging system that provides:
+- JSON-formatted structured logs for easy parsing
+- Multiple log streams (app, errors, access, security, performance)
+- Automatic log rotation to prevent disk space issues
+- Request tracing with unique IDs
+- Performance metrics collection
+- Security event tracking
+- Error deduplication and frequency tracking
+
+## Log Types
+
+### 1. Application Logs (`logs/talk2me.log`)
+General application logs including info, warnings, and debug messages.
+
+```json
+{
+  "timestamp": "2024-01-15T10:30:45.123Z",
+  "level": "INFO",
+  "logger": "talk2me",
+  "message": "Whisper model loaded successfully",
+  "app": "talk2me",
+  "environment": "production",
+  "hostname": "server-1",
+  "thread": "MainThread",
+  "process": 12345
+}
+```
+
+### 2. Error Logs (`logs/errors.log`)
+Dedicated error logging with full exception details and stack traces.
+
+```json
+{
+  "timestamp": "2024-01-15T10:31:00.456Z",
+  "level": "ERROR",
+  "logger": "talk2me.errors",
+  "message": "Error in transcribe: File too large",
+  "exception": {
+    "type": "ValueError",
+    "message": "Audio file exceeds maximum size",
+    "traceback": ["...full stack trace..."]
+  },
+  "request_id": "1234567890-abcdef",
+  "endpoint": "transcribe",
+  "method": "POST",
+  "path": "/transcribe",
+  "ip": "192.168.1.100"
+}
+```
+
+### 3. Access Logs (`logs/access.log`)
+HTTP request/response logging for traffic analysis.
+
+```json
+{
+  "timestamp": "2024-01-15T10:32:00.789Z",
+  "level": "INFO",
+  "message": "request_complete",
+  "request_id": "1234567890-abcdef",
+  "method": "POST",
+  "path": "/transcribe",
+  "status": 200,
+  "duration_ms": 1250,
+  "content_length": 4096,
+  "ip": "192.168.1.100",
+  "user_agent": "Mozilla/5.0..."
+}
+```
+
+### 4. Security Logs (`logs/security.log`)
+Security-related events and suspicious activities.
+
+```json
+{
+  "timestamp": "2024-01-15T10:33:00.123Z",
+  "level": "WARNING",
+  "message": "Security event: rate_limit_exceeded",
+  "event": "rate_limit_exceeded",
+  "severity": "warning",
+  "ip": "192.168.1.100",
+  "endpoint": "/transcribe",
+  "attempts": 15,
+  "blocked": true
+}
+```
+
+### 5. Performance Logs (`logs/performance.log`)
+Performance metrics and slow request tracking.
+
+```json
+{
+  "timestamp": "2024-01-15T10:34:00.456Z",
+  "level": "INFO",
+  "message": "Performance metric: transcribe_audio",
+  "metric": "transcribe_audio",
+  "duration_ms": 2500,
+  "function": "transcribe",
+  "module": "app",
+  "request_id": "1234567890-abcdef"
+}
+```
+
+## Configuration
+
+### Environment Variables
+
+```bash
+# Log level (DEBUG, INFO, WARNING, ERROR, CRITICAL)
+export LOG_LEVEL=INFO
+
+# Log file paths
+export LOG_FILE=logs/talk2me.log
+export ERROR_LOG_FILE=logs/errors.log
+
+# Log rotation settings
+export LOG_MAX_BYTES=52428800      # 50MB
+export LOG_BACKUP_COUNT=10         # Keep 10 backup files
+
+# Environment
+export FLASK_ENV=production
+```
+
+### Flask Configuration
+
+```python
+app.config.update({
+    'LOG_LEVEL': 'INFO',
+    'LOG_FILE': 'logs/talk2me.log',
+    'ERROR_LOG_FILE': 'logs/errors.log',
+    'LOG_MAX_BYTES': 50 * 1024 * 1024,
+    'LOG_BACKUP_COUNT': 10
+})
+```
+
+## Admin API Endpoints
+
+### GET /admin/logs/errors
+View recent error logs and error frequency statistics.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/logs/errors
+```
+
+Response:
+```json
+{
+  "error_summary": {
+    "abc123def456": {
+      "count_last_hour": 5,
+      "last_seen": 1705320000
+    }
+  },
+  "recent_errors": [...],
+  "total_errors_logged": 150
+}
+```
+
+### GET /admin/logs/performance
+View performance metrics and slow requests.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/logs/performance
+```
+
+Response:
+```json
+{
+  "performance_metrics": {
+    "transcribe_audio": {
+      "avg_ms": 850.5,
+      "max_ms": 3200,
+      "min_ms": 125,
+      "count": 1024
+    }
+  },
+  "slow_requests": [
+    {
+      "metric": "transcribe_audio",
+      "duration_ms": 3200,
+      "timestamp": "2024-01-15T10:35:00Z"
+    }
+  ]
+}
+```
+
+### GET /admin/logs/security
+View security events and suspicious activities.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/logs/security
+```
+
+Response:
+```json
+{
+  "security_events": [...],
+  "event_summary": {
+    "rate_limit_exceeded": 25,
+    "suspicious_error": 3,
+    "high_error_rate": 1
+  },
+  "total_events": 29
+}
+```
+
+## Usage Patterns
+
+### 1. Logging Errors with Context
+
+```python
+from error_logger import log_exception
+
+try:
+    # Some operation
+    process_audio(file)
+except Exception as e:
+    log_exception(
+        e,
+        message="Failed to process audio",
+        user_id=user.id,
+        file_size=file.size,
+        file_type=file.content_type
+    )
+```
+
+### 2. Performance Monitoring
+
+```python
+from error_logger import log_performance
+
+@log_performance('expensive_operation')
+def process_large_file(file):
+    # This will automatically log execution time
+    return processed_data
+```
+
+### 3. Security Event Logging
+
+```python
+app.error_logger.log_security(
+    'unauthorized_access',
+    severity='warning',
+    ip=request.remote_addr,
+    attempted_resource='/admin',
+    user_agent=request.headers.get('User-Agent')
+)
+```
+
+### 4. Request Context
+
+```python
+from error_logger import log_context
+
+with log_context(user_id=user.id, feature='translation'):
+    # All logs within this context will include user_id and feature
+    translate_text(text)
+```
+
+## Log Analysis
+
+### Finding Specific Errors
+
+```bash
+# Find all authentication errors
+grep '"error_type":"AuthenticationError"' logs/errors.log | jq .
+
+# Find errors from specific IP
+grep '"ip":"192.168.1.100"' logs/errors.log | jq .
+
+# Find errors in last hour
+grep "$(date -u -d '1 hour ago' +%Y-%m-%dT%H)" logs/errors.log | jq .
+```
+
+### Performance Analysis
+
+```bash
+# Find slow requests (>2000ms)
+jq 'select(.extra_fields.duration_ms > 2000)' logs/performance.log
+
+# Calculate average response time for endpoint
+jq 'select(.extra_fields.metric == "transcribe_audio") | .extra_fields.duration_ms' logs/performance.log | awk '{sum+=$1; count++} END {print sum/count}'
+```
+
+### Security Monitoring
+
+```bash
+# Count security events by type
+jq '.extra_fields.event' logs/security.log | sort | uniq -c
+
+# Find all blocked IPs
+jq 'select(.extra_fields.blocked == true) | .extra_fields.ip' logs/security.log | sort -u
+```
+
+## Log Rotation
+
+Logs are automatically rotated based on size or time:
+
+- **Application/Error logs**: Rotate at 50MB, keep 10 backups
+- **Access logs**: Daily rotation, keep 30 days
+- **Performance logs**: Hourly rotation, keep 7 days
+- **Security logs**: Rotate at 50MB, keep 10 backups
+
+Rotated logs are named with numeric suffixes:
+- `talk2me.log` (current)
+- `talk2me.log.1` (most recent backup)
+- `talk2me.log.2` (older backup)
+- etc.
+
+## Best Practices
+
+### 1. Structured Logging
+
+Always include relevant context:
+```python
+logger.info("User action completed", extra={
+    'extra_fields': {
+        'user_id': user.id,
+        'action': 'upload_audio',
+        'file_size': file.size,
+        'duration_ms': processing_time
+    }
+})
+```
+
+### 2. Error Handling
+
+Log errors at appropriate levels:
+```python
+try:
+    result = risky_operation()
+except ValidationError as e:
+    logger.warning(f"Validation failed: {e}")  # Expected errors
+except Exception as e:
+    logger.error(f"Unexpected error: {e}", exc_info=True)  # Unexpected errors
+```
+
+### 3. Performance Tracking
+
+Track key operations:
+```python
+start = time.time()
+result = expensive_operation()
+duration = (time.time() - start) * 1000
+
+app.error_logger.log_performance(
+    'expensive_operation',
+    value=duration,
+    input_size=len(data),
+    output_size=len(result)
+)
+```
+
+### 4. Security Awareness
+
+Log security-relevant events:
+```python
+if failed_attempts > 3:
+    app.error_logger.log_security(
+        'multiple_failed_attempts',
+        severity='warning',
+        ip=request.remote_addr,
+        attempts=failed_attempts
+    )
+```
+
+## Monitoring Integration
+
+### Prometheus Metrics
+
+Export log metrics for Prometheus:
+```python
+@app.route('/metrics')
+def prometheus_metrics():
+    error_summary = app.error_logger.get_error_summary()
+    # Format as Prometheus metrics
+    return format_prometheus_metrics(error_summary)
+```
+
+### ELK Stack
+
+Ship logs to Elasticsearch:
+```yaml
+filebeat.inputs:
+- type: log
+  paths:
+    - /app/logs/*.log
+  json.keys_under_root: true
+  json.add_error_key: true
+```
+
+### CloudWatch
+
+For AWS deployments:
+```python
+# Install boto3 and watchtower
+import watchtower
+cloudwatch_handler = watchtower.CloudWatchLogHandler()
+logger.addHandler(cloudwatch_handler)
+```
+
+## Troubleshooting
+
+### Common Issues
+
+#### 1. Logs Not Being Written
+
+Check permissions:
+```bash
+ls -la logs/
+# Should show write permissions for app user
+```
+
+Create logs directory:
+```bash
+mkdir -p logs
+chmod 755 logs
+```
+
+#### 2. Disk Space Issues
+
+Monitor log sizes:
+```bash
+du -sh logs/*
+```
+
+Force rotation:
+```bash
+# Manually rotate logs
+mv logs/talk2me.log logs/talk2me.log.backup
+# App will create new log file
+```
+
+#### 3. Performance Impact
+
+If logging impacts performance:
+- Increase LOG_LEVEL to WARNING or ERROR
+- Reduce backup count
+- Use asynchronous logging (future enhancement)
+
+## Security Considerations
+
+1. **Log Sanitization**: Sensitive data is automatically masked
+2. **Access Control**: Admin endpoints require authentication
+3. **Log Retention**: Old logs are automatically deleted
+4. **Encryption**: Consider encrypting logs at rest in production
+5. **Audit Trail**: All log access is itself logged
+
+## Future Enhancements
+
+1. **Centralized Logging**: Ship logs to centralized service
+2. **Real-time Alerts**: Trigger alerts on error patterns
+3. **Log Analytics**: Built-in log analysis dashboard
+4. **Correlation IDs**: Track requests across microservices
+5. **Async Logging**: Reduce performance impact
--- a/GPU_SUPPORT.md
+++ b/GPU_SUPPORT.md
@@ -0,0 +1,68 @@
+# GPU Support for Talk2Me
+
+## Current GPU Support Status
+
+### ✅ NVIDIA GPUs (Full Support)
+- **Requirements**: CUDA 11.x or 12.x
+- **Optimizations**:
+  - TensorFloat-32 (TF32) for Ampere GPUs (RTX 30xx, A100)
+  - cuDNN auto-tuning
+  - Half-precision (FP16) inference
+  - CUDA kernel pre-caching
+  - Memory pre-allocation
+
+### ⚠️ AMD GPUs (Limited Support)
+- **Requirements**: ROCm 5.x installation
+- **Status**: Falls back to CPU unless ROCm is properly configured
+- **To enable AMD GPU**:
+  ```bash
+  # Install PyTorch with ROCm support
+  pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.6
+  ```
+- **Limitations**:
+  - No cuDNN optimizations
+  - May have compatibility issues
+  - Performance varies by GPU model
+
+### ✅ Apple Silicon (M1/M2/M3)
+- **Requirements**: macOS 12.3+
+- **Status**: Uses Metal Performance Shaders (MPS)
+- **Optimizations**:
+  - Native Metal acceleration
+  - Unified memory architecture benefits
+  - No FP16 (not well supported on MPS yet)
+
+### 📊 Performance Comparison
+
+| GPU Type | First Transcription | Subsequent | Notes |
+|----------|-------------------|------------|-------|
+| NVIDIA RTX 3080 | ~2s | ~0.5s | Full optimizations |
+| AMD RX 6800 XT | ~3-4s | ~1-2s | With ROCm |
+| Apple M2 | ~2.5s | ~1s | MPS acceleration |
+| CPU (i7-12700K) | ~5-10s | ~5-10s | No acceleration |
+
+## Checking Your GPU Status
+
+Run the app and check the logs:
+```
+INFO: NVIDIA GPU detected - using CUDA acceleration
+INFO: GPU memory allocated: 542.00 MB
+INFO: Whisper model loaded and optimized for NVIDIA GPU
+```
+
+## Troubleshooting
+
+### AMD GPU Not Detected
+1. Install ROCm-compatible PyTorch
+2. Set environment variable: `export HSA_OVERRIDE_GFX_VERSION=10.3.0`
+3. Check with: `rocm-smi`
+
+### NVIDIA GPU Not Used
+1. Check CUDA installation: `nvidia-smi`
+2. Verify PyTorch CUDA: `python -c "import torch; print(torch.cuda.is_available())"`
+3. Install CUDA toolkit if needed
+
+### Apple Silicon Not Accelerated
+1. Update macOS to 12.3+
+2. Update PyTorch: `pip install --upgrade torch`
+3. Check MPS: `python -c "import torch; print(torch.backends.mps.is_available())"`
--- a/MEMORY_MANAGEMENT.md
+++ b/MEMORY_MANAGEMENT.md
@@ -0,0 +1,285 @@
+# Memory Management Documentation
+
+This document describes the comprehensive memory management system implemented in Talk2Me to prevent memory leaks and crashes after extended use.
+
+## Overview
+
+Talk2Me implements a dual-layer memory management system:
+1. **Backend (Python)**: Manages GPU memory, Whisper model, and temporary files
+2. **Frontend (JavaScript)**: Manages audio blobs, object URLs, and Web Audio contexts
+
+## Memory Leak Issues Addressed
+
+### Backend Memory Leaks
+
+1. **GPU Memory Fragmentation**
+   - Whisper model accumulates GPU memory over time
+   - Solution: Periodic GPU cache clearing and model reloading
+
+2. **Temporary File Accumulation**
+   - Audio files not cleaned up quickly enough under load
+   - Solution: Aggressive cleanup with tracking and periodic sweeps
+
+3. **Session Resource Leaks**
+   - Long-lived sessions accumulate resources
+   - Solution: Integration with session manager for resource limits
+
+### Frontend Memory Leaks
+
+1. **Audio Blob Leaks**
+   - MediaRecorder chunks kept in memory
+   - Solution: SafeMediaRecorder wrapper with automatic cleanup
+
+2. **Object URL Leaks**
+   - URLs created but not revoked
+   - Solution: Centralized tracking and automatic revocation
+
+3. **AudioContext Leaks**
+   - Contexts created but never closed
+   - Solution: MemoryManager tracks and closes contexts
+
+4. **MediaStream Leaks**
+   - Microphone streams not properly stopped
+   - Solution: Automatic track stopping and stream cleanup
+
+## Backend Memory Management
+
+### MemoryManager Class
+
+The `MemoryManager` monitors and manages memory usage:
+
+```python
+memory_manager = MemoryManager(app, {
+    'memory_threshold_mb': 4096,      # 4GB process memory limit
+    'gpu_memory_threshold_mb': 2048,  # 2GB GPU memory limit
+    'cleanup_interval': 30            # Check every 30 seconds
+})
+```
+
+### Features
+
+1. **Automatic Monitoring**
+   - Background thread checks memory usage
+   - Triggers cleanup when thresholds exceeded
+   - Logs statistics every 5 minutes
+
+2. **GPU Memory Management**
+   - Clears CUDA cache after each operation
+   - Reloads Whisper model if fragmentation detected
+   - Tracks reload count and timing
+
+3. **Temporary File Cleanup**
+   - Tracks all temporary files
+   - Age-based cleanup (5 minutes normal, 1 minute aggressive)
+   - Cleanup on process exit
+
+4. **Context Managers**
+   ```python
+   with AudioProcessingContext(memory_manager) as ctx:
+       # Process audio
+       ctx.add_temp_file(temp_path)
+       # Files automatically cleaned up
+   ```
+
+### Admin Endpoints
+
+- `GET /admin/memory` - View current memory statistics
+- `POST /admin/memory/cleanup` - Trigger manual cleanup
+
+## Frontend Memory Management
+
+### MemoryManager Class
+
+Centralized tracking of all browser resources:
+
+```typescript
+const memoryManager = MemoryManager.getInstance();
+
+// Register resources
+memoryManager.registerAudioContext(context);
+memoryManager.registerObjectURL(url);
+memoryManager.registerMediaStream(stream);
+```
+
+### SafeMediaRecorder
+
+Wrapper for MediaRecorder with automatic cleanup:
+
+```typescript
+const recorder = new SafeMediaRecorder();
+await recorder.start(constraints);
+// Recording...
+const blob = await recorder.stop(); // Automatically cleans up
+```
+
+### AudioBlobHandler
+
+Safe handling of audio blobs and object URLs:
+
+```typescript
+const handler = new AudioBlobHandler(blob);
+const url = handler.getObjectURL(); // Tracked automatically
+// Use URL...
+handler.cleanup(); // Revokes URL and clears references
+```
+
+## Memory Thresholds
+
+### Backend Thresholds
+
+| Resource | Default Limit | Configurable Via |
+|----------|--------------|------------------|
+| Process Memory | 4096 MB | MEMORY_THRESHOLD_MB |
+| GPU Memory | 2048 MB | GPU_MEMORY_THRESHOLD_MB |
+| Temp File Age | 300 seconds | Built-in |
+| Model Reload Interval | 300 seconds | Built-in |
+
+### Frontend Thresholds
+
+| Resource | Cleanup Trigger |
+|----------|----------------|
+| Closed AudioContexts | Every 30 seconds |
+| Stopped MediaStreams | Every 30 seconds |
+| Orphaned Object URLs | On navigation/unload |
+
+## Best Practices
+
+### Backend
+
+1. **Use Context Managers**
+   ```python
+   @with_memory_management
+   def process_audio():
+       # Automatic cleanup
+   ```
+
+2. **Register Temporary Files**
+   ```python
+   register_temp_file(path)
+   ctx.add_temp_file(path)
+   ```
+
+3. **Clear GPU Memory**
+   ```python
+   torch.cuda.empty_cache()
+   torch.cuda.synchronize()
+   ```
+
+### Frontend
+
+1. **Use Safe Wrappers**
+   ```typescript
+   // Don't use raw MediaRecorder
+   const recorder = new SafeMediaRecorder();
+   ```
+
+2. **Clean Up Handlers**
+   ```typescript
+   if (audioHandler) {
+       audioHandler.cleanup();
+   }
+   ```
+
+3. **Register All Resources**
+   ```typescript
+   const context = new AudioContext();
+   memoryManager.registerAudioContext(context);
+   ```
+
+## Monitoring
+
+### Backend Monitoring
+
+```bash
+# View memory stats
+curl -H "X-Admin-Token: token" http://localhost:5005/admin/memory
+
+# Response
+{
+  "memory": {
+    "process_mb": 850.5,
+    "system_percent": 45.2,
+    "gpu_mb": 1250.0,
+    "gpu_percent": 61.0
+  },
+  "temp_files": {
+    "count": 5,
+    "size_mb": 12.5
+  },
+  "model": {
+    "reload_count": 2,
+    "last_reload": "2024-01-15T10:30:00"
+  }
+}
+```
+
+### Frontend Monitoring
+
+```javascript
+// Get memory stats
+const stats = memoryManager.getStats();
+console.log('Active contexts:', stats.audioContexts);
+console.log('Object URLs:', stats.objectURLs);
+```
+
+## Troubleshooting
+
+### High Memory Usage
+
+1. **Check Current Usage**
+   ```bash
+   curl -H "X-Admin-Token: token" http://localhost:5005/admin/memory
+   ```
+
+2. **Trigger Manual Cleanup**
+   ```bash
+   curl -X POST -H "X-Admin-Token: token" \
+     http://localhost:5005/admin/memory/cleanup
+   ```
+
+3. **Check Logs**
+   ```bash
+   grep "Memory" logs/talk2me.log
+   grep "GPU memory" logs/talk2me.log
+   ```
+
+### Memory Leak Symptoms
+
+1. **Backend**
+   - Process memory continuously increasing
+   - GPU memory not returning to baseline
+   - Temp files accumulating in upload folder
+   - Slower transcription over time
+
+2. **Frontend**
+   - Browser tab memory increasing
+   - Page becoming unresponsive
+   - Audio playback issues
+   - Console errors about contexts
+
+### Debug Mode
+
+Enable debug logging:
+```python
+# Backend
+app.config['DEBUG_MEMORY'] = True
+
+# Frontend (in console)
+localStorage.setItem('DEBUG_MEMORY', 'true');
+```
+
+## Performance Impact
+
+Memory management adds minimal overhead:
+- Backend: ~30ms per cleanup cycle
+- Frontend: <5ms per resource registration
+- Cleanup operations are non-blocking
+- Model reloading takes ~2-3 seconds (rare)
+
+## Future Enhancements
+
+1. **Predictive Cleanup**: Clean resources based on usage patterns
+2. **Memory Pooling**: Reuse audio buffers and contexts
+3. **Distributed Memory**: Share memory stats across instances
+4. **Alert System**: Notify admins of memory issues
+5. **Auto-scaling**: Scale resources based on memory pressure
--- a/PRODUCTION_DEPLOYMENT.md
+++ b/PRODUCTION_DEPLOYMENT.md
@@ -0,0 +1,435 @@
+# Production Deployment Guide
+
+This guide covers deploying Talk2Me in a production environment using a proper WSGI server.
+
+## Overview
+
+The Flask development server is not suitable for production use. This guide covers:
+- Gunicorn as the WSGI server
+- Nginx as a reverse proxy
+- Docker for containerization
+- Systemd for process management
+- Security best practices
+
+## Quick Start with Docker
+
+### 1. Using Docker Compose
+
+```bash
+# Clone the repository
+git clone https://github.com/your-repo/talk2me.git
+cd talk2me
+
+# Create .env file with production settings
+cat > .env <<EOF
+TTS_API_KEY=your-api-key
+ADMIN_TOKEN=your-secure-admin-token
+SECRET_KEY=your-secure-secret-key
+POSTGRES_PASSWORD=your-secure-db-password
+EOF
+
+# Build and start services
+docker-compose up -d
+
+# Check status
+docker-compose ps
+docker-compose logs -f talk2me
+```
+
+### 2. Using Docker (standalone)
+
+```bash
+# Build the image
+docker build -t talk2me .
+
+# Run the container
+docker run -d \
+  --name talk2me \
+  -p 5005:5005 \
+  -e TTS_API_KEY=your-api-key \
+  -e ADMIN_TOKEN=your-secure-token \
+  -e SECRET_KEY=your-secure-key \
+  -v $(pwd)/logs:/app/logs \
+  talk2me
+```
+
+## Manual Deployment
+
+### 1. System Requirements
+
+- Ubuntu 20.04+ or similar Linux distribution
+- Python 3.8+
+- Nginx
+- Systemd
+- 4GB+ RAM recommended
+- GPU (optional, for faster transcription)
+
+### 2. Installation
+
+Run the deployment script as root:
+
+```bash
+sudo ./deploy.sh
+```
+
+Or manually:
+
+```bash
+# Install system dependencies
+sudo apt-get update
+sudo apt-get install -y python3-pip python3-venv nginx
+
+# Create application user
+sudo useradd -m -s /bin/bash talk2me
+
+# Create directories
+sudo mkdir -p /opt/talk2me /var/log/talk2me
+sudo chown talk2me:talk2me /opt/talk2me /var/log/talk2me
+
+# Copy application files
+sudo cp -r . /opt/talk2me/
+sudo chown -R talk2me:talk2me /opt/talk2me
+
+# Install Python dependencies
+sudo -u talk2me python3 -m venv /opt/talk2me/venv
+sudo -u talk2me /opt/talk2me/venv/bin/pip install -r requirements-prod.txt
+
+# Configure and start services
+sudo cp talk2me.service /etc/systemd/system/
+sudo systemctl enable talk2me
+sudo systemctl start talk2me
+```
+
+## Gunicorn Configuration
+
+The `gunicorn_config.py` file contains production-ready settings:
+
+### Worker Configuration
+
+```python
+# Number of worker processes
+workers = multiprocessing.cpu_count() * 2 + 1
+
+# Worker timeout (increased for audio processing)
+timeout = 120
+
+# Restart workers periodically to prevent memory leaks
+max_requests = 1000
+max_requests_jitter = 50
+```
+
+### Performance Tuning
+
+For different workloads:
+
+```bash
+# CPU-bound (transcription heavy)
+export GUNICORN_WORKERS=8
+export GUNICORN_THREADS=1
+
+# I/O-bound (many concurrent requests)
+export GUNICORN_WORKERS=4
+export GUNICORN_THREADS=4
+export GUNICORN_WORKER_CLASS=gthread
+
+# Async (best concurrency)
+export GUNICORN_WORKER_CLASS=gevent
+export GUNICORN_WORKER_CONNECTIONS=1000
+```
+
+## Nginx Configuration
+
+### Basic Setup
+
+The provided `nginx.conf` includes:
+- Reverse proxy to Gunicorn
+- Static file serving
+- WebSocket support
+- Security headers
+- Gzip compression
+
+### SSL/TLS Setup
+
+```nginx
+server {
+    listen 443 ssl http2;
+    server_name your-domain.com;
+    
+    ssl_certificate /etc/letsencrypt/live/your-domain.com/fullchain.pem;
+    ssl_certificate_key /etc/letsencrypt/live/your-domain.com/privkey.pem;
+    
+    # Strong SSL configuration
+    ssl_protocols TLSv1.2 TLSv1.3;
+    ssl_ciphers ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256;
+    ssl_prefer_server_ciphers off;
+    
+    # HSTS
+    add_header Strict-Transport-Security "max-age=63072000" always;
+}
+```
+
+## Environment Variables
+
+### Required
+
+```bash
+# Security
+SECRET_KEY=your-very-secure-secret-key
+ADMIN_TOKEN=your-admin-api-token
+
+# TTS Configuration
+TTS_API_KEY=your-tts-api-key
+TTS_SERVER_URL=http://your-tts-server:5050/v1/audio/speech
+
+# Flask
+FLASK_ENV=production
+```
+
+### Optional
+
+```bash
+# Performance
+GUNICORN_WORKERS=4
+GUNICORN_THREADS=2
+MEMORY_THRESHOLD_MB=4096
+GPU_MEMORY_THRESHOLD_MB=2048
+
+# Database (for session storage)
+DATABASE_URL=postgresql://user:pass@localhost/talk2me
+REDIS_URL=redis://localhost:6379/0
+
+# Monitoring
+SENTRY_DSN=your-sentry-dsn
+```
+
+## Monitoring
+
+### Health Checks
+
+```bash
+# Basic health check
+curl http://localhost:5005/health
+
+# Detailed health check
+curl http://localhost:5005/health/detailed
+
+# Memory usage
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/memory
+```
+
+### Logs
+
+```bash
+# Application logs
+tail -f /var/log/talk2me/talk2me.log
+
+# Error logs
+tail -f /var/log/talk2me/errors.log
+
+# Gunicorn logs
+journalctl -u talk2me -f
+
+# Nginx logs
+tail -f /var/log/nginx/access.log
+tail -f /var/log/nginx/error.log
+```
+
+### Metrics
+
+With Prometheus client installed:
+
+```bash
+# Prometheus metrics endpoint
+curl http://localhost:5005/metrics
+```
+
+## Scaling
+
+### Horizontal Scaling
+
+For multiple servers:
+
+1. Use Redis for session storage
+2. Use PostgreSQL for persistent data
+3. Load balance with Nginx:
+
+```nginx
+upstream talk2me_backends {
+    least_conn;
+    server server1:5005 weight=1;
+    server server2:5005 weight=1;
+    server server3:5005 weight=1;
+}
+```
+
+### Vertical Scaling
+
+Adjust based on load:
+
+```bash
+# High memory usage
+MEMORY_THRESHOLD_MB=8192
+GPU_MEMORY_THRESHOLD_MB=4096
+
+# More workers
+GUNICORN_WORKERS=16
+GUNICORN_THREADS=4
+
+# Larger file limits
+client_max_body_size 100M;
+```
+
+## Security
+
+### Firewall
+
+```bash
+# Allow only necessary ports
+sudo ufw allow 80/tcp
+sudo ufw allow 443/tcp
+sudo ufw allow 22/tcp
+sudo ufw enable
+```
+
+### File Permissions
+
+```bash
+# Secure file permissions
+sudo chmod 750 /opt/talk2me
+sudo chmod 640 /opt/talk2me/.env
+sudo chmod 755 /opt/talk2me/static
+```
+
+### AppArmor/SELinux
+
+Create security profiles to restrict application access.
+
+## Backup
+
+### Database Backup
+
+```bash
+# PostgreSQL
+pg_dump talk2me > backup.sql
+
+# Redis
+redis-cli BGSAVE
+```
+
+### Application Backup
+
+```bash
+# Backup application and logs
+tar -czf talk2me-backup.tar.gz \
+  /opt/talk2me \
+  /var/log/talk2me \
+  /etc/systemd/system/talk2me.service \
+  /etc/nginx/sites-available/talk2me
+```
+
+## Troubleshooting
+
+### Service Won't Start
+
+```bash
+# Check service status
+systemctl status talk2me
+
+# Check logs
+journalctl -u talk2me -n 100
+
+# Test configuration
+sudo -u talk2me /opt/talk2me/venv/bin/gunicorn --check-config wsgi:application
+```
+
+### High Memory Usage
+
+```bash
+# Trigger cleanup
+curl -X POST -H "X-Admin-Token: token" http://localhost:5005/admin/memory/cleanup
+
+# Restart workers
+systemctl reload talk2me
+```
+
+### Slow Response Times
+
+1. Check worker count
+2. Enable async workers
+3. Check GPU availability
+4. Review nginx buffering settings
+
+## Performance Optimization
+
+### 1. Enable GPU
+
+Ensure CUDA/ROCm is properly installed:
+
+```bash
+# Check GPU
+nvidia-smi  # or rocm-smi
+
+# Set in environment
+export CUDA_VISIBLE_DEVICES=0
+```
+
+### 2. Optimize Workers
+
+```python
+# For CPU-heavy workloads
+workers = cpu_count()
+threads = 1
+
+# For I/O-heavy workloads
+workers = cpu_count() * 2
+threads = 4
+```
+
+### 3. Enable Caching
+
+Use Redis for caching translations:
+
+```python
+CACHE_TYPE = 'redis'
+CACHE_REDIS_URL = 'redis://localhost:6379/0'
+```
+
+## Maintenance
+
+### Regular Tasks
+
+1. **Log Rotation**: Configured automatically
+2. **Database Cleanup**: Run weekly
+3. **Model Updates**: Check for Whisper updates
+4. **Security Updates**: Keep dependencies updated
+
+### Update Procedure
+
+```bash
+# Backup first
+./backup.sh
+
+# Update code
+git pull
+
+# Update dependencies
+sudo -u talk2me /opt/talk2me/venv/bin/pip install -r requirements-prod.txt
+
+# Restart service
+sudo systemctl restart talk2me
+```
+
+## Rollback
+
+If deployment fails:
+
+```bash
+# Stop service
+sudo systemctl stop talk2me
+
+# Restore backup
+tar -xzf talk2me-backup.tar.gz -C /
+
+# Restart service
+sudo systemctl start talk2me
+```
--- a/RATE_LIMITING.md
+++ b/RATE_LIMITING.md
@@ -0,0 +1,235 @@
+# Rate Limiting Documentation
+
+This document describes the rate limiting implementation in Talk2Me to protect against DoS attacks and resource exhaustion.
+
+## Overview
+
+Talk2Me implements a comprehensive rate limiting system with:
+- Token bucket algorithm with sliding window
+- Per-endpoint configurable limits
+- IP-based blocking (temporary and permanent)
+- Global request limits
+- Concurrent request throttling
+- Request size validation
+
+## Rate Limits by Endpoint
+
+### Transcription (`/transcribe`)
+- **Per Minute**: 10 requests
+- **Per Hour**: 100 requests
+- **Burst Size**: 3 requests
+- **Max Request Size**: 10MB
+- **Token Refresh**: 1 token per 6 seconds
+
+### Translation (`/translate`)
+- **Per Minute**: 20 requests
+- **Per Hour**: 300 requests
+- **Burst Size**: 5 requests
+- **Max Request Size**: 100KB
+- **Token Refresh**: 1 token per 3 seconds
+
+### Streaming Translation (`/translate/stream`)
+- **Per Minute**: 10 requests
+- **Per Hour**: 150 requests
+- **Burst Size**: 3 requests
+- **Max Request Size**: 100KB
+- **Token Refresh**: 1 token per 6 seconds
+
+### Text-to-Speech (`/speak`)
+- **Per Minute**: 15 requests
+- **Per Hour**: 200 requests
+- **Burst Size**: 3 requests
+- **Max Request Size**: 50KB
+- **Token Refresh**: 1 token per 4 seconds
+
+### API Endpoints
+- Push notifications, error logging: Various limits (see code)
+
+## Global Limits
+
+- **Total Requests Per Minute**: 1,000 (across all endpoints)
+- **Total Requests Per Hour**: 10,000
+- **Concurrent Requests**: 50 maximum
+
+## Rate Limiting Headers
+
+Successful responses include:
+```
+X-RateLimit-Limit: 20
+X-RateLimit-Remaining: 15
+X-RateLimit-Reset: 1234567890
+```
+
+Rate limited responses (429) include:
+```
+X-RateLimit-Limit: 20
+X-RateLimit-Remaining: 0
+X-RateLimit-Reset: 1234567890
+Retry-After: 60
+```
+
+## Client Identification
+
+Clients are identified by:
+- IP address (including X-Forwarded-For support)
+- User-Agent string
+- Combined hash for uniqueness
+
+## Automatic Blocking
+
+IPs are temporarily blocked for 1 hour if:
+- They exceed 100 requests per minute
+- They repeatedly hit rate limits
+- They exhibit suspicious patterns
+
+## Configuration
+
+### Environment Variables
+
+```bash
+# No direct environment variables for rate limiting
+# Configured in code - can be extended to use env vars
+```
+
+### Programmatic Configuration
+
+Rate limits can be adjusted in `rate_limiter.py`:
+
+```python
+self.endpoint_limits = {
+    '/transcribe': {
+        'requests_per_minute': 10,
+        'requests_per_hour': 100,
+        'burst_size': 3,
+        'token_refresh_rate': 0.167,
+        'max_request_size': 10 * 1024 * 1024  # 10MB
+    }
+}
+```
+
+## Admin Endpoints
+
+### Get Rate Limit Configuration
+```bash
+curl -H "X-Admin-Token: your-admin-token" \
+  http://localhost:5005/admin/rate-limits
+```
+
+### Get Rate Limit Statistics
+```bash
+# Global stats
+curl -H "X-Admin-Token: your-admin-token" \
+  http://localhost:5005/admin/rate-limits/stats
+
+# Client-specific stats
+curl -H "X-Admin-Token: your-admin-token" \
+  http://localhost:5005/admin/rate-limits/stats?client_id=abc123
+```
+
+### Block IP Address
+```bash
+# Temporary block (1 hour)
+curl -X POST -H "X-Admin-Token: your-admin-token" \
+  -H "Content-Type: application/json" \
+  -d '{"ip": "192.168.1.100", "duration": 3600}' \
+  http://localhost:5005/admin/block-ip
+
+# Permanent block
+curl -X POST -H "X-Admin-Token: your-admin-token" \
+  -H "Content-Type: application/json" \
+  -d '{"ip": "192.168.1.100", "permanent": true}' \
+  http://localhost:5005/admin/block-ip
+```
+
+## Algorithm Details
+
+### Token Bucket
+- Each client gets a bucket with configurable burst size
+- Tokens regenerate at a fixed rate
+- Requests consume tokens
+- Empty bucket = request denied
+
+### Sliding Window
+- Tracks requests in the last minute and hour
+- More accurate than fixed windows
+- Prevents gaming the system at window boundaries
+
+## Best Practices
+
+### For Users
+1. Implement exponential backoff when receiving 429 errors
+2. Check rate limit headers to avoid hitting limits
+3. Cache responses when possible
+4. Use bulk operations where available
+
+### For Administrators
+1. Monitor rate limit statistics regularly
+2. Adjust limits based on usage patterns
+3. Use IP blocking sparingly
+4. Set up alerts for suspicious activity
+
+## Error Responses
+
+### Rate Limited (429)
+```json
+{
+  "error": "Rate limit exceeded (per minute)",
+  "retry_after": 60
+}
+```
+
+### Request Too Large (413)
+```json
+{
+  "error": "Request too large"
+}
+```
+
+### IP Blocked (429)
+```json
+{
+  "error": "IP temporarily blocked due to excessive requests"
+}
+```
+
+## Monitoring
+
+Key metrics to monitor:
+- Rate limit hits by endpoint
+- Blocked IPs
+- Concurrent request peaks
+- Request size violations
+- Global limit approaches
+
+## Performance Impact
+
+- Minimal overhead (~1-2ms per request)
+- Memory usage scales with active clients
+- Automatic cleanup of old buckets
+- Thread-safe implementation
+
+## Security Considerations
+
+1. **DoS Protection**: Prevents resource exhaustion
+2. **Burst Control**: Limits sudden traffic spikes
+3. **Size Validation**: Prevents large payload attacks
+4. **IP Blocking**: Stops persistent attackers
+5. **Global Limits**: Protects overall system capacity
+
+## Troubleshooting
+
+### "Rate limit exceeded" errors
+- Check client request patterns
+- Verify time synchronization
+- Look for retry loops
+- Check IP blocking status
+
+### Memory usage increasing
+- Verify cleanup thread is running
+- Check for client ID explosion
+- Monitor bucket count
+
+### Legitimate users blocked
+- Review rate limit settings
+- Check for shared IP issues
+- Implement IP whitelisting if needed
--- a/README.md
+++ b/README.md
@@ -29,19 +29,34 @@ A mobile-friendly web application that translates spoken language between multip
   pip install -r requirements.txt
   ```

-2. Make sure you have Ollama installed and the Gemma 3 model loaded:
+2. Configure secrets and environment:
+   ```bash
+   # Initialize secure secrets management
+   python manage_secrets.py init
+   
+   # Set required secrets
+   python manage_secrets.py set TTS_API_KEY
+   
+   # Or use traditional .env file
+   cp .env.example .env
+   nano .env
+   ```
+   
+   **⚠️ Security Note**: Talk2Me includes encrypted secrets management. See [SECURITY.md](SECURITY.md) and [SECRETS_MANAGEMENT.md](SECRETS_MANAGEMENT.md) for details.
+
+3. Make sure you have Ollama installed and the Gemma 3 model loaded:
   ```
   ollama pull gemma3
   ```

-3. Ensure your OpenAI Edge TTS server is running on port 5050.
+4. Ensure your OpenAI Edge TTS server is running on port 5050.

-4. Run the application:
+5. Run the application:
   ```
   python app.py
   ```

-5. Open your browser and navigate to:
+6. Open your browser and navigate to:
   ```
   http://localhost:8000
   ```
@@ -64,6 +79,102 @@ A mobile-friendly web application that translates spoken language between multip
 - Ollama provides access to the Gemma 3 model for translation
 - OpenAI Edge TTS delivers natural-sounding speech output

+## CORS Configuration
+
+The application supports Cross-Origin Resource Sharing (CORS) for secure cross-origin usage. See [CORS_CONFIG.md](CORS_CONFIG.md) for detailed configuration instructions.
+
+Quick setup:
+```bash
+# Development (allow all origins)
+export CORS_ORIGINS="*"
+
+# Production (restrict to specific domains)
+export CORS_ORIGINS="https://yourdomain.com,https://app.yourdomain.com"
+export ADMIN_CORS_ORIGINS="https://admin.yourdomain.com"
+```
+
+## Connection Retry & Offline Support
+
+Talk2Me handles network interruptions gracefully with automatic retry logic:
+- Automatic request queuing during connection loss
+- Exponential backoff retry with configurable parameters
+- Visual connection status indicators
+- Priority-based request processing
+
+See [CONNECTION_RETRY.md](CONNECTION_RETRY.md) for detailed documentation.
+
+## Rate Limiting
+
+Comprehensive rate limiting protects against DoS attacks and resource exhaustion:
+- Token bucket algorithm with sliding window
+- Per-endpoint configurable limits
+- Automatic IP blocking for abusive clients
+- Global request limits and concurrent request throttling
+- Request size validation
+
+See [RATE_LIMITING.md](RATE_LIMITING.md) for detailed documentation.
+
+## Session Management
+
+Advanced session management prevents resource leaks from abandoned sessions:
+- Automatic tracking of all session resources (audio files, temp files)
+- Per-session resource limits (100 files, 100MB)
+- Automatic cleanup of idle sessions (15 minutes) and expired sessions (1 hour)
+- Real-time monitoring and metrics
+- Manual cleanup capabilities for administrators
+
+See [SESSION_MANAGEMENT.md](SESSION_MANAGEMENT.md) for detailed documentation.
+
+## Request Size Limits
+
+Comprehensive request size limiting prevents memory exhaustion:
+- Global limit: 50MB for any request
+- Audio files: 25MB maximum
+- JSON payloads: 1MB maximum
+- File type detection and enforcement
+- Dynamic configuration via admin API
+
+See [REQUEST_SIZE_LIMITS.md](REQUEST_SIZE_LIMITS.md) for detailed documentation.
+
+## Error Logging
+
+Production-ready error logging system for debugging and monitoring:
+- Structured JSON logs for easy parsing
+- Multiple log streams (app, errors, access, security, performance)
+- Automatic log rotation to prevent disk exhaustion
+- Request tracing with unique IDs
+- Performance metrics and slow request tracking
+- Admin endpoints for log analysis
+
+See [ERROR_LOGGING.md](ERROR_LOGGING.md) for detailed documentation.
+
+## Memory Management
+
+Comprehensive memory leak prevention for extended use:
+- GPU memory management with automatic cleanup
+- Whisper model reloading to prevent fragmentation
+- Frontend resource tracking (audio blobs, contexts, streams)
+- Automatic cleanup of temporary files
+- Memory monitoring and manual cleanup endpoints
+
+See [MEMORY_MANAGEMENT.md](MEMORY_MANAGEMENT.md) for detailed documentation.
+
+## Production Deployment
+
+For production use, deploy with a proper WSGI server:
+- Gunicorn with optimized worker configuration
+- Nginx reverse proxy with caching
+- Docker/Docker Compose support
+- Systemd service management
+- Comprehensive security hardening
+
+Quick start:
+```bash
+docker-compose up -d
+```
+
+See [PRODUCTION_DEPLOYMENT.md](PRODUCTION_DEPLOYMENT.md) for detailed deployment instructions.
+
 ## Mobile Support

 The interface is fully responsive and designed to work well on mobile devices.
--- a/README_TYPESCRIPT.md
+++ b/README_TYPESCRIPT.md
@@ -0,0 +1,54 @@
+# TypeScript Setup for Talk2Me
+
+This project now includes TypeScript support for better type safety and developer experience.
+
+## Installation
+
+1. Install Node.js dependencies:
+```bash
+npm install
+```
+
+2. Build TypeScript files:
+```bash
+npm run build
+```
+
+## Development
+
+For development with automatic recompilation:
+```bash
+npm run watch
+# or
+npm run dev
+```
+
+## Project Structure
+
+- `/static/js/src/` - TypeScript source files
+  - `app.ts` - Main application logic
+  - `types.ts` - Type definitions
+- `/static/js/dist/` - Compiled JavaScript files (git-ignored)
+- `tsconfig.json` - TypeScript configuration
+- `package.json` - Node.js dependencies and scripts
+
+## Available Scripts
+
+- `npm run build` - Compile TypeScript to JavaScript
+- `npm run watch` - Watch for changes and recompile
+- `npm run dev` - Same as watch
+- `npm run clean` - Remove compiled files
+- `npm run type-check` - Type-check without compiling
+
+## Type Safety Benefits
+
+The TypeScript implementation provides:
+- Compile-time type checking
+- Better IDE support with autocomplete
+- Explicit interface definitions for API responses
+- Safer refactoring
+- Self-documenting code
+
+## Next Steps
+
+After building, the compiled JavaScript will be in `/static/js/dist/app.js` and will be automatically loaded by the HTML template.
--- a/REQUEST_SIZE_LIMITS.md
+++ b/REQUEST_SIZE_LIMITS.md
@@ -0,0 +1,332 @@
+# Request Size Limits Documentation
+
+This document describes the request size limiting system implemented in Talk2Me to prevent memory exhaustion from large uploads.
+
+## Overview
+
+Talk2Me implements comprehensive request size limiting to protect against:
+- Memory exhaustion from large file uploads
+- Denial of Service (DoS) attacks using oversized requests
+- Buffer overflow attempts
+- Resource starvation from unbounded requests
+
+## Default Limits
+
+### Global Limits
+- **Maximum Content Length**: 50MB - Absolute maximum for any request
+- **Maximum Audio File Size**: 25MB - For audio uploads (transcription)
+- **Maximum JSON Payload**: 1MB - For API requests
+- **Maximum Image Size**: 10MB - For future image processing features
+- **Maximum Chunk Size**: 1MB - For streaming uploads
+
+## Features
+
+### 1. Multi-Layer Protection
+
+The system implements multiple layers of size checking:
+- Flask's built-in `MAX_CONTENT_LENGTH` configuration
+- Pre-request validation before data is loaded into memory
+- File-type specific limits
+- Endpoint-specific limits
+- Streaming request monitoring
+
+### 2. File Type Detection
+
+Automatic detection and enforcement based on file extensions:
+- Audio files: `.wav`, `.mp3`, `.ogg`, `.webm`, `.m4a`, `.flac`, `.aac`
+- Image files: `.jpg`, `.jpeg`, `.png`, `.gif`, `.webp`, `.bmp`
+- JSON payloads: Content-Type header detection
+
+### 3. Graceful Error Handling
+
+When limits are exceeded:
+- Returns 413 (Request Entity Too Large) status code
+- Provides clear error messages with size information
+- Includes both actual and allowed sizes
+- Human-readable size formatting
+
+## Configuration
+
+### Environment Variables
+
+```bash
+# Set limits via environment variables (in bytes)
+export MAX_CONTENT_LENGTH=52428800      # 50MB
+export MAX_AUDIO_SIZE=26214400          # 25MB
+export MAX_JSON_SIZE=1048576            # 1MB
+export MAX_IMAGE_SIZE=10485760          # 10MB
+```
+
+### Flask Configuration
+
+```python
+# In config.py or app.py
+app.config.update({
+    'MAX_CONTENT_LENGTH': 50 * 1024 * 1024,    # 50MB
+    'MAX_AUDIO_SIZE': 25 * 1024 * 1024,        # 25MB
+    'MAX_JSON_SIZE': 1 * 1024 * 1024,          # 1MB
+    'MAX_IMAGE_SIZE': 10 * 1024 * 1024         # 10MB
+})
+```
+
+### Dynamic Configuration
+
+Size limits can be updated at runtime via admin API.
+
+## API Endpoints
+
+### GET /admin/size-limits
+Get current size limits.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/size-limits
+```
+
+Response:
+```json
+{
+  "limits": {
+    "max_content_length": 52428800,
+    "max_audio_size": 26214400,
+    "max_json_size": 1048576,
+    "max_image_size": 10485760
+  },
+  "limits_human": {
+    "max_content_length": "50.0MB",
+    "max_audio_size": "25.0MB",
+    "max_json_size": "1.0MB",
+    "max_image_size": "10.0MB"
+  }
+}
+```
+
+### POST /admin/size-limits
+Update size limits dynamically.
+
+```bash
+curl -X POST -H "X-Admin-Token: your-token" \
+  -H "Content-Type: application/json" \
+  -d '{"max_audio_size": "30MB", "max_json_size": 2097152}' \
+  http://localhost:5005/admin/size-limits
+```
+
+Response:
+```json
+{
+  "success": true,
+  "old_limits": {...},
+  "new_limits": {...},
+  "new_limits_human": {
+    "max_audio_size": "30.0MB",
+    "max_json_size": "2.0MB"
+  }
+}
+```
+
+## Usage Examples
+
+### 1. Endpoint-Specific Limits
+
+```python
+@app.route('/upload')
+@limit_request_size(max_size=10*1024*1024)  # 10MB limit
+def upload():
+    # Handle upload
+    pass
+
+@app.route('/upload-audio')
+@limit_request_size(max_audio_size=30*1024*1024)  # 30MB for audio
+def upload_audio():
+    # Handle audio upload
+    pass
+```
+
+### 2. Client-Side Validation
+
+```javascript
+// Check file size before upload
+const MAX_AUDIO_SIZE = 25 * 1024 * 1024; // 25MB
+
+function validateAudioFile(file) {
+    if (file.size > MAX_AUDIO_SIZE) {
+        alert(`Audio file too large. Maximum size is ${MAX_AUDIO_SIZE / 1024 / 1024}MB`);
+        return false;
+    }
+    return true;
+}
+```
+
+### 3. Chunked Uploads (Future Enhancement)
+
+```javascript
+// For files larger than limits, use chunked upload
+async function uploadLargeFile(file, chunkSize = 1024 * 1024) {
+    const chunks = Math.ceil(file.size / chunkSize);
+    
+    for (let i = 0; i < chunks; i++) {
+        const start = i * chunkSize;
+        const end = Math.min(start + chunkSize, file.size);
+        const chunk = file.slice(start, end);
+        
+        await uploadChunk(chunk, i, chunks);
+    }
+}
+```
+
+## Error Responses
+
+### 413 Request Entity Too Large
+
+When a request exceeds size limits:
+
+```json
+{
+  "error": "Request too large",
+  "max_size": 52428800,
+  "your_size": 75000000,
+  "max_size_mb": 50.0
+}
+```
+
+### File-Specific Errors
+
+For audio files:
+```json
+{
+  "error": "Audio file too large",
+  "max_size": 26214400,
+  "your_size": 35000000,
+  "max_size_mb": 25.0
+}
+```
+
+For JSON payloads:
+```json
+{
+  "error": "JSON payload too large",
+  "max_size": 1048576,
+  "your_size": 2000000,
+  "max_size_kb": 1024.0
+}
+```
+
+## Best Practices
+
+### 1. Client-Side Validation
+
+Always validate file sizes on the client side:
+```javascript
+// Add to static/js/app.js
+const SIZE_LIMITS = {
+    audio: 25 * 1024 * 1024,  // 25MB
+    json: 1 * 1024 * 1024,    // 1MB
+};
+
+function checkFileSize(file, type) {
+    const limit = SIZE_LIMITS[type];
+    if (file.size > limit) {
+        showError(`File too large. Maximum size: ${formatSize(limit)}`);
+        return false;
+    }
+    return true;
+}
+```
+
+### 2. Progressive Enhancement
+
+For better UX with large files:
+- Show upload progress
+- Implement resumable uploads
+- Compress audio client-side when possible
+- Use appropriate audio formats (WebM/Opus for smaller sizes)
+
+### 3. Server Configuration
+
+Configure your web server (Nginx/Apache) to also enforce limits:
+
+**Nginx:**
+```nginx
+client_max_body_size 50M;
+client_body_buffer_size 1M;
+```
+
+**Apache:**
+```apache
+LimitRequestBody 52428800
+```
+
+### 4. Monitoring
+
+Monitor size limit violations:
+- Track 413 errors in logs
+- Alert on repeated violations from same IP
+- Adjust limits based on usage patterns
+
+## Security Considerations
+
+1. **Memory Protection**: Pre-flight size checks prevent loading large files into memory
+2. **DoS Prevention**: Limits prevent attackers from exhausting server resources
+3. **Bandwidth Protection**: Prevents bandwidth exhaustion from large uploads
+4. **Storage Protection**: Works with session management to limit total storage per user
+
+## Integration with Other Systems
+
+### Rate Limiting
+Size limits work in conjunction with rate limiting:
+- Large requests count more against rate limits
+- Repeated size violations can trigger IP blocking
+
+### Session Management
+Size limits are enforced per session:
+- Total storage per session is limited
+- Large files count against session resource limits
+
+### Monitoring
+Size limit violations are tracked in:
+- Application logs
+- Health check endpoints
+- Admin monitoring dashboards
+
+## Troubleshooting
+
+### Common Issues
+
+#### 1. Legitimate Large Files Rejected
+
+If users need to upload larger files:
+```bash
+# Increase limit for audio files to 50MB
+curl -X POST -H "X-Admin-Token: token" \
+  -d '{"max_audio_size": "50MB"}' \
+  http://localhost:5005/admin/size-limits
+```
+
+#### 2. Chunked Transfer Encoding
+
+For requests without Content-Length header:
+- The system monitors the stream
+- Terminates connection if size exceeded
+- May require special handling for some clients
+
+#### 3. Load Balancer Limits
+
+Ensure your load balancer also enforces appropriate limits:
+- AWS ALB: Configure request size limits
+- Cloudflare: Set upload size limits
+- Nginx: Configure client_max_body_size
+
+## Performance Impact
+
+The size limiting system has minimal performance impact:
+- Pre-flight checks are O(1) operations
+- No buffering of large requests
+- Early termination of oversized requests
+- Efficient memory usage
+
+## Future Enhancements
+
+1. **Chunked Upload Support**: Native support for resumable uploads
+2. **Compression Detection**: Automatic handling of compressed uploads
+3. **Dynamic Limits**: Per-user or per-tier size limits
+4. **Bandwidth Throttling**: Rate limit large uploads
+5. **Storage Quotas**: Long-term storage limits per user
--- a/SECRETS_MANAGEMENT.md
+++ b/SECRETS_MANAGEMENT.md
@@ -0,0 +1,411 @@
+# Secrets Management Documentation
+
+This document describes the secure secrets management system implemented in Talk2Me.
+
+## Overview
+
+Talk2Me uses a comprehensive secrets management system that provides:
+- Encrypted storage of sensitive configuration
+- Secret rotation capabilities
+- Audit logging
+- Integrity verification
+- CLI management tools
+- Environment variable integration
+
+## Architecture
+
+### Components
+
+1. **SecretsManager** (`secrets_manager.py`)
+   - Handles encryption/decryption using Fernet (AES-128)
+   - Manages secret lifecycle (create, read, update, delete)
+   - Provides audit logging
+   - Supports secret rotation
+
+2. **Configuration System** (`config.py`)
+   - Integrates secrets with Flask configuration
+   - Environment-specific configurations
+   - Validation and sanitization
+
+3. **CLI Tool** (`manage_secrets.py`)
+   - Command-line interface for secret management
+   - Interactive and scriptable
+
+### Security Features
+
+- **Encryption**: AES-128 encryption using cryptography.fernet
+- **Key Derivation**: PBKDF2 with SHA256 (100,000 iterations)
+- **Master Key**: Stored separately with restricted permissions
+- **Audit Trail**: All access and modifications logged
+- **Integrity Checks**: Verify secrets haven't been tampered with
+
+## Quick Start
+
+### 1. Initialize Secrets
+
+```bash
+python manage_secrets.py init
+```
+
+This will:
+- Generate a master encryption key
+- Create initial secrets (Flask secret key, admin token)
+- Prompt for required secrets (TTS API key)
+
+### 2. Set a Secret
+
+```bash
+# Interactive (hidden input)
+python manage_secrets.py set TTS_API_KEY
+
+# Direct (be careful with shell history)
+python manage_secrets.py set TTS_API_KEY --value "your-api-key"
+
+# With metadata
+python manage_secrets.py set API_KEY --value "key" --metadata '{"service": "external-api"}'
+```
+
+### 3. List Secrets
+
+```bash
+python manage_secrets.py list
+```
+
+Output:
+```
+Key                            Created             Last Rotated         Has Value
+-------------------------------------------------------------------------------------
+FLASK_SECRET_KEY              2024-01-15          2024-01-20          ✓
+TTS_API_KEY                   2024-01-15          Never               ✓
+ADMIN_TOKEN                   2024-01-15          2024-01-18          ✓
+```
+
+### 4. Rotate Secrets
+
+```bash
+# Rotate a specific secret
+python manage_secrets.py rotate ADMIN_TOKEN
+
+# Check which secrets need rotation
+python manage_secrets.py check-rotation
+
+# Schedule automatic rotation
+python manage_secrets.py schedule-rotation API_KEY 30  # Every 30 days
+```
+
+## Configuration
+
+### Environment Variables
+
+The secrets manager checks these locations in order:
+1. Encrypted secrets storage (`.secrets.json`)
+2. `SECRET_<KEY>` environment variable
+3. `<KEY>` environment variable
+4. Default value
+
+### Master Key
+
+The master encryption key is loaded from:
+1. `MASTER_KEY` environment variable
+2. `.master_key` file (default)
+3. Auto-generated if neither exists
+
+**Important**: Protect the master key!
+- Set file permissions: `chmod 600 .master_key`
+- Back it up securely
+- Never commit to version control
+
+### Flask Integration
+
+Secrets are automatically loaded into Flask configuration:
+
+```python
+# In app.py
+from config import init_app as init_config
+from secrets_manager import init_app as init_secrets
+
+app = Flask(__name__)
+init_config(app)
+init_secrets(app)
+
+# Access secrets
+api_key = app.config['TTS_API_KEY']
+```
+
+## CLI Commands
+
+### Basic Operations
+
+```bash
+# List all secrets
+python manage_secrets.py list
+
+# Get a secret value (requires confirmation)
+python manage_secrets.py get TTS_API_KEY
+
+# Set a secret
+python manage_secrets.py set DATABASE_URL
+
+# Delete a secret
+python manage_secrets.py delete OLD_API_KEY
+
+# Rotate a secret
+python manage_secrets.py rotate ADMIN_TOKEN
+```
+
+### Advanced Operations
+
+```bash
+# Verify integrity of all secrets
+python manage_secrets.py verify
+
+# Migrate from environment variables
+python manage_secrets.py migrate
+
+# View audit log
+python manage_secrets.py audit
+python manage_secrets.py audit TTS_API_KEY --limit 50
+
+# Schedule rotation
+python manage_secrets.py schedule-rotation API_KEY 90
+```
+
+## Security Best Practices
+
+### 1. File Permissions
+
+```bash
+# Secure the secrets files
+chmod 600 .secrets.json
+chmod 600 .master_key
+```
+
+### 2. Backup Strategy
+
+- Back up `.master_key` separately from `.secrets.json`
+- Store backups in different secure locations
+- Test restore procedures regularly
+
+### 3. Rotation Policy
+
+Recommended rotation intervals:
+- API Keys: 90 days
+- Admin Tokens: 30 days
+- Database Passwords: 180 days
+- Encryption Keys: 365 days
+
+### 4. Access Control
+
+- Use environment-specific secrets
+- Implement least privilege access
+- Audit secret access regularly
+
+### 5. Git Security
+
+Ensure these files are in `.gitignore`:
+```
+.secrets.json
+.master_key
+secrets.db
+*.key
+```
+
+## Deployment
+
+### Development
+
+```bash
+# Use .env file for convenience
+cp .env.example .env
+# Edit .env with development values
+
+# Initialize secrets
+python manage_secrets.py init
+```
+
+### Production
+
+```bash
+# Set master key via environment
+export MASTER_KEY="your-production-master-key"
+
+# Or use a key management service
+export MASTER_KEY_FILE="/secure/path/to/master.key"
+
+# Load secrets from secure storage
+python manage_secrets.py set TTS_API_KEY --value "$TTS_API_KEY"
+python manage_secrets.py set ADMIN_TOKEN --value "$ADMIN_TOKEN"
+```
+
+### Docker
+
+```dockerfile
+# Dockerfile
+FROM python:3.9
+
+# Copy encrypted secrets (not the master key!)
+COPY .secrets.json /app/.secrets.json
+
+# Master key provided at runtime
+ENV MASTER_KEY=""
+
+# Run with:
+# docker run -e MASTER_KEY="$MASTER_KEY" myapp
+```
+
+### Kubernetes
+
+```yaml
+# secret.yaml
+apiVersion: v1
+kind: Secret
+metadata:
+  name: talk2me-master-key
+type: Opaque
+stringData:
+  master-key: "your-master-key"
+
+---
+# deployment.yaml
+apiVersion: apps/v1
+kind: Deployment
+spec:
+  template:
+    spec:
+      containers:
+      - name: talk2me
+        env:
+        - name: MASTER_KEY
+          valueFrom:
+            secretKeyRef:
+              name: talk2me-master-key
+              key: master-key
+```
+
+## Troubleshooting
+
+### Lost Master Key
+
+If you lose the master key:
+1. You'll need to recreate all secrets
+2. Generate new master key: `python manage_secrets.py init`
+3. Re-enter all secret values
+
+### Corrupted Secrets File
+
+```bash
+# Check integrity
+python manage_secrets.py verify
+
+# If corrupted, restore from backup or reinitialize
+```
+
+### Permission Errors
+
+```bash
+# Fix file permissions
+chmod 600 .secrets.json .master_key
+chown $USER:$USER .secrets.json .master_key
+```
+
+## Monitoring
+
+### Audit Logs
+
+Review secret access patterns:
+```bash
+# View all audit entries
+python manage_secrets.py audit
+
+# Check specific secret
+python manage_secrets.py audit TTS_API_KEY
+
+# Export for analysis
+python manage_secrets.py audit > audit.log
+```
+
+### Rotation Monitoring
+
+```bash
+# Check rotation status
+python manage_secrets.py check-rotation
+
+# Set up cron job for automatic checks
+0 0 * * * /path/to/python /path/to/manage_secrets.py check-rotation
+```
+
+## Migration Guide
+
+### From Environment Variables
+
+```bash
+# Automatic migration
+python manage_secrets.py migrate
+
+# Manual migration
+export OLD_API_KEY="your-key"
+python manage_secrets.py set API_KEY --value "$OLD_API_KEY"
+unset OLD_API_KEY
+```
+
+### From .env Files
+
+```python
+# migrate_env.py
+from dotenv import dotenv_values
+from secrets_manager import get_secrets_manager
+
+env_values = dotenv_values('.env')
+manager = get_secrets_manager()
+
+for key, value in env_values.items():
+    if key.endswith('_KEY') or key.endswith('_TOKEN'):
+        manager.set(key, value, {'migrated_from': '.env'})
+```
+
+## API Reference
+
+### Python API
+
+```python
+from secrets_manager import get_secret, set_secret
+
+# Get a secret
+api_key = get_secret('TTS_API_KEY', default='')
+
+# Set a secret
+set_secret('NEW_API_KEY', 'value', metadata={'service': 'external'})
+
+# Advanced usage
+from secrets_manager import get_secrets_manager
+
+manager = get_secrets_manager()
+manager.rotate('API_KEY')
+manager.schedule_rotation('TOKEN', days=30)
+```
+
+### Flask CLI
+
+```bash
+# Via Flask CLI
+flask secrets-list
+flask secrets-set
+flask secrets-rotate
+flask secrets-check-rotation
+```
+
+## Security Considerations
+
+1. **Never log secret values**
+2. **Use secure random generation for new secrets**
+3. **Implement proper access controls**
+4. **Regular security audits**
+5. **Incident response plan for compromised secrets**
+
+## Future Enhancements
+
+- Integration with cloud KMS (AWS, Azure, GCP)
+- Hardware security module (HSM) support
+- Secret sharing (Shamir's Secret Sharing)
+- Time-based access controls
+- Automated compliance reporting
--- a/SECURITY.md
+++ b/SECURITY.md
@@ -0,0 +1,173 @@
+# Security Configuration Guide
+
+This document outlines security best practices for deploying Talk2Me.
+
+## Secrets Management
+
+Talk2Me includes a comprehensive secrets management system with encryption, rotation, and audit logging.
+
+### Quick Start
+
+```bash
+# Initialize secrets management
+python manage_secrets.py init
+
+# Set a secret
+python manage_secrets.py set TTS_API_KEY
+
+# List secrets
+python manage_secrets.py list
+
+# Rotate secrets
+python manage_secrets.py rotate ADMIN_TOKEN
+```
+
+See [SECRETS_MANAGEMENT.md](SECRETS_MANAGEMENT.md) for detailed documentation.
+
+## Environment Variables
+
+**NEVER commit sensitive information like API keys, passwords, or secrets to version control.**
+
+### Required Security Configuration
+
+1. **TTS_API_KEY**
+   - Required for TTS server authentication
+   - Set via environment variable: `export TTS_API_KEY="your-api-key"`
+   - Or use a `.env` file (see `.env.example`)
+
+2. **SECRET_KEY**
+   - Required for Flask session security
+   - Generate a secure key: `python -c "import secrets; print(secrets.token_hex(32))"`
+   - Set via: `export SECRET_KEY="your-generated-key"`
+
+3. **ADMIN_TOKEN**
+   - Required for admin endpoints
+   - Generate a secure token: `python -c "import secrets; print(secrets.token_urlsafe(32))"`
+   - Set via: `export ADMIN_TOKEN="your-admin-token"`
+
+### Using a .env File (Recommended)
+
+1. Copy the example file:
+   ```bash
+   cp .env.example .env
+   ```
+
+2. Edit `.env` with your actual values:
+   ```bash
+   nano .env  # or your preferred editor
+   ```
+
+3. Load environment variables:
+   ```bash
+   # Using python-dotenv (add to requirements.txt)
+   pip install python-dotenv
+   
+   # Or source manually
+   source .env
+   ```
+
+### Python-dotenv Integration
+
+To automatically load `.env` files, add this to the top of `app.py`:
+
+```python
+from dotenv import load_dotenv
+load_dotenv()  # Load .env file if it exists
+```
+
+### Production Deployment
+
+For production deployments:
+
+1. **Use a secrets management service**:
+   - AWS Secrets Manager
+   - HashiCorp Vault
+   - Azure Key Vault
+   - Google Secret Manager
+
+2. **Set environment variables securely**:
+   - Use your platform's environment configuration
+   - Never expose secrets in logs or error messages
+   - Rotate keys regularly
+
+3. **Additional security measures**:
+   - Use HTTPS only
+   - Enable CORS restrictions
+   - Implement rate limiting
+   - Monitor for suspicious activity
+
+### Docker Deployment
+
+When using Docker:
+
+```dockerfile
+# Use build arguments for non-sensitive config
+ARG TTS_SERVER_URL=http://localhost:5050/v1/audio/speech
+
+# Use runtime environment for secrets
+ENV TTS_API_KEY=""
+```
+
+Run with:
+```bash
+docker run -e TTS_API_KEY="your-key" -e SECRET_KEY="your-secret" talk2me
+```
+
+### Kubernetes Deployment
+
+Use Kubernetes secrets:
+
+```yaml
+apiVersion: v1
+kind: Secret
+metadata:
+  name: talk2me-secrets
+type: Opaque
+stringData:
+  tts-api-key: "your-api-key"
+  flask-secret-key: "your-secret-key"
+  admin-token: "your-admin-token"
+```
+
+### Rate Limiting
+
+Talk2Me implements comprehensive rate limiting to prevent abuse:
+
+1. **Per-Endpoint Limits**:
+   - Transcription: 10/min, 100/hour
+   - Translation: 20/min, 300/hour
+   - TTS: 15/min, 200/hour
+
+2. **Global Limits**:
+   - 1,000 requests/minute total
+   - 50 concurrent requests maximum
+
+3. **Automatic Protection**:
+   - IP blocking for excessive requests
+   - Request size validation
+   - Burst control
+
+See [RATE_LIMITING.md](RATE_LIMITING.md) for configuration details.
+
+### Security Checklist
+
+- [ ] All API keys removed from source code
+- [ ] Environment variables configured
+- [ ] `.env` file added to `.gitignore`
+- [ ] Secrets rotated after any potential exposure
+- [ ] HTTPS enabled in production
+- [ ] CORS properly configured
+- [ ] Rate limiting enabled and configured
+- [ ] Admin endpoints protected with authentication
+- [ ] Error messages don't expose sensitive info
+- [ ] Logs sanitized of sensitive data
+- [ ] Request size limits enforced
+- [ ] IP blocking configured for abuse prevention
+
+### Reporting Security Issues
+
+If you discover a security vulnerability, please report it to:
+- Create a private security advisory on GitHub
+- Or email: security@yourdomain.com
+
+Do not create public issues for security vulnerabilities.
--- a/SESSION_MANAGEMENT.md
+++ b/SESSION_MANAGEMENT.md
@@ -0,0 +1,366 @@
+# Session Management Documentation
+
+This document describes the session management system implemented in Talk2Me to prevent resource leaks from abandoned sessions.
+
+## Overview
+
+Talk2Me implements a comprehensive session management system that tracks user sessions and associated resources (audio files, temporary files, streams) to ensure proper cleanup and prevent resource exhaustion.
+
+## Features
+
+### 1. Automatic Resource Tracking
+
+All resources created during a user session are automatically tracked:
+- Audio files (uploads and generated)
+- Temporary files
+- Active streams
+- Resource metadata (size, creation time, purpose)
+
+### 2. Resource Limits
+
+Per-session limits prevent resource exhaustion:
+- Maximum resources per session: 100
+- Maximum storage per session: 100MB
+- Automatic cleanup of oldest resources when limits are reached
+
+### 3. Session Lifecycle Management
+
+Sessions are automatically managed:
+- Created on first request
+- Updated on each request
+- Cleaned up when idle (15 minutes)
+- Removed when expired (1 hour)
+
+### 4. Automatic Cleanup
+
+Background cleanup processes run automatically:
+- Idle session cleanup (every minute)
+- Expired session cleanup (every minute)
+- Orphaned file cleanup (every minute)
+
+## Configuration
+
+Session management can be configured via environment variables or Flask config:
+
+```python
+# app.py or config.py
+app.config.update({
+    'MAX_SESSION_DURATION': 3600,        # 1 hour
+    'MAX_SESSION_IDLE_TIME': 900,        # 15 minutes
+    'MAX_RESOURCES_PER_SESSION': 100,
+    'MAX_BYTES_PER_SESSION': 104857600,  # 100MB
+    'SESSION_CLEANUP_INTERVAL': 60,      # 1 minute
+    'SESSION_STORAGE_PATH': '/path/to/sessions'
+})
+```
+
+## API Endpoints
+
+### Admin Endpoints
+
+All admin endpoints require authentication via `X-Admin-Token` header.
+
+#### GET /admin/sessions
+Get information about all active sessions.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/sessions
+```
+
+Response:
+```json
+{
+  "sessions": [
+    {
+      "session_id": "uuid",
+      "user_id": null,
+      "ip_address": "192.168.1.1",
+      "created_at": "2024-01-15T10:00:00",
+      "last_activity": "2024-01-15T10:05:00",
+      "duration_seconds": 300,
+      "idle_seconds": 0,
+      "request_count": 5,
+      "resource_count": 3,
+      "total_bytes_used": 1048576,
+      "resources": [...]
+    }
+  ],
+  "stats": {
+    "total_sessions_created": 100,
+    "total_sessions_cleaned": 50,
+    "active_sessions": 5,
+    "avg_session_duration": 600,
+    "avg_resources_per_session": 4.2
+  }
+}
+```
+
+#### GET /admin/sessions/{session_id}
+Get detailed information about a specific session.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/sessions/abc123
+```
+
+#### POST /admin/sessions/{session_id}/cleanup
+Manually cleanup a specific session.
+
+```bash
+curl -X POST -H "X-Admin-Token: your-token" \
+  http://localhost:5005/admin/sessions/abc123/cleanup
+```
+
+#### GET /admin/sessions/metrics
+Get session management metrics for monitoring.
+
+```bash
+curl -H "X-Admin-Token: your-token" http://localhost:5005/admin/sessions/metrics
+```
+
+Response:
+```json
+{
+  "sessions": {
+    "active": 5,
+    "total_created": 100,
+    "total_cleaned": 95
+  },
+  "resources": {
+    "active": 20,
+    "total_cleaned": 380,
+    "active_bytes": 10485760,
+    "total_bytes_cleaned": 1073741824
+  },
+  "limits": {
+    "max_session_duration": 3600,
+    "max_idle_time": 900,
+    "max_resources_per_session": 100,
+    "max_bytes_per_session": 104857600
+  }
+}
+```
+
+## CLI Commands
+
+Session management can be controlled via Flask CLI commands:
+
+```bash
+# List all active sessions
+flask sessions-list
+
+# Manual cleanup
+flask sessions-cleanup
+
+# Show statistics
+flask sessions-stats
+```
+
+## Usage Examples
+
+### 1. Monitor Active Sessions
+
+```python
+import requests
+
+headers = {'X-Admin-Token': 'your-admin-token'}
+response = requests.get('http://localhost:5005/admin/sessions', headers=headers)
+sessions = response.json()
+
+for session in sessions['sessions']:
+    print(f"Session {session['session_id']}:")
+    print(f"  IP: {session['ip_address']}")
+    print(f"  Resources: {session['resource_count']}")
+    print(f"  Storage: {session['total_bytes_used'] / 1024 / 1024:.2f} MB")
+```
+
+### 2. Cleanup Idle Sessions
+
+```python
+# Get all sessions
+response = requests.get('http://localhost:5005/admin/sessions', headers=headers)
+sessions = response.json()['sessions']
+
+# Find idle sessions
+idle_threshold = 300  # 5 minutes
+for session in sessions:
+    if session['idle_seconds'] > idle_threshold:
+        # Cleanup idle session
+        cleanup_url = f'http://localhost:5005/admin/sessions/{session["session_id"]}/cleanup'
+        requests.post(cleanup_url, headers=headers)
+        print(f"Cleaned up idle session {session['session_id']}")
+```
+
+### 3. Monitor Resource Usage
+
+```python
+# Get metrics
+response = requests.get('http://localhost:5005/admin/sessions/metrics', headers=headers)
+metrics = response.json()
+
+print(f"Active sessions: {metrics['sessions']['active']}")
+print(f"Active resources: {metrics['resources']['active']}")
+print(f"Storage used: {metrics['resources']['active_bytes'] / 1024 / 1024:.2f} MB")
+print(f"Total cleaned: {metrics['resources']['total_bytes_cleaned'] / 1024 / 1024 / 1024:.2f} GB")
+```
+
+## Resource Types
+
+The session manager tracks different types of resources:
+
+### 1. Audio Files
+- Uploaded audio files for transcription
+- Generated audio files from TTS
+- Automatically cleaned up after session ends
+
+### 2. Temporary Files
+- Processing intermediates
+- Cache files
+- Automatically cleaned up after use
+
+### 3. Streams
+- WebSocket connections
+- Server-sent event streams
+- Closed when session ends
+
+## Best Practices
+
+### 1. Session Configuration
+
+```python
+# Development
+app.config.update({
+    'MAX_SESSION_DURATION': 7200,        # 2 hours
+    'MAX_SESSION_IDLE_TIME': 1800,       # 30 minutes
+    'MAX_RESOURCES_PER_SESSION': 200,
+    'MAX_BYTES_PER_SESSION': 209715200   # 200MB
+})
+
+# Production
+app.config.update({
+    'MAX_SESSION_DURATION': 3600,        # 1 hour
+    'MAX_SESSION_IDLE_TIME': 900,        # 15 minutes
+    'MAX_RESOURCES_PER_SESSION': 100,
+    'MAX_BYTES_PER_SESSION': 104857600   # 100MB
+})
+```
+
+### 2. Monitoring
+
+Set up monitoring for:
+- Number of active sessions
+- Resource usage per session
+- Cleanup frequency
+- Failed cleanup attempts
+
+### 3. Alerting
+
+Configure alerts for:
+- High number of active sessions (>1000)
+- High resource usage (>80% of limits)
+- Failed cleanup operations
+- Orphaned files detected
+
+## Troubleshooting
+
+### Common Issues
+
+#### 1. Sessions Not Being Cleaned Up
+
+Check cleanup thread status:
+```bash
+flask sessions-stats
+```
+
+Manual cleanup:
+```bash
+flask sessions-cleanup
+```
+
+#### 2. Resource Limits Reached
+
+Check session details:
+```bash
+curl -H "X-Admin-Token: token" http://localhost:5005/admin/sessions/SESSION_ID
+```
+
+Increase limits if needed:
+```python
+app.config['MAX_RESOURCES_PER_SESSION'] = 200
+app.config['MAX_BYTES_PER_SESSION'] = 209715200  # 200MB
+```
+
+#### 3. Orphaned Files
+
+Check for orphaned files:
+```bash
+ls -la /path/to/session/storage/
+```
+
+Clean orphaned files:
+```bash
+flask sessions-cleanup
+```
+
+### Debug Logging
+
+Enable debug logging for session management:
+
+```python
+import logging
+
+# Enable session manager debug logs
+logging.getLogger('session_manager').setLevel(logging.DEBUG)
+```
+
+## Security Considerations
+
+1. **Session Hijacking**: Sessions are tied to IP addresses and user agents
+2. **Resource Exhaustion**: Strict per-session limits prevent DoS attacks
+3. **File System Access**: Session storage uses secure paths and permissions
+4. **Admin Access**: All admin endpoints require authentication
+
+## Performance Impact
+
+The session management system has minimal performance impact:
+- Memory: ~1KB per session + resource metadata
+- CPU: Background cleanup runs every minute
+- Disk I/O: Cleanup operations are batched
+- Network: No external dependencies
+
+## Integration with Other Systems
+
+### Rate Limiting
+
+Session management integrates with rate limiting:
+```python
+# Sessions are automatically tracked per IP
+# Rate limits apply per session
+```
+
+### Secrets Management
+
+Session tokens can be encrypted:
+```python
+from secrets_manager import encrypt_value
+encrypted_session = encrypt_value(session_id)
+```
+
+### Monitoring
+
+Export metrics to monitoring systems:
+```python
+# Prometheus format
+@app.route('/metrics')
+def prometheus_metrics():
+    metrics = app.session_manager.export_metrics()
+    # Format as Prometheus metrics
+    return format_prometheus(metrics)
+```
+
+## Future Enhancements
+
+1. **Session Persistence**: Store sessions in Redis/database
+2. **Distributed Sessions**: Support for multi-server deployments
+3. **Session Analytics**: Track usage patterns and trends
+4. **Resource Quotas**: Per-user resource quotas
+5. **Session Replay**: Debug issues by replaying sessions
--- a/app.py
+++ b/app.py
--- a/config.py
+++ b/config.py
@@ -0,0 +1,203 @@
+# Configuration management with secrets integration
+import os
+import logging
+from datetime import timedelta
+from secrets_manager import get_secret, get_secrets_manager
+
+logger = logging.getLogger(__name__)
+
+class Config:
+    """Base configuration with secrets management"""
+    
+    def __init__(self):
+        self.secrets_manager = get_secrets_manager()
+        self._load_config()
+    
+    def _load_config(self):
+        """Load configuration from environment and secrets"""
+        # Flask configuration
+        self.SECRET_KEY = self._get_secret('FLASK_SECRET_KEY', 
+                                          os.environ.get('SECRET_KEY', 'dev-key-change-this'))
+        
+        # Security
+        self.SESSION_COOKIE_SECURE = self._get_bool('SESSION_COOKIE_SECURE', True)
+        self.SESSION_COOKIE_HTTPONLY = True
+        self.SESSION_COOKIE_SAMESITE = 'Lax'
+        self.PERMANENT_SESSION_LIFETIME = timedelta(hours=24)
+        
+        # TTS Configuration
+        self.TTS_SERVER_URL = os.environ.get('TTS_SERVER_URL', 'http://localhost:5050/v1/audio/speech')
+        self.TTS_API_KEY = self._get_secret('TTS_API_KEY', os.environ.get('TTS_API_KEY', ''))
+        
+        # Upload configuration
+        self.UPLOAD_FOLDER = os.environ.get('UPLOAD_FOLDER', None)
+        
+        # Request size limits (in bytes)
+        self.MAX_CONTENT_LENGTH = int(os.environ.get('MAX_CONTENT_LENGTH', 50 * 1024 * 1024))  # 50MB
+        self.MAX_AUDIO_SIZE = int(os.environ.get('MAX_AUDIO_SIZE', 25 * 1024 * 1024))          # 25MB
+        self.MAX_JSON_SIZE = int(os.environ.get('MAX_JSON_SIZE', 1 * 1024 * 1024))             # 1MB
+        self.MAX_IMAGE_SIZE = int(os.environ.get('MAX_IMAGE_SIZE', 10 * 1024 * 1024))          # 10MB
+        
+        # CORS configuration
+        self.CORS_ORIGINS = os.environ.get('CORS_ORIGINS', '*').split(',')
+        self.ADMIN_CORS_ORIGINS = os.environ.get('ADMIN_CORS_ORIGINS', 'http://localhost:*').split(',')
+        
+        # Admin configuration
+        self.ADMIN_TOKEN = self._get_secret('ADMIN_TOKEN', 
+                                           os.environ.get('ADMIN_TOKEN', 'default-admin-token'))
+        
+        # Database configuration (for future use)
+        self.DATABASE_URL = self._get_secret('DATABASE_URL', 
+                                            os.environ.get('DATABASE_URL', 'sqlite:///talk2me.db'))
+        
+        # Redis configuration (for future use)
+        self.REDIS_URL = self._get_secret('REDIS_URL', 
+                                         os.environ.get('REDIS_URL', 'redis://localhost:6379/0'))
+        
+        # Whisper configuration
+        self.WHISPER_MODEL_SIZE = os.environ.get('WHISPER_MODEL_SIZE', 'base')
+        self.WHISPER_DEVICE = os.environ.get('WHISPER_DEVICE', 'auto')
+        
+        # Ollama configuration
+        self.OLLAMA_HOST = os.environ.get('OLLAMA_HOST', 'http://localhost:11434')
+        self.OLLAMA_MODEL = os.environ.get('OLLAMA_MODEL', 'gemma3:27b')
+        
+        # Rate limiting configuration
+        self.RATE_LIMIT_ENABLED = self._get_bool('RATE_LIMIT_ENABLED', True)
+        self.RATE_LIMIT_STORAGE_URL = self._get_secret('RATE_LIMIT_STORAGE_URL', 
+                                                       os.environ.get('RATE_LIMIT_STORAGE_URL', 'memory://'))
+        
+        # Logging configuration
+        self.LOG_LEVEL = os.environ.get('LOG_LEVEL', 'INFO')
+        self.LOG_FILE = os.environ.get('LOG_FILE', 'talk2me.log')
+        
+        # Feature flags
+        self.ENABLE_PUSH_NOTIFICATIONS = self._get_bool('ENABLE_PUSH_NOTIFICATIONS', True)
+        self.ENABLE_OFFLINE_MODE = self._get_bool('ENABLE_OFFLINE_MODE', True)
+        self.ENABLE_STREAMING = self._get_bool('ENABLE_STREAMING', True)
+        self.ENABLE_MULTI_SPEAKER = self._get_bool('ENABLE_MULTI_SPEAKER', True)
+        
+        # Performance tuning
+        self.WORKER_CONNECTIONS = int(os.environ.get('WORKER_CONNECTIONS', '1000'))
+        self.WORKER_TIMEOUT = int(os.environ.get('WORKER_TIMEOUT', '120'))
+        
+        # Validate configuration
+        self._validate_config()
+    
+    def _get_secret(self, key: str, default: str = None) -> str:
+        """Get secret from secrets manager or environment"""
+        value = self.secrets_manager.get(key)
+        if value is None:
+            value = default
+        
+        if value is None:
+            logger.warning(f"Configuration {key} not set")
+        
+        return value
+    
+    def _get_bool(self, key: str, default: bool = False) -> bool:
+        """Get boolean configuration value"""
+        value = os.environ.get(key, '').lower()
+        if value in ('true', '1', 'yes', 'on'):
+            return True
+        elif value in ('false', '0', 'no', 'off'):
+            return False
+        return default
+    
+    def _validate_config(self):
+        """Validate configuration values"""
+        # Check required secrets
+        if not self.SECRET_KEY or self.SECRET_KEY == 'dev-key-change-this':
+            logger.warning("Using default SECRET_KEY - this is insecure for production!")
+        
+        if not self.TTS_API_KEY:
+            logger.warning("TTS_API_KEY not configured - TTS functionality may not work")
+        
+        if self.ADMIN_TOKEN == 'default-admin-token':
+            logger.warning("Using default ADMIN_TOKEN - this is insecure for production!")
+        
+        # Validate URLs
+        if not self._is_valid_url(self.TTS_SERVER_URL):
+            logger.error(f"Invalid TTS_SERVER_URL: {self.TTS_SERVER_URL}")
+        
+        # Check file permissions
+        if self.UPLOAD_FOLDER and not os.access(self.UPLOAD_FOLDER, os.W_OK):
+            logger.warning(f"Upload folder {self.UPLOAD_FOLDER} is not writable")
+    
+    def _is_valid_url(self, url: str) -> bool:
+        """Check if URL is valid"""
+        return url.startswith(('http://', 'https://'))
+    
+    def to_dict(self) -> dict:
+        """Export configuration as dictionary (excluding secrets)"""
+        config = {}
+        for key in dir(self):
+            if key.isupper() and not key.startswith('_'):
+                value = getattr(self, key)
+                # Mask sensitive values
+                if any(sensitive in key for sensitive in ['KEY', 'TOKEN', 'PASSWORD', 'SECRET']):
+                    config[key] = '***MASKED***'
+                else:
+                    config[key] = value
+        return config
+
+class DevelopmentConfig(Config):
+    """Development configuration"""
+    def _load_config(self):
+        super()._load_config()
+        self.DEBUG = True
+        self.TESTING = False
+        self.SESSION_COOKIE_SECURE = False  # Allow HTTP in development
+
+class ProductionConfig(Config):
+    """Production configuration"""
+    def _load_config(self):
+        super()._load_config()
+        self.DEBUG = False
+        self.TESTING = False
+        
+        # Enforce security in production
+        if not self.SECRET_KEY or self.SECRET_KEY == 'dev-key-change-this':
+            raise ValueError("SECRET_KEY must be set in production")
+        
+        if self.ADMIN_TOKEN == 'default-admin-token':
+            raise ValueError("ADMIN_TOKEN must be changed in production")
+
+class TestingConfig(Config):
+    """Testing configuration"""
+    def _load_config(self):
+        super()._load_config()
+        self.DEBUG = True
+        self.TESTING = True
+        self.WTF_CSRF_ENABLED = False
+        self.RATE_LIMIT_ENABLED = False
+
+# Configuration factory
+def get_config(env: str = None) -> Config:
+    """Get configuration based on environment"""
+    if env is None:
+        env = os.environ.get('FLASK_ENV', 'development')
+    
+    configs = {
+        'development': DevelopmentConfig,
+        'production': ProductionConfig,
+        'testing': TestingConfig
+    }
+    
+    config_class = configs.get(env, DevelopmentConfig)
+    return config_class()
+
+# Convenience function for Flask app
+def init_app(app):
+    """Initialize Flask app with configuration"""
+    config = get_config()
+    
+    # Apply configuration to app
+    for key in dir(config):
+        if key.isupper():
+            app.config[key] = getattr(config, key)
+    
+    # Store config object
+    app.app_config = config
+    
+    logger.info(f"Configuration loaded for environment: {os.environ.get('FLASK_ENV', 'development')}")
--- a/deploy.sh
+++ b/deploy.sh
@@ -0,0 +1,208 @@
+#!/bin/bash
+# Production deployment script for Talk2Me
+
+set -e  # Exit on error
+
+# Colors for output
+RED='\033[0;31m'
+GREEN='\033[0;32m'
+YELLOW='\033[1;33m'
+NC='\033[0m' # No Color
+
+# Configuration
+APP_NAME="talk2me"
+APP_USER="talk2me"
+APP_DIR="/opt/talk2me"
+VENV_DIR="$APP_DIR/venv"
+LOG_DIR="/var/log/talk2me"
+PID_FILE="/var/run/talk2me.pid"
+WORKERS=${WORKERS:-4}
+
+# Functions
+print_status() {
+    echo -e "${GREEN}[INFO]${NC} $1"
+}
+
+print_error() {
+    echo -e "${RED}[ERROR]${NC} $1"
+}
+
+print_warning() {
+    echo -e "${YELLOW}[WARNING]${NC} $1"
+}
+
+# Check if running as root
+if [[ $EUID -ne 0 ]]; then
+   print_error "This script must be run as root"
+   exit 1
+fi
+
+# Create application user if doesn't exist
+if ! id "$APP_USER" &>/dev/null; then
+    print_status "Creating application user: $APP_USER"
+    useradd -m -s /bin/bash $APP_USER
+fi
+
+# Create directories
+print_status "Creating application directories"
+mkdir -p $APP_DIR $LOG_DIR
+chown -R $APP_USER:$APP_USER $APP_DIR $LOG_DIR
+
+# Copy application files
+print_status "Copying application files"
+rsync -av --exclude='venv' --exclude='__pycache__' --exclude='*.pyc' \
+      --exclude='logs' --exclude='.git' --exclude='node_modules' \
+      ./ $APP_DIR/
+
+# Create virtual environment
+print_status "Setting up Python virtual environment"
+su - $APP_USER -c "cd $APP_DIR && python3 -m venv $VENV_DIR"
+
+# Install dependencies
+print_status "Installing Python dependencies"
+su - $APP_USER -c "cd $APP_DIR && $VENV_DIR/bin/pip install --upgrade pip"
+su - $APP_USER -c "cd $APP_DIR && $VENV_DIR/bin/pip install -r requirements-prod.txt"
+
+# Install Whisper model
+print_status "Downloading Whisper model (this may take a while)"
+su - $APP_USER -c "cd $APP_DIR && $VENV_DIR/bin/python -c 'import whisper; whisper.load_model(\"base\")'"
+
+# Build frontend assets
+if [ -f "package.json" ]; then
+    print_status "Building frontend assets"
+    cd $APP_DIR
+    npm install
+    npm run build
+fi
+
+# Create systemd service
+print_status "Creating systemd service"
+cat > /etc/systemd/system/talk2me.service <<EOF
+[Unit]
+Description=Talk2Me Translation Service
+After=network.target
+
+[Service]
+Type=notify
+User=$APP_USER
+Group=$APP_USER
+WorkingDirectory=$APP_DIR
+Environment="PATH=$VENV_DIR/bin"
+Environment="FLASK_ENV=production"
+Environment="UPLOAD_FOLDER=/tmp/talk2me_uploads"
+Environment="LOGS_DIR=$LOG_DIR"
+ExecStart=$VENV_DIR/bin/gunicorn --config gunicorn_config.py wsgi:application
+ExecReload=/bin/kill -s HUP \$MAINPID
+KillMode=mixed
+TimeoutStopSec=5
+Restart=always
+RestartSec=10
+
+# Security settings
+NoNewPrivileges=true
+PrivateTmp=true
+ProtectSystem=strict
+ProtectHome=true
+ReadWritePaths=$LOG_DIR /tmp
+
+[Install]
+WantedBy=multi-user.target
+EOF
+
+# Create nginx configuration
+print_status "Creating nginx configuration"
+cat > /etc/nginx/sites-available/talk2me <<EOF
+server {
+    listen 80;
+    server_name _;  # Replace with your domain
+
+    # Security headers
+    add_header X-Content-Type-Options nosniff;
+    add_header X-Frame-Options DENY;
+    add_header X-XSS-Protection "1; mode=block";
+    add_header Referrer-Policy "strict-origin-when-cross-origin";
+
+    # File upload size limit
+    client_max_body_size 50M;
+    client_body_buffer_size 1M;
+
+    # Timeouts for long audio processing
+    proxy_connect_timeout 120s;
+    proxy_send_timeout 120s;
+    proxy_read_timeout 120s;
+
+    location / {
+        proxy_pass http://127.0.0.1:5005;
+        proxy_http_version 1.1;
+        proxy_set_header Upgrade \$http_upgrade;
+        proxy_set_header Connection 'upgrade';
+        proxy_set_header Host \$host;
+        proxy_set_header X-Real-IP \$remote_addr;
+        proxy_set_header X-Forwarded-For \$proxy_add_x_forwarded_for;
+        proxy_set_header X-Forwarded-Proto \$scheme;
+        proxy_cache_bypass \$http_upgrade;
+        
+        # Don't buffer responses
+        proxy_buffering off;
+        
+        # WebSocket support
+        proxy_set_header Connection "upgrade";
+    }
+
+    location /static {
+        alias $APP_DIR/static;
+        expires 1y;
+        add_header Cache-Control "public, immutable";
+    }
+
+    # Health check endpoint
+    location /health {
+        proxy_pass http://127.0.0.1:5005/health;
+        access_log off;
+    }
+}
+EOF
+
+# Enable nginx site
+if [ -f /etc/nginx/sites-enabled/default ]; then
+    rm /etc/nginx/sites-enabled/default
+fi
+ln -sf /etc/nginx/sites-available/talk2me /etc/nginx/sites-enabled/
+
+# Set permissions
+chown -R $APP_USER:$APP_USER $APP_DIR
+
+# Reload systemd
+print_status "Reloading systemd"
+systemctl daemon-reload
+
+# Start services
+print_status "Starting services"
+systemctl enable talk2me
+systemctl restart talk2me
+systemctl restart nginx
+
+# Wait for service to start
+sleep 5
+
+# Check service status
+if systemctl is-active --quiet talk2me; then
+    print_status "Talk2Me service is running"
+else
+    print_error "Talk2Me service failed to start"
+    journalctl -u talk2me -n 50
+    exit 1
+fi
+
+# Test health endpoint
+if curl -s http://localhost:5005/health | grep -q "healthy"; then
+    print_status "Health check passed"
+else
+    print_error "Health check failed"
+    exit 1
+fi
+
+print_status "Deployment complete!"
+print_status "Talk2Me is now running at http://$(hostname -I | awk '{print $1}')"
+print_status "Check logs at: $LOG_DIR"
+print_status "Service status: systemctl status talk2me"
--- a/docker-compose.yml
+++ b/docker-compose.yml
@@ -0,0 +1,92 @@
+version: '3.8'
+
+services:
+  talk2me:
+    build: .
+    container_name: talk2me
+    restart: unless-stopped
+    ports:
+      - "5005:5005"
+    environment:
+      - FLASK_ENV=production
+      - UPLOAD_FOLDER=/tmp/talk2me_uploads
+      - LOGS_DIR=/app/logs
+      - TTS_SERVER_URL=${TTS_SERVER_URL:-http://localhost:5050/v1/audio/speech}
+      - TTS_API_KEY=${TTS_API_KEY}
+      - ADMIN_TOKEN=${ADMIN_TOKEN:-change-me-in-production}
+      - SECRET_KEY=${SECRET_KEY:-change-me-in-production}
+      - GUNICORN_WORKERS=${GUNICORN_WORKERS:-4}
+      - GUNICORN_THREADS=${GUNICORN_THREADS:-2}
+      - MEMORY_THRESHOLD_MB=${MEMORY_THRESHOLD_MB:-4096}
+      - GPU_MEMORY_THRESHOLD_MB=${GPU_MEMORY_THRESHOLD_MB:-2048}
+    volumes:
+      - ./logs:/app/logs
+      - talk2me_uploads:/tmp/talk2me_uploads
+      - talk2me_models:/root/.cache/whisper  # Whisper models cache
+    deploy:
+      resources:
+        limits:
+          memory: 4G
+        reservations:
+          memory: 2G
+    healthcheck:
+      test: ["CMD", "curl", "-f", "http://localhost:5005/health"]
+      interval: 30s
+      timeout: 10s
+      retries: 3
+      start_period: 40s
+    networks:
+      - talk2me_network
+
+  # Nginx reverse proxy (optional, for production)
+  nginx:
+    image: nginx:alpine
+    container_name: talk2me_nginx
+    restart: unless-stopped
+    ports:
+      - "80:80"
+      - "443:443"
+    volumes:
+      - ./nginx.conf:/etc/nginx/conf.d/default.conf:ro
+      - ./static:/app/static:ro
+      - nginx_ssl:/etc/nginx/ssl
+    depends_on:
+      - talk2me
+    networks:
+      - talk2me_network
+
+  # Redis for session storage (optional)
+  redis:
+    image: redis:7-alpine
+    container_name: talk2me_redis
+    restart: unless-stopped
+    command: redis-server --maxmemory 256mb --maxmemory-policy allkeys-lru
+    volumes:
+      - redis_data:/data
+    networks:
+      - talk2me_network
+
+  # PostgreSQL for persistent storage (optional)
+  postgres:
+    image: postgres:15-alpine
+    container_name: talk2me_postgres
+    restart: unless-stopped
+    environment:
+      - POSTGRES_DB=talk2me
+      - POSTGRES_USER=talk2me
+      - POSTGRES_PASSWORD=${POSTGRES_PASSWORD:-change-me-in-production}
+    volumes:
+      - postgres_data:/var/lib/postgresql/data
+    networks:
+      - talk2me_network
+
+volumes:
+  talk2me_uploads:
+  talk2me_models:
+  redis_data:
+  postgres_data:
+  nginx_ssl:
+
+networks:
+  talk2me_network:
+    driver: bridge
--- a/error_logger.py
+++ b/error_logger.py
@@ -0,0 +1,564 @@
+# Comprehensive error logging system for production debugging
+import logging
+import logging.handlers
+import os
+import sys
+import json
+import traceback
+import time
+from datetime import datetime
+from typing import Dict, Any, Optional, Union
+from functools import wraps
+import socket
+import threading
+from flask import request, g
+from contextlib import contextmanager
+import hashlib
+
+# Create logs directory if it doesn't exist
+LOGS_DIR = os.environ.get('LOGS_DIR', 'logs')
+os.makedirs(LOGS_DIR, exist_ok=True)
+
+class StructuredFormatter(logging.Formatter):
+    """
+    Custom formatter that outputs structured JSON logs
+    """
+    def __init__(self, app_name='talk2me', environment='development'):
+        super().__init__()
+        self.app_name = app_name
+        self.environment = environment
+        self.hostname = socket.gethostname()
+        
+    def format(self, record):
+        # Base log structure
+        log_data = {
+            'timestamp': datetime.utcnow().isoformat() + 'Z',
+            'level': record.levelname,
+            'logger': record.name,
+            'message': record.getMessage(),
+            'app': self.app_name,
+            'environment': self.environment,
+            'hostname': self.hostname,
+            'thread': threading.current_thread().name,
+            'process': os.getpid()
+        }
+        
+        # Add exception info if present
+        if record.exc_info:
+            log_data['exception'] = {
+                'type': record.exc_info[0].__name__,
+                'message': str(record.exc_info[1]),
+                'traceback': traceback.format_exception(*record.exc_info)
+            }
+        
+        # Add extra fields
+        if hasattr(record, 'extra_fields'):
+            log_data.update(record.extra_fields)
+        
+        # Add Flask request context if available
+        if hasattr(record, 'request_id'):
+            log_data['request_id'] = record.request_id
+            
+        if hasattr(record, 'user_id'):
+            log_data['user_id'] = record.user_id
+            
+        if hasattr(record, 'session_id'):
+            log_data['session_id'] = record.session_id
+        
+        # Add performance metrics if available
+        if hasattr(record, 'duration'):
+            log_data['duration_ms'] = record.duration
+            
+        if hasattr(record, 'memory_usage'):
+            log_data['memory_usage_mb'] = record.memory_usage
+        
+        return json.dumps(log_data, default=str)
+
+class ErrorLogger:
+    """
+    Comprehensive error logging system with multiple handlers and features
+    """
+    def __init__(self, app=None, config=None):
+        self.config = config or {}
+        self.loggers = {}
+        self.error_counts = {}
+        self.error_signatures = {}
+        
+        if app:
+            self.init_app(app)
+    
+    def init_app(self, app):
+        """Initialize error logging for Flask app"""
+        self.app = app
+        
+        # Get configuration
+        self.log_level = self.config.get('log_level', 
+                                         app.config.get('LOG_LEVEL', 'INFO'))
+        self.log_file = self.config.get('log_file',
+                                        app.config.get('LOG_FILE', 
+                                                      os.path.join(LOGS_DIR, 'talk2me.log')))
+        self.error_log_file = self.config.get('error_log_file',
+                                             os.path.join(LOGS_DIR, 'errors.log'))
+        self.max_bytes = self.config.get('max_bytes', 50 * 1024 * 1024)  # 50MB
+        self.backup_count = self.config.get('backup_count', 10)
+        self.environment = app.config.get('FLASK_ENV', 'development')
+        
+        # Set up loggers
+        self._setup_app_logger()
+        self._setup_error_logger()
+        self._setup_access_logger()
+        self._setup_security_logger()
+        self._setup_performance_logger()
+        
+        # Add Flask error handlers
+        self._setup_flask_handlers(app)
+        
+        # Add request logging
+        app.before_request(self._before_request)
+        app.after_request(self._after_request)
+        
+        # Store logger in app
+        app.error_logger = self
+        
+        logging.info("Error logging system initialized")
+    
+    def _setup_app_logger(self):
+        """Set up main application logger"""
+        app_logger = logging.getLogger('talk2me')
+        app_logger.setLevel(getattr(logging, self.log_level.upper()))
+        
+        # Remove existing handlers
+        app_logger.handlers = []
+        
+        # Console handler with color support
+        console_handler = logging.StreamHandler(sys.stdout)
+        if sys.stdout.isatty():
+            # Use colored output for terminals
+            from colorlog import ColoredFormatter
+            console_formatter = ColoredFormatter(
+                '%(log_color)s%(asctime)s - %(name)s - %(levelname)s - %(message)s',
+                log_colors={
+                    'DEBUG': 'cyan',
+                    'INFO': 'green',
+                    'WARNING': 'yellow',
+                    'ERROR': 'red',
+                    'CRITICAL': 'red,bg_white',
+                }
+            )
+            console_handler.setFormatter(console_formatter)
+        else:
+            console_handler.setFormatter(
+                StructuredFormatter('talk2me', self.environment)
+            )
+        app_logger.addHandler(console_handler)
+        
+        # Rotating file handler
+        file_handler = logging.handlers.RotatingFileHandler(
+            self.log_file,
+            maxBytes=self.max_bytes,
+            backupCount=self.backup_count
+        )
+        file_handler.setFormatter(
+            StructuredFormatter('talk2me', self.environment)
+        )
+        app_logger.addHandler(file_handler)
+        
+        self.loggers['app'] = app_logger
+    
+    def _setup_error_logger(self):
+        """Set up dedicated error logger"""
+        error_logger = logging.getLogger('talk2me.errors')
+        error_logger.setLevel(logging.ERROR)
+        
+        # Error file handler
+        error_handler = logging.handlers.RotatingFileHandler(
+            self.error_log_file,
+            maxBytes=self.max_bytes,
+            backupCount=self.backup_count
+        )
+        error_handler.setFormatter(
+            StructuredFormatter('talk2me', self.environment)
+        )
+        error_logger.addHandler(error_handler)
+        
+        # Also send errors to syslog if available
+        try:
+            syslog_handler = logging.handlers.SysLogHandler(
+                address='/dev/log' if os.path.exists('/dev/log') else ('localhost', 514)
+            )
+            syslog_handler.setFormatter(
+                logging.Formatter('talk2me[%(process)d]: %(levelname)s %(message)s')
+            )
+            error_logger.addHandler(syslog_handler)
+        except Exception:
+            pass  # Syslog not available
+        
+        self.loggers['error'] = error_logger
+    
+    def _setup_access_logger(self):
+        """Set up access logger for HTTP requests"""
+        access_logger = logging.getLogger('talk2me.access')
+        access_logger.setLevel(logging.INFO)
+        
+        # Access log file
+        access_handler = logging.handlers.TimedRotatingFileHandler(
+            os.path.join(LOGS_DIR, 'access.log'),
+            when='midnight',
+            interval=1,
+            backupCount=30
+        )
+        access_handler.setFormatter(
+            StructuredFormatter('talk2me', self.environment)
+        )
+        access_logger.addHandler(access_handler)
+        
+        self.loggers['access'] = access_logger
+    
+    def _setup_security_logger(self):
+        """Set up security event logger"""
+        security_logger = logging.getLogger('talk2me.security')
+        security_logger.setLevel(logging.WARNING)
+        
+        # Security log file
+        security_handler = logging.handlers.RotatingFileHandler(
+            os.path.join(LOGS_DIR, 'security.log'),
+            maxBytes=self.max_bytes,
+            backupCount=self.backup_count
+        )
+        security_handler.setFormatter(
+            StructuredFormatter('talk2me', self.environment)
+        )
+        security_logger.addHandler(security_handler)
+        
+        self.loggers['security'] = security_logger
+    
+    def _setup_performance_logger(self):
+        """Set up performance metrics logger"""
+        perf_logger = logging.getLogger('talk2me.performance')
+        perf_logger.setLevel(logging.INFO)
+        
+        # Performance log file
+        perf_handler = logging.handlers.TimedRotatingFileHandler(
+            os.path.join(LOGS_DIR, 'performance.log'),
+            when='H',  # Hourly rotation
+            interval=1,
+            backupCount=168  # 7 days
+        )
+        perf_handler.setFormatter(
+            StructuredFormatter('talk2me', self.environment)
+        )
+        perf_logger.addHandler(perf_handler)
+        
+        self.loggers['performance'] = perf_logger
+    
+    def _setup_flask_handlers(self, app):
+        """Set up Flask error handlers"""
+        @app.errorhandler(Exception)
+        def handle_exception(error):
+            # Get request ID
+            request_id = getattr(g, 'request_id', 'unknown')
+            
+            # Create error signature for deduplication
+            error_signature = self._get_error_signature(error)
+            
+            # Log the error
+            self.log_error(
+                error,
+                request_id=request_id,
+                endpoint=request.endpoint,
+                method=request.method,
+                path=request.path,
+                ip=request.remote_addr,
+                user_agent=request.headers.get('User-Agent'),
+                signature=error_signature
+            )
+            
+            # Track error frequency
+            self._track_error(error_signature)
+            
+            # Return appropriate response
+            if hasattr(error, 'code'):
+                return {'error': str(error)}, error.code
+            else:
+                return {'error': 'Internal server error'}, 500
+    
+    def _before_request(self):
+        """Log request start"""
+        # Generate request ID
+        g.request_id = self._generate_request_id()
+        g.request_start_time = time.time()
+        
+        # Log access
+        self.log_access(
+            'request_start',
+            request_id=g.request_id,
+            method=request.method,
+            path=request.path,
+            ip=request.remote_addr,
+            user_agent=request.headers.get('User-Agent')
+        )
+    
+    def _after_request(self, response):
+        """Log request completion"""
+        # Calculate duration
+        duration = None
+        if hasattr(g, 'request_start_time'):
+            duration = int((time.time() - g.request_start_time) * 1000)
+        
+        # Log access
+        self.log_access(
+            'request_complete',
+            request_id=getattr(g, 'request_id', 'unknown'),
+            method=request.method,
+            path=request.path,
+            status=response.status_code,
+            duration_ms=duration,
+            content_length=response.content_length
+        )
+        
+        # Log performance metrics for slow requests
+        if duration and duration > 1000:  # Over 1 second
+            self.log_performance(
+                'slow_request',
+                request_id=getattr(g, 'request_id', 'unknown'),
+                endpoint=request.endpoint,
+                duration_ms=duration
+            )
+        
+        return response
+    
+    def log_error(self, error: Exception, **kwargs):
+        """Log an error with context"""
+        error_logger = self.loggers.get('error', logging.getLogger())
+        
+        # Create log record with extra fields
+        extra = {
+            'extra_fields': kwargs,
+            'request_id': kwargs.get('request_id'),
+            'user_id': kwargs.get('user_id'),
+            'session_id': kwargs.get('session_id')
+        }
+        
+        # Log with full traceback
+        error_logger.error(
+            f"Error in {kwargs.get('endpoint', 'unknown')}: {str(error)}",
+            exc_info=sys.exc_info(),
+            extra=extra
+        )
+    
+    def log_access(self, message: str, **kwargs):
+        """Log access event"""
+        access_logger = self.loggers.get('access', logging.getLogger())
+        
+        extra = {
+            'extra_fields': kwargs,
+            'request_id': kwargs.get('request_id')
+        }
+        
+        access_logger.info(message, extra=extra)
+    
+    def log_security(self, event: str, severity: str = 'warning', **kwargs):
+        """Log security event"""
+        security_logger = self.loggers.get('security', logging.getLogger())
+        
+        extra = {
+            'extra_fields': {
+                'event': event,
+                'severity': severity,
+                **kwargs
+            },
+            'request_id': kwargs.get('request_id')
+        }
+        
+        log_method = getattr(security_logger, severity.lower(), security_logger.warning)
+        log_method(f"Security event: {event}", extra=extra)
+    
+    def log_performance(self, metric: str, value: Union[int, float] = None, **kwargs):
+        """Log performance metric"""
+        perf_logger = self.loggers.get('performance', logging.getLogger())
+        
+        extra = {
+            'extra_fields': {
+                'metric': metric,
+                'value': value,
+                **kwargs
+            },
+            'request_id': kwargs.get('request_id')
+        }
+        
+        perf_logger.info(f"Performance metric: {metric}", extra=extra)
+    
+    def _generate_request_id(self):
+        """Generate unique request ID"""
+        return f"{int(time.time() * 1000)}-{os.urandom(8).hex()}"
+    
+    def _get_error_signature(self, error: Exception):
+        """Generate signature for error deduplication"""
+        # Create signature from error type and key parts of traceback
+        tb_summary = traceback.format_exception_only(type(error), error)
+        signature_data = f"{type(error).__name__}:{tb_summary[0] if tb_summary else ''}"
+        return hashlib.md5(signature_data.encode()).hexdigest()
+    
+    def _track_error(self, signature: str):
+        """Track error frequency"""
+        now = time.time()
+        
+        if signature not in self.error_counts:
+            self.error_counts[signature] = []
+        
+        # Add current timestamp
+        self.error_counts[signature].append(now)
+        
+        # Clean old entries (keep last hour)
+        self.error_counts[signature] = [
+            ts for ts in self.error_counts[signature]
+            if now - ts < 3600
+        ]
+        
+        # Alert if error rate is high
+        error_count = len(self.error_counts[signature])
+        if error_count > 10:  # More than 10 in an hour
+            self.log_security(
+                'high_error_rate',
+                severity='error',
+                signature=signature,
+                count=error_count,
+                message="High error rate detected"
+            )
+    
+    def get_error_summary(self):
+        """Get summary of recent errors"""
+        summary = {}
+        now = time.time()
+        
+        for signature, timestamps in self.error_counts.items():
+            recent_count = len([ts for ts in timestamps if now - ts < 3600])
+            if recent_count > 0:
+                summary[signature] = {
+                    'count_last_hour': recent_count,
+                    'last_seen': max(timestamps)
+                }
+        
+        return summary
+
+# Decorators for easy logging
+def log_errors(logger_name='talk2me'):
+    """Decorator to log function errors"""
+    def decorator(func):
+        @wraps(func)
+        def wrapper(*args, **kwargs):
+            try:
+                return func(*args, **kwargs)
+            except Exception as e:
+                logger = logging.getLogger(logger_name)
+                logger.error(
+                    f"Error in {func.__name__}: {str(e)}",
+                    exc_info=sys.exc_info(),
+                    extra={
+                        'extra_fields': {
+                            'function': func.__name__,
+                            'module': func.__module__
+                        }
+                    }
+                )
+                raise
+        return wrapper
+    return decorator
+
+def log_performance(metric_name=None):
+    """Decorator to log function performance"""
+    def decorator(func):
+        @wraps(func)
+        def wrapper(*args, **kwargs):
+            start_time = time.time()
+            
+            try:
+                result = func(*args, **kwargs)
+                duration = int((time.time() - start_time) * 1000)
+                
+                # Log performance
+                logger = logging.getLogger('talk2me.performance')
+                logger.info(
+                    f"Performance: {metric_name or func.__name__}",
+                    extra={
+                        'extra_fields': {
+                            'metric': metric_name or func.__name__,
+                            'duration_ms': duration,
+                            'function': func.__name__,
+                            'module': func.__module__
+                        }
+                    }
+                )
+                
+                return result
+            except Exception:
+                duration = int((time.time() - start_time) * 1000)
+                logger = logging.getLogger('talk2me.performance')
+                logger.warning(
+                    f"Performance (failed): {metric_name or func.__name__}",
+                    extra={
+                        'extra_fields': {
+                            'metric': metric_name or func.__name__,
+                            'duration_ms': duration,
+                            'function': func.__name__,
+                            'module': func.__module__,
+                            'status': 'failed'
+                        }
+                    }
+                )
+                raise
+        return wrapper
+    return decorator
+
+@contextmanager
+def log_context(**kwargs):
+    """Context manager to add context to logs"""
+    # Store current context
+    old_context = {}
+    for key, value in kwargs.items():
+        if hasattr(g, key):
+            old_context[key] = getattr(g, key)
+        setattr(g, key, value)
+    
+    try:
+        yield
+    finally:
+        # Restore old context
+        for key in kwargs:
+            if key in old_context:
+                setattr(g, key, old_context[key])
+            else:
+                delattr(g, key)
+
+# Utility functions
+def configure_logging(app, **kwargs):
+    """Configure logging for the application"""
+    config = {
+        'log_level': kwargs.get('log_level', app.config.get('LOG_LEVEL', 'INFO')),
+        'log_file': kwargs.get('log_file', app.config.get('LOG_FILE')),
+        'error_log_file': kwargs.get('error_log_file'),
+        'max_bytes': kwargs.get('max_bytes', 50 * 1024 * 1024),
+        'backup_count': kwargs.get('backup_count', 10)
+    }
+    
+    error_logger = ErrorLogger(app, config)
+    return error_logger
+
+def get_logger(name='talk2me'):
+    """Get a logger instance"""
+    return logging.getLogger(name)
+
+def log_exception(error, message=None, **kwargs):
+    """Log an exception with context"""
+    logger = logging.getLogger('talk2me.errors')
+    
+    extra = {
+        'extra_fields': kwargs,
+        'request_id': getattr(g, 'request_id', None)
+    }
+    
+    logger.error(
+        message or f"Exception: {str(error)}",
+        exc_info=(type(error), error, error.__traceback__),
+        extra=extra
+    )
--- a/gunicorn_config.py
+++ b/gunicorn_config.py
@@ -0,0 +1,86 @@
+"""
+Gunicorn configuration for production deployment
+"""
+import multiprocessing
+import os
+
+# Server socket
+bind = os.environ.get('GUNICORN_BIND', '0.0.0.0:5005')
+backlog = 2048
+
+# Worker processes
+# Use 2-4 workers per CPU core
+workers = int(os.environ.get('GUNICORN_WORKERS', multiprocessing.cpu_count() * 2 + 1))
+worker_class = 'sync'  # Use 'gevent' for async if needed
+worker_connections = 1000
+timeout = 120  # Increased for audio processing
+keepalive = 5
+
+# Restart workers after this many requests, to help prevent memory leaks
+max_requests = 1000
+max_requests_jitter = 50
+
+# Preload the application
+preload_app = True
+
+# Server mechanics
+daemon = False
+pidfile = os.environ.get('GUNICORN_PID', '/tmp/talk2me.pid')
+user = None
+group = None
+tmp_upload_dir = None
+
+# Logging
+accesslog = os.environ.get('GUNICORN_ACCESS_LOG', '-')
+errorlog = os.environ.get('GUNICORN_ERROR_LOG', '-')
+loglevel = os.environ.get('GUNICORN_LOG_LEVEL', 'info')
+access_log_format = '%(h)s %(l)s %(u)s %(t)s "%(r)s" %(s)s %(b)s "%(f)s" "%(a)s" %(D)s'
+
+# Process naming
+proc_name = 'talk2me'
+
+# Server hooks
+def when_ready(server):
+    """Called just after the server is started."""
+    server.log.info("Server is ready. Spawning workers")
+
+def worker_int(worker):
+    """Called just after a worker exited on SIGINT or SIGQUIT."""
+    worker.log.info("Worker received INT or QUIT signal")
+
+def pre_fork(server, worker):
+    """Called just before a worker is forked."""
+    server.log.info(f"Forking worker {worker}")
+
+def post_fork(server, worker):
+    """Called just after a worker has been forked."""
+    server.log.info(f"Worker spawned (pid: {worker.pid})")
+
+def worker_exit(server, worker):
+    """Called just after a worker has been killed."""
+    server.log.info(f"Worker exit (pid: {worker.pid})")
+
+def pre_request(worker, req):
+    """Called just before a worker processes the request."""
+    worker.log.debug(f"{req.method} {req.path}")
+
+def post_request(worker, req, environ, resp):
+    """Called after a worker processes the request."""
+    worker.log.debug(f"{req.method} {req.path} - {resp.status}")
+
+# SSL/TLS (uncomment if using HTTPS directly)
+# keyfile = '/path/to/keyfile'
+# certfile = '/path/to/certfile'
+# ssl_version = 'TLSv1_2'
+# cert_reqs = 'required'
+# ca_certs = '/path/to/ca_certs'
+
+# Thread option (if using threaded workers)
+threads = int(os.environ.get('GUNICORN_THREADS', 1))
+
+# Silent health checks in logs
+def pre_request(worker, req):
+    if req.path in ['/health', '/health/live']:
+        # Don't log health checks
+        return
+    worker.log.debug(f"{req.method} {req.path}")
--- a/health-monitor.py
+++ b/health-monitor.py
@@ -0,0 +1,91 @@
+#!/usr/bin/env python3
+"""
+Health monitoring script for Talk2Me application
+Usage: python health-monitor.py [--detailed] [--interval SECONDS]
+"""
+
+import requests
+import time
+import argparse
+import json
+from datetime import datetime
+
+def check_health(url, detailed=False):
+    """Check health of the Talk2Me service"""
+    endpoint = f"{url}/health/detailed" if detailed else f"{url}/health"
+    
+    try:
+        response = requests.get(endpoint, timeout=5)
+        data = response.json()
+        
+        if detailed:
+            print(f"\n=== Health Check at {datetime.now().strftime('%Y-%m-%d %H:%M:%S')} ===")
+            print(f"Overall Status: {data['status'].upper()}")
+            print("\nComponent Status:")
+            for component, status in data['components'].items():
+                status_icon = "✅" if status.get('status') == 'healthy' else "❌"
+                print(f"  {status_icon} {component}: {status.get('status', 'unknown')}")
+                if 'error' in status:
+                    print(f"     Error: {status['error']}")
+                if 'device' in status:
+                    print(f"     Device: {status['device']}")
+                if 'model_size' in status:
+                    print(f"     Model: {status['model_size']}")
+            
+            if 'metrics' in data:
+                print("\nMetrics:")
+                uptime = data['metrics'].get('uptime', 0)
+                hours = int(uptime // 3600)
+                minutes = int((uptime % 3600) // 60)
+                print(f"  Uptime: {hours}h {minutes}m")
+                print(f"  Request Count: {data['metrics'].get('request_count', 0)}")
+        else:
+            status_icon = "✅" if response.status_code == 200 else "❌"
+            print(f"{status_icon} {datetime.now().strftime('%H:%M:%S')} - Status: {data.get('status', 'unknown')}")
+        
+        return response.status_code == 200
+    
+    except requests.exceptions.ConnectionError:
+        print(f"❌ {datetime.now().strftime('%H:%M:%S')} - Connection failed")
+        return False
+    except requests.exceptions.Timeout:
+        print(f"❌ {datetime.now().strftime('%H:%M:%S')} - Request timeout")
+        return False
+    except Exception as e:
+        print(f"❌ {datetime.now().strftime('%H:%M:%S')} - Error: {str(e)}")
+        return False
+
+def main():
+    parser = argparse.ArgumentParser(description='Monitor Talk2Me service health')
+    parser.add_argument('--url', default='http://localhost:5005', help='Service URL')
+    parser.add_argument('--detailed', action='store_true', help='Show detailed health info')
+    parser.add_argument('--interval', type=int, default=30, help='Check interval in seconds')
+    parser.add_argument('--once', action='store_true', help='Run once and exit')
+    
+    args = parser.parse_args()
+    
+    print(f"Monitoring {args.url}")
+    print("Press Ctrl+C to stop\n")
+    
+    consecutive_failures = 0
+    
+    try:
+        while True:
+            success = check_health(args.url, args.detailed)
+            
+            if not success:
+                consecutive_failures += 1
+                if consecutive_failures >= 3:
+                    print(f"\n⚠️  ALERT: Service has been down for {consecutive_failures} consecutive checks!")
+            else:
+                consecutive_failures = 0
+            
+            if args.once:
+                break
+            
+            time.sleep(args.interval)
+    except KeyboardInterrupt:
+        print("\n\nMonitoring stopped.")
+
+if __name__ == "__main__":
+    main()
--- a/maintenance.sh
+++ b/maintenance.sh
@@ -0,0 +1,117 @@
+#!/bin/bash
+
+# Maintenance script for Talk2Me application
+# This script helps manage temporary files and disk space
+
+UPLOAD_FOLDER="${UPLOAD_FOLDER:-/tmp/talk2me_uploads}"
+MAX_AGE_MINUTES=5
+
+echo "Talk2Me Maintenance Script"
+echo "========================="
+
+# Function to check disk usage
+check_disk_usage() {
+    echo -e "\nDisk Usage:"
+    df -h "$UPLOAD_FOLDER" 2>/dev/null || df -h /tmp
+}
+
+# Function to show temp file stats
+show_temp_stats() {
+    echo -e "\nTemporary File Statistics:"
+    if [ -d "$UPLOAD_FOLDER" ]; then
+        file_count=$(find "$UPLOAD_FOLDER" -type f 2>/dev/null | wc -l)
+        total_size=$(du -sh "$UPLOAD_FOLDER" 2>/dev/null | cut -f1)
+        echo "  Upload folder: $UPLOAD_FOLDER"
+        echo "  File count: $file_count"
+        echo "  Total size: ${total_size:-0}"
+        
+        if [ $file_count -gt 0 ]; then
+            echo -e "\n  Oldest files:"
+            find "$UPLOAD_FOLDER" -type f -printf '%T+ %p\n' 2>/dev/null | sort | head -5
+        fi
+    else
+        echo "  Upload folder does not exist: $UPLOAD_FOLDER"
+    fi
+}
+
+# Function to clean old temp files
+clean_temp_files() {
+    echo -e "\nCleaning temporary files older than $MAX_AGE_MINUTES minutes..."
+    if [ -d "$UPLOAD_FOLDER" ]; then
+        # Count files before cleanup
+        before_count=$(find "$UPLOAD_FOLDER" -type f 2>/dev/null | wc -l)
+        
+        # Remove old files
+        find "$UPLOAD_FOLDER" -type f -mmin +$MAX_AGE_MINUTES -delete 2>/dev/null
+        
+        # Count files after cleanup
+        after_count=$(find "$UPLOAD_FOLDER" -type f 2>/dev/null | wc -l)
+        removed=$((before_count - after_count))
+        
+        echo "  Removed $removed files"
+    else
+        echo "  Upload folder does not exist: $UPLOAD_FOLDER"
+    fi
+}
+
+# Function to setup upload folder
+setup_upload_folder() {
+    echo -e "\nSetting up upload folder..."
+    if [ ! -d "$UPLOAD_FOLDER" ]; then
+        mkdir -p "$UPLOAD_FOLDER"
+        chmod 755 "$UPLOAD_FOLDER"
+        echo "  Created: $UPLOAD_FOLDER"
+    else
+        echo "  Exists: $UPLOAD_FOLDER"
+    fi
+}
+
+# Function to monitor in real-time
+monitor_realtime() {
+    echo -e "\nMonitoring temporary files (Press Ctrl+C to stop)..."
+    while true; do
+        clear
+        echo "Talk2Me File Monitor - $(date)"
+        echo "================================"
+        show_temp_stats
+        check_disk_usage
+        sleep 5
+    done
+}
+
+# Main menu
+case "${1:-help}" in
+    status)
+        show_temp_stats
+        check_disk_usage
+        ;;
+    clean)
+        clean_temp_files
+        show_temp_stats
+        ;;
+    setup)
+        setup_upload_folder
+        ;;
+    monitor)
+        monitor_realtime
+        ;;
+    all)
+        setup_upload_folder
+        clean_temp_files
+        show_temp_stats
+        check_disk_usage
+        ;;
+    *)
+        echo "Usage: $0 {status|clean|setup|monitor|all}"
+        echo ""
+        echo "Commands:"
+        echo "  status  - Show current temp file statistics"
+        echo "  clean   - Clean old temporary files"
+        echo "  setup   - Create upload folder if needed"
+        echo "  monitor - Real-time monitoring"
+        echo "  all     - Run setup, clean, and show status"
+        echo ""
+        echo "Environment Variables:"
+        echo "  UPLOAD_FOLDER - Set custom upload folder (default: /tmp/talk2me_uploads)"
+        ;;
+esac
--- a/manage_secrets.py
+++ b/manage_secrets.py
@@ -0,0 +1,271 @@
+#!/usr/bin/env python3
+"""
+Secret management CLI tool for Talk2Me
+
+Usage:
+    python manage_secrets.py list
+    python manage_secrets.py get <key>
+    python manage_secrets.py set <key> <value>
+    python manage_secrets.py rotate <key>
+    python manage_secrets.py delete <key>
+    python manage_secrets.py check-rotation
+    python manage_secrets.py verify
+    python manage_secrets.py migrate
+"""
+
+import sys
+import os
+import click
+import getpass
+from datetime import datetime
+from secrets_manager import get_secrets_manager, SecretsManager
+import json
+
+# Initialize secrets manager
+manager = get_secrets_manager()
+
+@click.group()
+def cli():
+    """Talk2Me Secrets Management Tool"""
+    pass
+
+@cli.command()
+def list():
+    """List all secrets (without values)"""
+    secrets = manager.list_secrets()
+    
+    if not secrets:
+        click.echo("No secrets found.")
+        return
+    
+    click.echo(f"\nFound {len(secrets)} secrets:\n")
+    
+    # Format as table
+    click.echo(f"{'Key':<30} {'Created':<20} {'Last Rotated':<20} {'Has Value'}")
+    click.echo("-" * 90)
+    
+    for secret in secrets:
+        created = secret['created'][:10] if secret['created'] else 'Unknown'
+        rotated = secret['rotated'][:10] if secret['rotated'] else 'Never'
+        has_value = '✓' if secret['has_value'] else '✗'
+        
+        click.echo(f"{secret['key']:<30} {created:<20} {rotated:<20} {has_value}")
+
+@cli.command()
+@click.argument('key')
+def get(key):
+    """Get a secret value (requires confirmation)"""
+    if not click.confirm(f"Are you sure you want to display the value of '{key}'?"):
+        return
+    
+    value = manager.get(key)
+    
+    if value is None:
+        click.echo(f"Secret '{key}' not found.")
+    else:
+        click.echo(f"\nSecret '{key}':")
+        click.echo(f"Value: {value}")
+        
+        # Show metadata
+        secrets = manager.list_secrets()
+        for secret in secrets:
+            if secret['key'] == key:
+                if secret.get('metadata'):
+                    click.echo(f"Metadata: {json.dumps(secret['metadata'], indent=2)}")
+                break
+
+@cli.command()
+@click.argument('key')
+@click.option('--value', help='Secret value (will prompt if not provided)')
+@click.option('--metadata', help='JSON metadata')
+def set(key, value, metadata):
+    """Set a secret value"""
+    if not value:
+        value = getpass.getpass(f"Enter value for '{key}': ")
+        confirm = getpass.getpass(f"Confirm value for '{key}': ")
+        
+        if value != confirm:
+            click.echo("Values do not match. Aborted.")
+            return
+    
+    # Parse metadata if provided
+    metadata_dict = None
+    if metadata:
+        try:
+            metadata_dict = json.loads(metadata)
+        except json.JSONDecodeError:
+            click.echo("Invalid JSON metadata")
+            return
+    
+    # Validate the secret if validator exists
+    if not manager.validate(key, value):
+        click.echo(f"Validation failed for '{key}'")
+        return
+    
+    manager.set(key, value, metadata_dict, user='cli')
+    click.echo(f"Secret '{key}' set successfully.")
+
+@cli.command()
+@click.argument('key')
+@click.option('--new-value', help='New secret value (will auto-generate if not provided)')
+def rotate(key):
+    """Rotate a secret"""
+    try:
+        if not click.confirm(f"Are you sure you want to rotate '{key}'?"):
+            return
+        
+        old_value, new_value = manager.rotate(key, new_value, user='cli')
+        
+        click.echo(f"\nSecret '{key}' rotated successfully.")
+        click.echo(f"New value: {new_value}")
+        
+        if click.confirm("Do you want to see the old value?"):
+            click.echo(f"Old value: {old_value}")
+            
+    except KeyError:
+        click.echo(f"Secret '{key}' not found.")
+    except ValueError as e:
+        click.echo(f"Error: {e}")
+
+@cli.command()
+@click.argument('key')
+def delete(key):
+    """Delete a secret"""
+    if not click.confirm(f"Are you sure you want to delete '{key}'? This cannot be undone."):
+        return
+    
+    if manager.delete(key, user='cli'):
+        click.echo(f"Secret '{key}' deleted successfully.")
+    else:
+        click.echo(f"Secret '{key}' not found.")
+
+@cli.command()
+def check_rotation():
+    """Check which secrets need rotation"""
+    needs_rotation = manager.check_rotation_needed()
+    
+    if not needs_rotation:
+        click.echo("No secrets need rotation.")
+        return
+    
+    click.echo(f"\n{len(needs_rotation)} secrets need rotation:")
+    for key in needs_rotation:
+        click.echo(f"  - {key}")
+    
+    if click.confirm("\nDo you want to rotate all of them now?"):
+        for key in needs_rotation:
+            try:
+                old_value, new_value = manager.rotate(key, user='cli')
+                click.echo(f"✓ Rotated {key}")
+            except Exception as e:
+                click.echo(f"✗ Failed to rotate {key}: {e}")
+
+@cli.command()
+def verify():
+    """Verify integrity of all secrets"""
+    click.echo("Verifying secrets integrity...")
+    
+    if manager.verify_integrity():
+        click.echo("✓ All secrets passed integrity check")
+    else:
+        click.echo("✗ Integrity check failed!")
+        click.echo("Some secrets may be corrupted. Check logs for details.")
+
+@cli.command()
+def migrate():
+    """Migrate secrets from environment variables"""
+    click.echo("Migrating secrets from environment variables...")
+    
+    # List of known secrets to migrate
+    secrets_to_migrate = [
+        ('TTS_API_KEY', 'TTS API Key'),
+        ('SECRET_KEY', 'Flask Secret Key'),
+        ('ADMIN_TOKEN', 'Admin Token'),
+        ('DATABASE_URL', 'Database URL'),
+        ('REDIS_URL', 'Redis URL'),
+    ]
+    
+    migrated = 0
+    
+    for env_key, description in secrets_to_migrate:
+        value = os.environ.get(env_key)
+        if value and value != manager.get(env_key):
+            if click.confirm(f"Migrate {description} from environment?"):
+                manager.set(env_key, value, {'migrated_from': 'environment'}, user='migration')
+                click.echo(f"✓ Migrated {env_key}")
+                migrated += 1
+    
+    click.echo(f"\nMigrated {migrated} secrets.")
+
+@cli.command()
+@click.argument('key')
+@click.argument('days', type=int)
+def schedule_rotation(key, days):
+    """Schedule automatic rotation for a secret"""
+    manager.schedule_rotation(key, days)
+    click.echo(f"Scheduled rotation for '{key}' every {days} days.")
+
+@cli.command()
+@click.argument('key', required=False)
+@click.option('--limit', default=20, help='Number of entries to show')
+def audit(key, limit):
+    """Show audit log"""
+    logs = manager.get_audit_log(key, limit)
+    
+    if not logs:
+        click.echo("No audit log entries found.")
+        return
+    
+    click.echo(f"\nShowing last {len(logs)} audit log entries:\n")
+    
+    for entry in logs:
+        timestamp = entry['timestamp'][:19]  # Trim microseconds
+        action = entry['action'].ljust(15)
+        key_str = entry['key'].ljust(20)
+        user = entry['user']
+        
+        click.echo(f"{timestamp} | {action} | {key_str} | {user}")
+        
+        if entry.get('details'):
+            click.echo(f"{'':>20} Details: {json.dumps(entry['details'])}")
+
+@cli.command()
+def init():
+    """Initialize secrets configuration"""
+    click.echo("Initializing Talk2Me secrets configuration...")
+    
+    # Check if already initialized
+    if os.path.exists('.secrets.json'):
+        if not click.confirm(".secrets.json already exists. Overwrite?"):
+            return
+    
+    # Generate initial secrets
+    import secrets as py_secrets
+    
+    initial_secrets = {
+        'FLASK_SECRET_KEY': py_secrets.token_hex(32),
+        'ADMIN_TOKEN': py_secrets.token_urlsafe(32),
+    }
+    
+    click.echo("\nGenerating initial secrets...")
+    
+    for key, value in initial_secrets.items():
+        manager.set(key, value, {'generated': True}, user='init')
+        click.echo(f"✓ Generated {key}")
+    
+    # Prompt for required secrets
+    click.echo("\nPlease provide the following secrets:")
+    
+    tts_api_key = getpass.getpass("TTS API Key (press Enter to skip): ")
+    if tts_api_key:
+        manager.set('TTS_API_KEY', tts_api_key, user='init')
+        click.echo("✓ Set TTS_API_KEY")
+    
+    click.echo("\nSecrets initialized successfully!")
+    click.echo("\nIMPORTANT:")
+    click.echo("1. Keep .secrets.json secure and never commit it to version control")
+    click.echo("2. Back up your master key from .master_key")
+    click.echo("3. Set appropriate file permissions (owner read/write only)")
+
+if __name__ == '__main__':
+    cli()
--- a/memory_manager.py
+++ b/memory_manager.py
@@ -0,0 +1,403 @@
+# Memory management system to prevent leaks and monitor usage
+import gc
+import os
+import psutil
+import torch
+import logging
+import threading
+import time
+from typing import Dict, Optional, Callable
+from dataclasses import dataclass, field
+from datetime import datetime
+import weakref
+import tempfile
+import shutil
+
+logger = logging.getLogger(__name__)
+
+@dataclass
+class MemoryStats:
+    """Current memory statistics"""
+    timestamp: float = field(default_factory=time.time)
+    process_memory_mb: float = 0.0
+    system_memory_percent: float = 0.0
+    gpu_memory_mb: float = 0.0
+    gpu_memory_percent: float = 0.0
+    temp_files_count: int = 0
+    temp_files_size_mb: float = 0.0
+    active_sessions: int = 0
+    gc_collections: Dict[int, int] = field(default_factory=dict)
+
+class MemoryManager:
+    """
+    Comprehensive memory management system to prevent leaks
+    """
+    def __init__(self, app=None, config=None):
+        self.config = config or {}
+        self.app = app
+        self._cleanup_callbacks = []
+        self._resource_registry = weakref.WeakValueDictionary()
+        self._monitoring_thread = None
+        self._shutdown = False
+        
+        # Memory thresholds
+        self.memory_threshold_mb = self.config.get('memory_threshold_mb', 4096)  # 4GB
+        self.gpu_memory_threshold_mb = self.config.get('gpu_memory_threshold_mb', 2048)  # 2GB
+        self.cleanup_interval = self.config.get('cleanup_interval', 30)  # 30 seconds
+        
+        # Whisper model reference
+        self.whisper_model = None
+        self.model_reload_count = 0
+        self.last_model_reload = time.time()
+        
+        if app:
+            self.init_app(app)
+    
+    def init_app(self, app):
+        """Initialize memory management for Flask app"""
+        self.app = app
+        app.memory_manager = self
+        
+        # Start monitoring thread
+        self._start_monitoring()
+        
+        # Register cleanup on shutdown
+        import atexit
+        atexit.register(self.shutdown)
+        
+        logger.info("Memory manager initialized")
+    
+    def set_whisper_model(self, model):
+        """Register the Whisper model for management"""
+        self.whisper_model = model
+        logger.info("Whisper model registered with memory manager")
+    
+    def _start_monitoring(self):
+        """Start background memory monitoring"""
+        self._monitoring_thread = threading.Thread(
+            target=self._monitor_memory,
+            daemon=True
+        )
+        self._monitoring_thread.start()
+    
+    def _monitor_memory(self):
+        """Background thread to monitor and manage memory"""
+        logger.info("Memory monitoring thread started")
+        
+        while not self._shutdown:
+            try:
+                # Collect memory statistics
+                stats = self.get_memory_stats()
+                
+                # Check if we need to free memory
+                if self._should_cleanup(stats):
+                    logger.warning(f"Memory threshold exceeded - Process: {stats.process_memory_mb:.1f}MB, "
+                                 f"GPU: {stats.gpu_memory_mb:.1f}MB")
+                    self.cleanup_memory(aggressive=True)
+                
+                # Log stats periodically
+                if int(time.time()) % 300 == 0:  # Every 5 minutes
+                    logger.info(f"Memory stats - Process: {stats.process_memory_mb:.1f}MB, "
+                              f"System: {stats.system_memory_percent:.1f}%, "
+                              f"GPU: {stats.gpu_memory_mb:.1f}MB")
+                
+            except Exception as e:
+                logger.error(f"Error in memory monitoring: {e}")
+            
+            time.sleep(self.cleanup_interval)
+    
+    def _should_cleanup(self, stats: MemoryStats) -> bool:
+        """Determine if memory cleanup is needed"""
+        # Check process memory
+        if stats.process_memory_mb > self.memory_threshold_mb:
+            return True
+        
+        # Check system memory
+        if stats.system_memory_percent > 85:
+            return True
+        
+        # Check GPU memory
+        if stats.gpu_memory_mb > self.gpu_memory_threshold_mb:
+            return True
+        
+        return False
+    
+    def get_memory_stats(self) -> MemoryStats:
+        """Get current memory statistics"""
+        stats = MemoryStats()
+        
+        try:
+            # Process memory
+            process = psutil.Process()
+            memory_info = process.memory_info()
+            stats.process_memory_mb = memory_info.rss / 1024 / 1024
+            
+            # System memory
+            system_memory = psutil.virtual_memory()
+            stats.system_memory_percent = system_memory.percent
+            
+            # GPU memory if available
+            if torch.cuda.is_available():
+                stats.gpu_memory_mb = torch.cuda.memory_allocated() / 1024 / 1024
+                stats.gpu_memory_percent = (torch.cuda.memory_allocated() / 
+                                           torch.cuda.get_device_properties(0).total_memory * 100)
+            
+            # Temp files
+            temp_dir = self.app.config.get('UPLOAD_FOLDER', tempfile.gettempdir())
+            if os.path.exists(temp_dir):
+                temp_files = list(os.listdir(temp_dir))
+                stats.temp_files_count = len(temp_files)
+                stats.temp_files_size_mb = sum(
+                    os.path.getsize(os.path.join(temp_dir, f)) 
+                    for f in temp_files if os.path.isfile(os.path.join(temp_dir, f))
+                ) / 1024 / 1024
+            
+            # Session count
+            if hasattr(self.app, 'session_manager'):
+                stats.active_sessions = len(self.app.session_manager.sessions)
+            
+            # GC stats
+            gc_stats = gc.get_stats()
+            for i, stat in enumerate(gc_stats):
+                if isinstance(stat, dict):
+                    stats.gc_collections[i] = stat.get('collections', 0)
+            
+        except Exception as e:
+            logger.error(f"Error collecting memory stats: {e}")
+        
+        return stats
+    
+    def cleanup_memory(self, aggressive=False):
+        """Perform memory cleanup"""
+        logger.info(f"Starting memory cleanup (aggressive={aggressive})")
+        freed_mb = 0
+        
+        try:
+            # 1. Force garbage collection
+            gc.collect()
+            if aggressive:
+                gc.collect(2)  # Full collection
+            
+            # 2. Clear GPU memory cache
+            if torch.cuda.is_available():
+                before_gpu = torch.cuda.memory_allocated() / 1024 / 1024
+                torch.cuda.empty_cache()
+                torch.cuda.synchronize()
+                after_gpu = torch.cuda.memory_allocated() / 1024 / 1024
+                freed_mb += (before_gpu - after_gpu)
+                logger.info(f"Freed {before_gpu - after_gpu:.1f}MB GPU memory")
+            
+            # 3. Clean old temporary files
+            if hasattr(self.app, 'config'):
+                temp_dir = self.app.config.get('UPLOAD_FOLDER')
+                if temp_dir and os.path.exists(temp_dir):
+                    freed_mb += self._cleanup_temp_files(temp_dir, aggressive)
+            
+            # 4. Trigger session cleanup
+            if hasattr(self.app, 'session_manager'):
+                self.app.session_manager.cleanup_expired_sessions()
+                if aggressive:
+                    self.app.session_manager.cleanup_idle_sessions()
+            
+            # 5. Run registered cleanup callbacks
+            for callback in self._cleanup_callbacks:
+                try:
+                    callback()
+                except Exception as e:
+                    logger.error(f"Cleanup callback error: {e}")
+            
+            # 6. Reload Whisper model if needed (aggressive mode only)
+            if aggressive and self.whisper_model and torch.cuda.is_available():
+                current_gpu_mb = torch.cuda.memory_allocated() / 1024 / 1024
+                if current_gpu_mb > self.gpu_memory_threshold_mb * 0.8:
+                    self._reload_whisper_model()
+            
+            logger.info(f"Memory cleanup completed - freed approximately {freed_mb:.1f}MB")
+            
+        except Exception as e:
+            logger.error(f"Error during memory cleanup: {e}")
+    
+    def _cleanup_temp_files(self, temp_dir: str, aggressive: bool) -> float:
+        """Clean up temporary files"""
+        freed_mb = 0
+        current_time = time.time()
+        max_age = 300 if not aggressive else 60  # 5 minutes or 1 minute
+        
+        try:
+            for filename in os.listdir(temp_dir):
+                filepath = os.path.join(temp_dir, filename)
+                if os.path.isfile(filepath):
+                    file_age = current_time - os.path.getmtime(filepath)
+                    if file_age > max_age:
+                        file_size = os.path.getsize(filepath) / 1024 / 1024
+                        try:
+                            os.remove(filepath)
+                            freed_mb += file_size
+                            logger.debug(f"Removed old temp file: {filename}")
+                        except Exception as e:
+                            logger.error(f"Failed to remove {filepath}: {e}")
+        except Exception as e:
+            logger.error(f"Error cleaning temp files: {e}")
+        
+        return freed_mb
+    
+    def _reload_whisper_model(self):
+        """Reload Whisper model to clear GPU memory fragmentation"""
+        if not self.whisper_model:
+            return
+        
+        # Don't reload too frequently
+        if time.time() - self.last_model_reload < 300:  # 5 minutes
+            return
+        
+        try:
+            logger.info("Reloading Whisper model to clear GPU memory")
+            
+            # Get model info
+            import whisper
+            model_size = getattr(self.whisper_model, 'model_size', 'base')
+            device = next(self.whisper_model.parameters()).device
+            
+            # Clear the old model
+            del self.whisper_model
+            torch.cuda.empty_cache()
+            gc.collect()
+            
+            # Reload model
+            self.whisper_model = whisper.load_model(model_size, device=device)
+            self.model_reload_count += 1
+            self.last_model_reload = time.time()
+            
+            # Update app reference
+            if hasattr(self.app, 'whisper_model'):
+                self.app.whisper_model = self.whisper_model
+            
+            logger.info(f"Whisper model reloaded successfully (reload #{self.model_reload_count})")
+            
+        except Exception as e:
+            logger.error(f"Failed to reload Whisper model: {e}")
+    
+    def register_cleanup_callback(self, callback: Callable):
+        """Register a callback to be called during cleanup"""
+        self._cleanup_callbacks.append(callback)
+    
+    def register_resource(self, resource, name: str = None):
+        """Register a resource for tracking"""
+        if name:
+            self._resource_registry[name] = resource
+    
+    def release_resource(self, name: str):
+        """Release a tracked resource"""
+        if name in self._resource_registry:
+            del self._resource_registry[name]
+    
+    def get_metrics(self) -> Dict:
+        """Get memory management metrics"""
+        stats = self.get_memory_stats()
+        
+        return {
+            'memory': {
+                'process_mb': round(stats.process_memory_mb, 1),
+                'system_percent': round(stats.system_memory_percent, 1),
+                'gpu_mb': round(stats.gpu_memory_mb, 1),
+                'gpu_percent': round(stats.gpu_memory_percent, 1)
+            },
+            'temp_files': {
+                'count': stats.temp_files_count,
+                'size_mb': round(stats.temp_files_size_mb, 1)
+            },
+            'sessions': {
+                'active': stats.active_sessions
+            },
+            'model': {
+                'reload_count': self.model_reload_count,
+                'last_reload': datetime.fromtimestamp(self.last_model_reload).isoformat()
+            },
+            'thresholds': {
+                'memory_mb': self.memory_threshold_mb,
+                'gpu_mb': self.gpu_memory_threshold_mb
+            }
+        }
+    
+    def shutdown(self):
+        """Shutdown memory manager"""
+        logger.info("Shutting down memory manager")
+        self._shutdown = True
+        
+        # Final cleanup
+        self.cleanup_memory(aggressive=True)
+        
+        # Wait for monitoring thread
+        if self._monitoring_thread:
+            self._monitoring_thread.join(timeout=5)
+
+# Context manager for audio processing
+class AudioProcessingContext:
+    """Context manager to ensure audio resources are cleaned up"""
+    def __init__(self, memory_manager: MemoryManager, name: str = None):
+        self.memory_manager = memory_manager
+        self.name = name or f"audio_{int(time.time() * 1000)}"
+        self.temp_files = []
+        self.start_time = None
+        self.start_memory = None
+    
+    def __enter__(self):
+        self.start_time = time.time()
+        if torch.cuda.is_available():
+            self.start_memory = torch.cuda.memory_allocated()
+        return self
+    
+    def __exit__(self, exc_type, exc_val, exc_tb):
+        # Clean up temp files
+        for filepath in self.temp_files:
+            try:
+                if os.path.exists(filepath):
+                    os.remove(filepath)
+            except Exception as e:
+                logger.error(f"Failed to remove temp file {filepath}: {e}")
+        
+        # Clear GPU cache if used
+        if torch.cuda.is_available():
+            torch.cuda.empty_cache()
+            
+            # Log memory usage
+            if self.start_memory is not None:
+                memory_used = torch.cuda.memory_allocated() - self.start_memory
+                duration = time.time() - self.start_time
+                logger.debug(f"Audio processing '{self.name}' - Duration: {duration:.2f}s, "
+                           f"GPU memory: {memory_used / 1024 / 1024:.1f}MB")
+        
+        # Force garbage collection if there was an error
+        if exc_type is not None:
+            gc.collect()
+    
+    def add_temp_file(self, filepath: str):
+        """Register a temporary file for cleanup"""
+        self.temp_files.append(filepath)
+
+# Utility functions
+def with_memory_management(func):
+    """Decorator to add memory management to functions"""
+    def wrapper(*args, **kwargs):
+        # Get memory manager from app context
+        from flask import current_app
+        memory_manager = getattr(current_app, 'memory_manager', None)
+        
+        if memory_manager:
+            with AudioProcessingContext(memory_manager, name=func.__name__):
+                return func(*args, **kwargs)
+        else:
+            return func(*args, **kwargs)
+    
+    return wrapper
+
+def init_memory_management(app, **kwargs):
+    """Initialize memory management for the application"""
+    config = {
+        'memory_threshold_mb': kwargs.get('memory_threshold_mb', 4096),
+        'gpu_memory_threshold_mb': kwargs.get('gpu_memory_threshold_mb', 2048),
+        'cleanup_interval': kwargs.get('cleanup_interval', 30)
+    }
+    
+    memory_manager = MemoryManager(app, config)
+    return memory_manager
--- a/nginx.conf
+++ b/nginx.conf
@@ -0,0 +1,108 @@
+upstream talk2me {
+    server talk2me:5005 fail_timeout=0;
+}
+
+server {
+    listen 80;
+    server_name _;
+    
+    # Redirect to HTTPS in production
+    # return 301 https://$server_name$request_uri;
+    
+    # Security headers
+    add_header X-Content-Type-Options nosniff always;
+    add_header X-Frame-Options DENY always;
+    add_header X-XSS-Protection "1; mode=block" always;
+    add_header Referrer-Policy "strict-origin-when-cross-origin" always;
+    add_header Content-Security-Policy "default-src 'self'; script-src 'self' 'unsafe-inline'; style-src 'self' 'unsafe-inline'; img-src 'self' data:; font-src 'self'; connect-src 'self'; media-src 'self';" always;
+
+    # File upload limits
+    client_max_body_size 50M;
+    client_body_buffer_size 1M;
+    client_body_timeout 120s;
+
+    # Timeouts
+    proxy_connect_timeout 120s;
+    proxy_send_timeout 120s;
+    proxy_read_timeout 120s;
+    send_timeout 120s;
+
+    # Gzip compression
+    gzip on;
+    gzip_vary on;
+    gzip_min_length 1024;
+    gzip_types text/plain text/css text/xml text/javascript application/x-javascript application/xml+rss application/json application/javascript;
+
+    # Static files
+    location /static {
+        alias /app/static;
+        expires 1y;
+        add_header Cache-Control "public, immutable";
+        
+        # Gzip static files
+        gzip_static on;
+    }
+
+    # Service worker
+    location /service-worker.js {
+        proxy_pass http://talk2me;
+        proxy_set_header Host $host;
+        proxy_set_header X-Real-IP $remote_addr;
+        add_header Cache-Control "no-cache, no-store, must-revalidate";
+    }
+
+    # WebSocket support for future features
+    location /ws {
+        proxy_pass http://talk2me;
+        proxy_http_version 1.1;
+        proxy_set_header Upgrade $http_upgrade;
+        proxy_set_header Connection "upgrade";
+        proxy_set_header Host $host;
+        proxy_set_header X-Real-IP $remote_addr;
+        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
+        proxy_set_header X-Forwarded-Proto $scheme;
+        
+        # WebSocket timeouts
+        proxy_read_timeout 86400s;
+        proxy_send_timeout 86400s;
+    }
+
+    # Health check (don't log)
+    location /health {
+        proxy_pass http://talk2me/health;
+        access_log off;
+        proxy_set_header Host $host;
+        proxy_set_header X-Real-IP $remote_addr;
+    }
+
+    # Main application
+    location / {
+        proxy_pass http://talk2me;
+        proxy_redirect off;
+        proxy_buffering off;
+        
+        proxy_set_header Host $host;
+        proxy_set_header X-Real-IP $remote_addr;
+        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
+        proxy_set_header X-Forwarded-Proto $scheme;
+        proxy_set_header X-Forwarded-Host $server_name;
+        
+        # Don't buffer responses
+        proxy_buffering off;
+        proxy_request_buffering off;
+    }
+}
+
+# HTTPS configuration (uncomment for production)
+# server {
+#     listen 443 ssl http2;
+#     server_name your-domain.com;
+#     
+#     ssl_certificate /etc/nginx/ssl/cert.pem;
+#     ssl_certificate_key /etc/nginx/ssl/key.pem;
+#     ssl_protocols TLSv1.2 TLSv1.3;
+#     ssl_ciphers ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256:ECDHE-ECDSA-AES256-GCM-SHA384:ECDHE-RSA-AES256-GCM-SHA384;
+#     ssl_prefer_server_ciphers off;
+#     
+#     # Include all location blocks from above
+# }
--- a/package-lock.json
+++ b/package-lock.json
@@ -0,0 +1,48 @@
+{
+  "name": "talk2me",
+  "version": "1.0.0",
+  "lockfileVersion": 3,
+  "requires": true,
+  "packages": {
+    "": {
+      "name": "talk2me",
+      "version": "1.0.0",
+      "license": "ISC",
+      "devDependencies": {
+        "@types/node": "^20.10.0",
+        "typescript": "^5.3.0"
+      }
+    },
+    "node_modules/@types/node": {
+      "version": "20.17.57",
+      "resolved": "https://registry.npmjs.org/@types/node/-/node-20.17.57.tgz",
+      "integrity": "sha512-f3T4y6VU4fVQDKVqJV4Uppy8c1p/sVvS3peyqxyWnzkqXFJLRU7Y1Bl7rMS1Qe9z0v4M6McY0Fp9yBsgHJUsWQ==",
+      "dev": true,
+      "license": "MIT",
+      "dependencies": {
+        "undici-types": "~6.19.2"
+      }
+    },
+    "node_modules/typescript": {
+      "version": "5.8.3",
+      "resolved": "https://registry.npmjs.org/typescript/-/typescript-5.8.3.tgz",
+      "integrity": "sha512-p1diW6TqL9L07nNxvRMM7hMMw4c5XOo/1ibL4aAIGmSAt9slTE1Xgw5KWuof2uTOvCg9BY7ZRi+GaF+7sfgPeQ==",
+      "dev": true,
+      "license": "Apache-2.0",
+      "bin": {
+        "tsc": "bin/tsc",
+        "tsserver": "bin/tsserver"
+      },
+      "engines": {
+        "node": ">=14.17"
+      }
+    },
+    "node_modules/undici-types": {
+      "version": "6.19.8",
+      "resolved": "https://registry.npmjs.org/undici-types/-/undici-types-6.19.8.tgz",
+      "integrity": "sha512-ve2KP6f/JnbPBFyobGHuerC9g1FYGn/F8n1LWTwNxCEzd6IfqTwUQcNXgEtmmQ6DlRrC1hrSrBnCZPokRrDHjw==",
+      "dev": true,
+      "license": "MIT"
+    }
+  }
+}
--- a/package.json
+++ b/package.json
@@ -0,0 +1,26 @@
+{
+  "name": "talk2me",
+  "version": "1.0.0",
+  "description": "Real-time voice translation web application",
+  "main": "index.js",
+  "scripts": {
+    "build": "tsc",
+    "watch": "tsc --watch",
+    "dev": "tsc --watch",
+    "clean": "rm -rf static/js/dist",
+    "type-check": "tsc --noEmit"
+  },
+  "keywords": [
+    "translation",
+    "voice",
+    "pwa",
+    "typescript"
+  ],
+  "author": "",
+  "license": "ISC",
+  "devDependencies": {
+    "@types/node": "^20.10.0",
+    "typescript": "^5.3.0"
+  },
+  "dependencies": {}
+}
--- a/rate_limiter.py
+++ b/rate_limiter.py
@@ -0,0 +1,408 @@
+# Rate limiting implementation for Flask
+import time
+import logging
+from functools import wraps
+from collections import defaultdict, deque
+from threading import Lock
+from flask import request, jsonify, g
+from datetime import datetime, timedelta
+import hashlib
+import json
+
+logger = logging.getLogger(__name__)
+
+class RateLimiter:
+    """
+    Token bucket rate limiter with sliding window and multiple strategies
+    """
+    def __init__(self):
+        self.buckets = defaultdict(lambda: {
+            'tokens': 0,
+            'last_update': time.time(),
+            'requests': deque(maxlen=1000)  # Track last 1000 requests
+        })
+        self.lock = Lock()
+        
+        # Default limits (can be overridden per endpoint)
+        self.default_limits = {
+            'requests_per_minute': 30,
+            'requests_per_hour': 500,
+            'burst_size': 10,
+            'token_refresh_rate': 0.5  # tokens per second
+        }
+        
+        # Endpoint-specific limits
+        self.endpoint_limits = {
+            '/transcribe': {
+                'requests_per_minute': 10,
+                'requests_per_hour': 100,
+                'burst_size': 3,
+                'token_refresh_rate': 0.167,  # 1 token per 6 seconds
+                'max_request_size': 10 * 1024 * 1024  # 10MB
+            },
+            '/translate': {
+                'requests_per_minute': 20,
+                'requests_per_hour': 300,
+                'burst_size': 5,
+                'token_refresh_rate': 0.333,  # 1 token per 3 seconds
+                'max_request_size': 100 * 1024  # 100KB
+            },
+            '/translate/stream': {
+                'requests_per_minute': 10,
+                'requests_per_hour': 150,
+                'burst_size': 3,
+                'token_refresh_rate': 0.167,
+                'max_request_size': 100 * 1024  # 100KB
+            },
+            '/speak': {
+                'requests_per_minute': 15,
+                'requests_per_hour': 200,
+                'burst_size': 3,
+                'token_refresh_rate': 0.25,  # 1 token per 4 seconds
+                'max_request_size': 50 * 1024  # 50KB
+            }
+        }
+        
+        # IP-based blocking
+        self.blocked_ips = set()
+        self.temp_blocked_ips = {}  # IP -> unblock_time
+        
+        # Global limits
+        self.global_limits = {
+            'total_requests_per_minute': 1000,
+            'total_requests_per_hour': 10000,
+            'concurrent_requests': 50
+        }
+        self.global_requests = deque(maxlen=10000)
+        self.concurrent_requests = 0
+        
+    def get_client_id(self, request):
+        """Get unique client identifier"""
+        # Use IP address + user agent for better identification
+        ip = request.remote_addr or 'unknown'
+        user_agent = request.headers.get('User-Agent', '')
+        
+        # Handle proxied requests
+        forwarded_for = request.headers.get('X-Forwarded-For')
+        if forwarded_for:
+            ip = forwarded_for.split(',')[0].strip()
+        
+        # Create unique identifier
+        identifier = f"{ip}:{user_agent}"
+        return hashlib.md5(identifier.encode()).hexdigest()
+    
+    def get_limits(self, endpoint):
+        """Get rate limits for endpoint"""
+        return self.endpoint_limits.get(endpoint, self.default_limits)
+    
+    def is_ip_blocked(self, ip):
+        """Check if IP is blocked"""
+        # Check permanent blocks
+        if ip in self.blocked_ips:
+            return True
+        
+        # Check temporary blocks
+        if ip in self.temp_blocked_ips:
+            if time.time() < self.temp_blocked_ips[ip]:
+                return True
+            else:
+                # Unblock if time expired
+                del self.temp_blocked_ips[ip]
+        
+        return False
+    
+    def block_ip_temporarily(self, ip, duration=3600):
+        """Block IP temporarily (default 1 hour)"""
+        self.temp_blocked_ips[ip] = time.time() + duration
+        logger.warning(f"IP {ip} temporarily blocked for {duration} seconds")
+    
+    def check_global_limits(self):
+        """Check global rate limits"""
+        now = time.time()
+        
+        # Clean old requests
+        minute_ago = now - 60
+        hour_ago = now - 3600
+        
+        self.global_requests = deque(
+            (t for t in self.global_requests if t > hour_ago),
+            maxlen=10000
+        )
+        
+        # Count requests
+        requests_last_minute = sum(1 for t in self.global_requests if t > minute_ago)
+        requests_last_hour = len(self.global_requests)
+        
+        # Check limits
+        if requests_last_minute >= self.global_limits['total_requests_per_minute']:
+            return False, "Global rate limit exceeded (per minute)"
+        
+        if requests_last_hour >= self.global_limits['total_requests_per_hour']:
+            return False, "Global rate limit exceeded (per hour)"
+        
+        if self.concurrent_requests >= self.global_limits['concurrent_requests']:
+            return False, "Too many concurrent requests"
+        
+        return True, None
+    
+    def check_rate_limit(self, client_id, endpoint, request_size=0):
+        """Check if request should be allowed"""
+        with self.lock:
+            # Check global limits first
+            global_ok, global_msg = self.check_global_limits()
+            if not global_ok:
+                return False, global_msg, None
+            
+            # Get limits for endpoint
+            limits = self.get_limits(endpoint)
+            
+            # Check request size if applicable
+            if request_size > 0 and 'max_request_size' in limits:
+                if request_size > limits['max_request_size']:
+                    return False, "Request too large", None
+            
+            # Get or create bucket
+            bucket = self.buckets[client_id]
+            now = time.time()
+            
+            # Update tokens based on time passed
+            time_passed = now - bucket['last_update']
+            new_tokens = time_passed * limits['token_refresh_rate']
+            bucket['tokens'] = min(
+                limits['burst_size'],
+                bucket['tokens'] + new_tokens
+            )
+            bucket['last_update'] = now
+            
+            # Clean old requests from sliding window
+            minute_ago = now - 60
+            hour_ago = now - 3600
+            bucket['requests'] = deque(
+                (t for t in bucket['requests'] if t > hour_ago),
+                maxlen=1000
+            )
+            
+            # Count requests in windows
+            requests_last_minute = sum(1 for t in bucket['requests'] if t > minute_ago)
+            requests_last_hour = len(bucket['requests'])
+            
+            # Check sliding window limits
+            if requests_last_minute >= limits['requests_per_minute']:
+                return False, "Rate limit exceeded (per minute)", {
+                    'retry_after': 60,
+                    'limit': limits['requests_per_minute'],
+                    'remaining': 0,
+                    'reset': int(minute_ago + 60)
+                }
+            
+            if requests_last_hour >= limits['requests_per_hour']:
+                return False, "Rate limit exceeded (per hour)", {
+                    'retry_after': 3600,
+                    'limit': limits['requests_per_hour'],
+                    'remaining': 0,
+                    'reset': int(hour_ago + 3600)
+                }
+            
+            # Check token bucket
+            if bucket['tokens'] < 1:
+                retry_after = int(1 / limits['token_refresh_rate'])
+                return False, "Rate limit exceeded (burst)", {
+                    'retry_after': retry_after,
+                    'limit': limits['burst_size'],
+                    'remaining': 0,
+                    'reset': int(now + retry_after)
+                }
+            
+            # Request allowed - consume token and record
+            bucket['tokens'] -= 1
+            bucket['requests'].append(now)
+            self.global_requests.append(now)
+            
+            # Calculate remaining
+            remaining_minute = limits['requests_per_minute'] - requests_last_minute - 1
+            remaining_hour = limits['requests_per_hour'] - requests_last_hour - 1
+            
+            return True, None, {
+                'limit': limits['requests_per_minute'],
+                'remaining': remaining_minute,
+                'reset': int(minute_ago + 60)
+            }
+    
+    def increment_concurrent(self):
+        """Increment concurrent request counter"""
+        with self.lock:
+            self.concurrent_requests += 1
+    
+    def decrement_concurrent(self):
+        """Decrement concurrent request counter"""
+        with self.lock:
+            self.concurrent_requests = max(0, self.concurrent_requests - 1)
+    
+    def get_client_stats(self, client_id):
+        """Get statistics for a client"""
+        with self.lock:
+            if client_id not in self.buckets:
+                return None
+            
+            bucket = self.buckets[client_id]
+            now = time.time()
+            minute_ago = now - 60
+            hour_ago = now - 3600
+            
+            requests_last_minute = sum(1 for t in bucket['requests'] if t > minute_ago)
+            requests_last_hour = len([t for t in bucket['requests'] if t > hour_ago])
+            
+            return {
+                'requests_last_minute': requests_last_minute,
+                'requests_last_hour': requests_last_hour,
+                'tokens_available': bucket['tokens'],
+                'last_request': bucket['last_update']
+            }
+    
+    def cleanup_old_buckets(self, max_age=86400):
+        """Clean up old unused buckets (default 24 hours)"""
+        with self.lock:
+            now = time.time()
+            to_remove = []
+            
+            for client_id, bucket in self.buckets.items():
+                if now - bucket['last_update'] > max_age:
+                    to_remove.append(client_id)
+            
+            for client_id in to_remove:
+                del self.buckets[client_id]
+            
+            if to_remove:
+                logger.info(f"Cleaned up {len(to_remove)} old rate limit buckets")
+
+# Global rate limiter instance
+rate_limiter = RateLimiter()
+
+def rate_limit(endpoint=None, 
+               requests_per_minute=None,
+               requests_per_hour=None,
+               burst_size=None,
+               check_size=False):
+    """
+    Rate limiting decorator for Flask routes
+    
+    Usage:
+        @app.route('/api/endpoint')
+        @rate_limit(requests_per_minute=10, check_size=True)
+        def endpoint():
+            return jsonify({'status': 'ok'})
+    """
+    def decorator(f):
+        @wraps(f)
+        def decorated_function(*args, **kwargs):
+            # Get client ID
+            client_id = rate_limiter.get_client_id(request)
+            ip = request.remote_addr
+            
+            # Check if IP is blocked
+            if rate_limiter.is_ip_blocked(ip):
+                return jsonify({
+                    'error': 'IP temporarily blocked due to excessive requests'
+                }), 429
+            
+            # Get endpoint
+            endpoint_path = endpoint or request.endpoint
+            
+            # Override default limits if specified
+            if any([requests_per_minute, requests_per_hour, burst_size]):
+                limits = rate_limiter.get_limits(endpoint_path).copy()
+                if requests_per_minute:
+                    limits['requests_per_minute'] = requests_per_minute
+                if requests_per_hour:
+                    limits['requests_per_hour'] = requests_per_hour
+                if burst_size:
+                    limits['burst_size'] = burst_size
+                rate_limiter.endpoint_limits[endpoint_path] = limits
+            
+            # Check request size if needed
+            request_size = 0
+            if check_size:
+                request_size = request.content_length or 0
+            
+            # Check rate limit
+            allowed, message, headers = rate_limiter.check_rate_limit(
+                client_id, endpoint_path, request_size
+            )
+            
+            if not allowed:
+                # Log excessive requests
+                logger.warning(f"Rate limit exceeded for {client_id} on {endpoint_path}: {message}")
+                
+                # Check if we should temporarily block this IP
+                stats = rate_limiter.get_client_stats(client_id)
+                if stats and stats['requests_last_minute'] > 100:
+                    rate_limiter.block_ip_temporarily(ip, 3600)  # 1 hour block
+                
+                response = jsonify({
+                    'error': message,
+                    'retry_after': headers.get('retry_after') if headers else 60
+                })
+                response.status_code = 429
+                
+                # Add rate limit headers
+                if headers:
+                    response.headers['X-RateLimit-Limit'] = str(headers['limit'])
+                    response.headers['X-RateLimit-Remaining'] = str(headers['remaining'])
+                    response.headers['X-RateLimit-Reset'] = str(headers['reset'])
+                    response.headers['Retry-After'] = str(headers['retry_after'])
+                
+                return response
+            
+            # Track concurrent requests
+            rate_limiter.increment_concurrent()
+            
+            try:
+                # Add rate limit info to response
+                g.rate_limit_headers = headers
+                response = f(*args, **kwargs)
+                
+                # Add headers to successful response
+                if headers and hasattr(response, 'headers'):
+                    response.headers['X-RateLimit-Limit'] = str(headers['limit'])
+                    response.headers['X-RateLimit-Remaining'] = str(headers['remaining'])
+                    response.headers['X-RateLimit-Reset'] = str(headers['reset'])
+                
+                return response
+            finally:
+                rate_limiter.decrement_concurrent()
+        
+        return decorated_function
+    return decorator
+
+def cleanup_rate_limiter():
+    """Cleanup function to be called periodically"""
+    rate_limiter.cleanup_old_buckets()
+
+# IP whitelist/blacklist management
+class IPFilter:
+    def __init__(self):
+        self.whitelist = set()
+        self.blacklist = set()
+        
+    def add_to_whitelist(self, ip):
+        self.whitelist.add(ip)
+        self.blacklist.discard(ip)
+        
+    def add_to_blacklist(self, ip):
+        self.blacklist.add(ip)
+        self.whitelist.discard(ip)
+        
+    def is_allowed(self, ip):
+        if ip in self.blacklist:
+            return False
+        if self.whitelist and ip not in self.whitelist:
+            return False
+        return True
+
+ip_filter = IPFilter()
+
+def ip_filter_check():
+    """Middleware to check IP filtering"""
+    ip = request.remote_addr
+    if not ip_filter.is_allowed(ip):
+        return jsonify({'error': 'Access denied'}), 403
--- a/request_size_limiter.py
+++ b/request_size_limiter.py
@@ -0,0 +1,302 @@
+# Request size limiting middleware for preventing memory exhaustion
+import logging
+from functools import wraps
+from flask import request, jsonify, current_app
+import os
+
+logger = logging.getLogger(__name__)
+
+# Default size limits (in bytes)
+DEFAULT_LIMITS = {
+    'max_content_length': 50 * 1024 * 1024,  # 50MB global max
+    'max_audio_size': 25 * 1024 * 1024,      # 25MB for audio files
+    'max_json_size': 1 * 1024 * 1024,        # 1MB for JSON payloads
+    'max_image_size': 10 * 1024 * 1024,      # 10MB for images
+    'max_chunk_size': 1 * 1024 * 1024,       # 1MB chunks for streaming
+}
+
+# File extension to MIME type mapping
+AUDIO_EXTENSIONS = {'.wav', '.mp3', '.ogg', '.webm', '.m4a', '.flac', '.aac'}
+IMAGE_EXTENSIONS = {'.jpg', '.jpeg', '.png', '.gif', '.webp', '.bmp'}
+
+class RequestSizeLimiter:
+    """
+    Middleware to enforce request size limits and prevent memory exhaustion
+    """
+    def __init__(self, app=None, config=None):
+        self.config = config or {}
+        self.limits = {**DEFAULT_LIMITS, **self.config}
+        
+        if app:
+            self.init_app(app)
+    
+    def init_app(self, app):
+        """Initialize the Flask application with size limiting"""
+        # Set Flask's MAX_CONTENT_LENGTH
+        app.config['MAX_CONTENT_LENGTH'] = self.limits['max_content_length']
+        
+        # Store limiter in app
+        app.request_size_limiter = self
+        
+        # Add before_request handler
+        app.before_request(self.check_request_size)
+        
+        # Add error handler for 413 Request Entity Too Large
+        app.register_error_handler(413, self.handle_413)
+        
+        logger.info(f"Request size limiter initialized with max content length: {self.limits['max_content_length'] / 1024 / 1024:.1f}MB")
+    
+    def check_request_size(self):
+        """Check request size before processing"""
+        # Skip size check for GET, HEAD, OPTIONS
+        if request.method in ('GET', 'HEAD', 'OPTIONS'):
+            return None
+        
+        # Get content length
+        content_length = request.content_length
+        
+        if content_length is None:
+            # No content-length header, check for chunked encoding
+            if request.headers.get('Transfer-Encoding') == 'chunked':
+                logger.warning(f"Chunked request from {request.remote_addr} to {request.endpoint}")
+                # For chunked requests, we'll need to monitor the stream
+                return None
+            else:
+                # No content, allow it
+                return None
+        
+        # Check against global limit
+        if content_length > self.limits['max_content_length']:
+            logger.warning(f"Request from {request.remote_addr} exceeds global limit: {content_length} bytes")
+            return jsonify({
+                'error': 'Request too large',
+                'max_size': self.limits['max_content_length'],
+                'your_size': content_length
+            }), 413
+        
+        # Check endpoint-specific limits
+        endpoint = request.endpoint
+        if endpoint:
+            endpoint_limit = self.get_endpoint_limit(endpoint)
+            if endpoint_limit and content_length > endpoint_limit:
+                logger.warning(f"Request from {request.remote_addr} to {endpoint} exceeds endpoint limit: {content_length} bytes")
+                return jsonify({
+                    'error': f'Request too large for {endpoint}',
+                    'max_size': endpoint_limit,
+                    'your_size': content_length
+                }), 413
+        
+        # Check file-specific limits
+        if request.files:
+            for file_key, file_obj in request.files.items():
+                # Check file size
+                file_obj.seek(0, os.SEEK_END)
+                file_size = file_obj.tell()
+                file_obj.seek(0)  # Reset position
+                
+                # Determine file type
+                filename = file_obj.filename or ''
+                file_ext = os.path.splitext(filename)[1].lower()
+                
+                # Apply type-specific limits
+                if file_ext in AUDIO_EXTENSIONS:
+                    max_size = self.limits.get('max_audio_size', self.limits['max_content_length'])
+                    if file_size > max_size:
+                        logger.warning(f"Audio file from {request.remote_addr} exceeds limit: {file_size} bytes")
+                        return jsonify({
+                            'error': 'Audio file too large',
+                            'max_size': max_size,
+                            'your_size': file_size,
+                            'max_size_mb': round(max_size / 1024 / 1024, 1)
+                        }), 413
+                
+                elif file_ext in IMAGE_EXTENSIONS:
+                    max_size = self.limits.get('max_image_size', self.limits['max_content_length'])
+                    if file_size > max_size:
+                        logger.warning(f"Image file from {request.remote_addr} exceeds limit: {file_size} bytes")
+                        return jsonify({
+                            'error': 'Image file too large',
+                            'max_size': max_size,
+                            'your_size': file_size,
+                            'max_size_mb': round(max_size / 1024 / 1024, 1)
+                        }), 413
+        
+        # Check JSON payload size
+        if request.is_json:
+            try:
+                # Get raw data size
+                data_size = len(request.get_data())
+                max_json = self.limits.get('max_json_size', self.limits['max_content_length'])
+                
+                if data_size > max_json:
+                    logger.warning(f"JSON payload from {request.remote_addr} exceeds limit: {data_size} bytes")
+                    return jsonify({
+                        'error': 'JSON payload too large',
+                        'max_size': max_json,
+                        'your_size': data_size,
+                        'max_size_kb': round(max_json / 1024, 1)
+                    }), 413
+            except Exception as e:
+                logger.error(f"Error checking JSON size: {e}")
+        
+        return None
+    
+    def get_endpoint_limit(self, endpoint):
+        """Get size limit for specific endpoint"""
+        endpoint_limits = {
+            'transcribe': self.limits.get('max_audio_size', 25 * 1024 * 1024),
+            'speak': self.limits.get('max_json_size', 1 * 1024 * 1024),
+            'translate': self.limits.get('max_json_size', 1 * 1024 * 1024),
+            'translate_stream': self.limits.get('max_json_size', 1 * 1024 * 1024),
+        }
+        return endpoint_limits.get(endpoint)
+    
+    def handle_413(self, error):
+        """Handle 413 Request Entity Too Large errors"""
+        logger.warning(f"413 error from {request.remote_addr}: {error}")
+        return jsonify({
+            'error': 'Request entity too large',
+            'message': 'The request payload is too large. Please reduce the size and try again.',
+            'max_size': self.limits['max_content_length'],
+            'max_size_mb': round(self.limits['max_content_length'] / 1024 / 1024, 1)
+        }), 413
+    
+    def update_limits(self, **kwargs):
+        """Update size limits dynamically"""
+        old_limits = self.limits.copy()
+        self.limits.update(kwargs)
+        
+        # Update Flask's MAX_CONTENT_LENGTH if changed
+        if 'max_content_length' in kwargs and current_app:
+            current_app.config['MAX_CONTENT_LENGTH'] = kwargs['max_content_length']
+        
+        logger.info(f"Updated size limits: {kwargs}")
+        return old_limits
+
+def limit_request_size(**limit_kwargs):
+    """
+    Decorator to apply custom size limits to specific routes
+    
+    Usage:
+        @app.route('/upload')
+        @limit_request_size(max_size=10*1024*1024)  # 10MB limit
+        def upload():
+            ...
+    """
+    def decorator(f):
+        @wraps(f)
+        def wrapper(*args, **kwargs):
+            # Check content length
+            content_length = request.content_length
+            max_size = limit_kwargs.get('max_size', DEFAULT_LIMITS['max_content_length'])
+            
+            if content_length and content_length > max_size:
+                logger.warning(f"Request to {request.endpoint} exceeds custom limit: {content_length} bytes")
+                return jsonify({
+                    'error': 'Request too large',
+                    'max_size': max_size,
+                    'your_size': content_length,
+                    'max_size_mb': round(max_size / 1024 / 1024, 1)
+                }), 413
+            
+            # Check specific file types if specified
+            if 'max_audio_size' in limit_kwargs and request.files:
+                for file_obj in request.files.values():
+                    if file_obj.filename:
+                        ext = os.path.splitext(file_obj.filename)[1].lower()
+                        if ext in AUDIO_EXTENSIONS:
+                            file_obj.seek(0, os.SEEK_END)
+                            file_size = file_obj.tell()
+                            file_obj.seek(0)
+                            
+                            if file_size > limit_kwargs['max_audio_size']:
+                                return jsonify({
+                                    'error': 'Audio file too large',
+                                    'max_size': limit_kwargs['max_audio_size'],
+                                    'your_size': file_size,
+                                    'max_size_mb': round(limit_kwargs['max_audio_size'] / 1024 / 1024, 1)
+                                }), 413
+            
+            return f(*args, **kwargs)
+        return wrapper
+    return decorator
+
+class StreamSizeLimiter:
+    """
+    Helper class to limit streaming request sizes
+    """
+    def __init__(self, stream, max_size):
+        self.stream = stream
+        self.max_size = max_size
+        self.bytes_read = 0
+    
+    def read(self, size=-1):
+        """Read from stream with size limit enforcement"""
+        if size == -1:
+            # Read all remaining, but respect limit
+            size = self.max_size - self.bytes_read
+        
+        # Check if we would exceed limit
+        if self.bytes_read + size > self.max_size:
+            raise ValueError(f"Stream size exceeds limit of {self.max_size} bytes")
+        
+        data = self.stream.read(size)
+        self.bytes_read += len(data)
+        
+        return data
+    
+    def readline(self, size=-1):
+        """Read line from stream with size limit enforcement"""
+        if size == -1:
+            size = self.max_size - self.bytes_read
+        
+        if self.bytes_read + size > self.max_size:
+            raise ValueError(f"Stream size exceeds limit of {self.max_size} bytes")
+        
+        line = self.stream.readline(size)
+        self.bytes_read += len(line)
+        
+        return line
+
+# Utility functions
+def get_request_size():
+    """Get the size of the current request"""
+    if request.content_length:
+        return request.content_length
+    
+    # For chunked requests, read and measure
+    try:
+        data = request.get_data()
+        return len(data)
+    except Exception:
+        return 0
+
+def format_size(size_bytes):
+    """Format size in human-readable format"""
+    for unit in ['B', 'KB', 'MB', 'GB']:
+        if size_bytes < 1024.0:
+            return f"{size_bytes:.1f} {unit}"
+        size_bytes /= 1024.0
+    return f"{size_bytes:.1f} TB"
+
+# Configuration helper
+def configure_size_limits(app, **kwargs):
+    """
+    Configure size limits for the application
+    
+    Args:
+        app: Flask application
+        max_content_length: Global maximum request size
+        max_audio_size: Maximum audio file size
+        max_json_size: Maximum JSON payload size
+        max_image_size: Maximum image file size
+    """
+    config = {
+        'max_content_length': kwargs.get('max_content_length', DEFAULT_LIMITS['max_content_length']),
+        'max_audio_size': kwargs.get('max_audio_size', DEFAULT_LIMITS['max_audio_size']),
+        'max_json_size': kwargs.get('max_json_size', DEFAULT_LIMITS['max_json_size']),
+        'max_image_size': kwargs.get('max_image_size', DEFAULT_LIMITS['max_image_size']),
+    }
+    
+    limiter = RequestSizeLimiter(app, config)
+    return limiter
--- a/requirements-prod.txt
+++ b/requirements-prod.txt
@@ -0,0 +1,27 @@
+# Production requirements for Talk2Me
+# Includes base requirements plus production WSGI server
+
+# Include base requirements
+-r requirements.txt
+
+# Production WSGI server
+gunicorn==21.2.0
+
+# Async workers (optional, for better concurrency)
+gevent==23.9.1
+greenlet==3.0.1
+
+# Production monitoring
+prometheus-client==0.19.0
+
+# Production caching (optional)
+redis==5.0.1
+hiredis==2.3.2
+
+# Database for production (optional, for session storage)
+psycopg2-binary==2.9.9
+SQLAlchemy==2.0.23
+
+# Additional production utilities
+python-json-logger==2.0.7  # JSON logging
+sentry-sdk[flask]==1.39.1  # Error tracking (optional)
--- a/requirements.txt
+++ b/requirements.txt
@@ -1,5 +1,12 @@
 flask
+flask-cors
 requests
 openai-whisper
 torch
 ollama
+pywebpush
+cryptography
+python-dotenv
+click
+colorlog
+psutil
--- a/secrets_manager.py
+++ b/secrets_manager.py
@@ -0,0 +1,411 @@
+# Secrets management system for secure configuration
+import os
+import json
+import base64
+import logging
+from typing import Any, Dict, Optional, List
+from datetime import datetime, timedelta
+from cryptography.fernet import Fernet
+from cryptography.hazmat.primitives import hashes
+from cryptography.hazmat.primitives.kdf.pbkdf2 import PBKDF2HMAC
+import hashlib
+import hmac
+import secrets
+from functools import lru_cache
+from threading import Lock
+
+logger = logging.getLogger(__name__)
+
+class SecretsManager:
+    """
+    Secure secrets management with encryption, rotation, and audit logging
+    """
+    def __init__(self, config_file: str = None):
+        self.config_file = config_file or os.environ.get('SECRETS_CONFIG', '.secrets.json')
+        self.lock = Lock()
+        self._secrets_cache = {}
+        self._encryption_key = None
+        self._master_key = None
+        self._audit_log = []
+        self._rotation_schedule = {}
+        self._validators = {}
+        
+        # Initialize encryption
+        self._init_encryption()
+        
+        # Load secrets
+        self._load_secrets()
+        
+    def _init_encryption(self):
+        """Initialize encryption key from environment or generate new one"""
+        # Try to get master key from environment
+        master_key = os.environ.get('MASTER_KEY')
+        
+        if not master_key:
+            # Try to load from secure file
+            key_file = os.environ.get('MASTER_KEY_FILE', '.master_key')
+            if os.path.exists(key_file):
+                try:
+                    with open(key_file, 'rb') as f:
+                        master_key = f.read().decode('utf-8').strip()
+                except Exception as e:
+                    logger.error(f"Failed to load master key from file: {e}")
+            
+        if not master_key:
+            # Generate new master key
+            logger.warning("No master key found. Generating new one.")
+            master_key = Fernet.generate_key().decode('utf-8')
+            
+            # Save to secure file (should be protected by OS permissions)
+            key_file = os.environ.get('MASTER_KEY_FILE', '.master_key')
+            try:
+                with open(key_file, 'wb') as f:
+                    f.write(master_key.encode('utf-8'))
+                os.chmod(key_file, 0o600)  # Owner read/write only
+                logger.info(f"Master key saved to {key_file}")
+            except Exception as e:
+                logger.error(f"Failed to save master key: {e}")
+        
+        self._master_key = master_key.encode('utf-8')
+        
+        # Derive encryption key from master key
+        kdf = PBKDF2HMAC(
+            algorithm=hashes.SHA256(),
+            length=32,
+            salt=b'talk2me-secrets-salt',  # In production, use random salt
+            iterations=100000,
+        )
+        key = base64.urlsafe_b64encode(kdf.derive(self._master_key))
+        self._encryption_key = Fernet(key)
+    
+    def _load_secrets(self):
+        """Load encrypted secrets from file"""
+        if not os.path.exists(self.config_file):
+            logger.info(f"No secrets file found at {self.config_file}")
+            return
+        
+        try:
+            with open(self.config_file, 'r') as f:
+                data = json.load(f)
+            
+            # Decrypt secrets
+            for key, value in data.get('secrets', {}).items():
+                if isinstance(value, dict) and 'encrypted' in value:
+                    try:
+                        decrypted = self._decrypt(value['encrypted'])
+                        self._secrets_cache[key] = {
+                            'value': decrypted,
+                            'created': value.get('created'),
+                            'rotated': value.get('rotated'),
+                            'metadata': value.get('metadata', {})
+                        }
+                    except Exception as e:
+                        logger.error(f"Failed to decrypt secret {key}: {e}")
+                else:
+                    # Plain text (for migration)
+                    self._secrets_cache[key] = {
+                        'value': value,
+                        'created': datetime.now().isoformat(),
+                        'rotated': None,
+                        'metadata': {}
+                    }
+            
+            # Load rotation schedule
+            self._rotation_schedule = data.get('rotation_schedule', {})
+            
+            # Load audit log
+            self._audit_log = data.get('audit_log', [])
+            
+            logger.info(f"Loaded {len(self._secrets_cache)} secrets")
+            
+        except Exception as e:
+            logger.error(f"Failed to load secrets: {e}")
+    
+    def _save_secrets(self):
+        """Save encrypted secrets to file"""
+        with self.lock:
+            data = {
+                'secrets': {},
+                'rotation_schedule': self._rotation_schedule,
+                'audit_log': self._audit_log[-1000:]  # Keep last 1000 entries
+            }
+            
+            # Encrypt secrets
+            for key, secret_data in self._secrets_cache.items():
+                data['secrets'][key] = {
+                    'encrypted': self._encrypt(secret_data['value']),
+                    'created': secret_data.get('created'),
+                    'rotated': secret_data.get('rotated'),
+                    'metadata': secret_data.get('metadata', {})
+                }
+            
+            # Save to file
+            try:
+                # Write to temporary file first
+                temp_file = f"{self.config_file}.tmp"
+                with open(temp_file, 'w') as f:
+                    json.dump(data, f, indent=2)
+                
+                # Set secure permissions
+                os.chmod(temp_file, 0o600)  # Owner read/write only
+                
+                # Atomic rename
+                os.rename(temp_file, self.config_file)
+                
+                logger.info(f"Saved {len(self._secrets_cache)} secrets")
+            except Exception as e:
+                logger.error(f"Failed to save secrets: {e}")
+                raise
+    
+    def _encrypt(self, value: str) -> str:
+        """Encrypt a value"""
+        if not isinstance(value, str):
+            value = str(value)
+        return self._encryption_key.encrypt(value.encode('utf-8')).decode('utf-8')
+    
+    def _decrypt(self, encrypted_value: str) -> str:
+        """Decrypt a value"""
+        return self._encryption_key.decrypt(encrypted_value.encode('utf-8')).decode('utf-8')
+    
+    def _audit(self, action: str, key: str, user: str = None, details: dict = None):
+        """Add entry to audit log"""
+        entry = {
+            'timestamp': datetime.now().isoformat(),
+            'action': action,
+            'key': key,
+            'user': user or 'system',
+            'details': details or {}
+        }
+        self._audit_log.append(entry)
+        logger.info(f"Audit: {action} on {key} by {user or 'system'}")
+    
+    def get(self, key: str, default: Any = None) -> Any:
+        """Get a secret value"""
+        # Try cache first
+        if key in self._secrets_cache:
+            self._audit('access', key)
+            return self._secrets_cache[key]['value']
+        
+        # Try environment variable
+        env_key = f"SECRET_{key.upper()}"
+        env_value = os.environ.get(env_key)
+        if env_value:
+            self._audit('access', key, details={'source': 'environment'})
+            return env_value
+        
+        # Try regular environment variable
+        env_value = os.environ.get(key)
+        if env_value:
+            self._audit('access', key, details={'source': 'environment'})
+            return env_value
+        
+        self._audit('access_failed', key)
+        return default
+    
+    def set(self, key: str, value: str, metadata: dict = None, user: str = None):
+        """Set a secret value"""
+        with self.lock:
+            old_value = self._secrets_cache.get(key, {}).get('value')
+            
+            self._secrets_cache[key] = {
+                'value': value,
+                'created': self._secrets_cache.get(key, {}).get('created', datetime.now().isoformat()),
+                'rotated': datetime.now().isoformat() if old_value else None,
+                'metadata': metadata or {}
+            }
+            
+            self._audit('set' if not old_value else 'update', key, user)
+            self._save_secrets()
+    
+    def delete(self, key: str, user: str = None):
+        """Delete a secret"""
+        with self.lock:
+            if key in self._secrets_cache:
+                del self._secrets_cache[key]
+                self._audit('delete', key, user)
+                self._save_secrets()
+                return True
+            return False
+    
+    def rotate(self, key: str, new_value: str = None, user: str = None):
+        """Rotate a secret"""
+        with self.lock:
+            if key not in self._secrets_cache:
+                raise KeyError(f"Secret {key} not found")
+            
+            old_value = self._secrets_cache[key]['value']
+            
+            # Generate new value if not provided
+            if not new_value:
+                if key.endswith('_KEY') or key.endswith('_TOKEN'):
+                    new_value = secrets.token_urlsafe(32)
+                elif key.endswith('_PASSWORD'):
+                    new_value = secrets.token_urlsafe(24)
+                else:
+                    raise ValueError(f"Cannot auto-generate value for {key}")
+            
+            # Update secret
+            self._secrets_cache[key]['value'] = new_value
+            self._secrets_cache[key]['rotated'] = datetime.now().isoformat()
+            
+            self._audit('rotate', key, user, {'generated': new_value is None})
+            self._save_secrets()
+            
+            return old_value, new_value
+    
+    def list_secrets(self) -> List[Dict[str, Any]]:
+        """List all secrets (without values)"""
+        secrets_list = []
+        for key, data in self._secrets_cache.items():
+            secrets_list.append({
+                'key': key,
+                'created': data.get('created'),
+                'rotated': data.get('rotated'),
+                'metadata': data.get('metadata', {}),
+                'has_value': bool(data.get('value'))
+            })
+        return secrets_list
+    
+    def add_validator(self, key: str, validator):
+        """Add a validator function for a secret"""
+        self._validators[key] = validator
+    
+    def validate(self, key: str, value: str) -> bool:
+        """Validate a secret value"""
+        if key in self._validators:
+            try:
+                return self._validators[key](value)
+            except Exception as e:
+                logger.error(f"Validation failed for {key}: {e}")
+                return False
+        return True
+    
+    def schedule_rotation(self, key: str, days: int):
+        """Schedule automatic rotation for a secret"""
+        self._rotation_schedule[key] = {
+            'days': days,
+            'last_rotated': self._secrets_cache.get(key, {}).get('rotated', datetime.now().isoformat())
+        }
+        self._save_secrets()
+    
+    def check_rotation_needed(self) -> List[str]:
+        """Check which secrets need rotation"""
+        needs_rotation = []
+        now = datetime.now()
+        
+        for key, schedule in self._rotation_schedule.items():
+            last_rotated = datetime.fromisoformat(schedule['last_rotated'])
+            if now - last_rotated > timedelta(days=schedule['days']):
+                needs_rotation.append(key)
+        
+        return needs_rotation
+    
+    def get_audit_log(self, key: str = None, limit: int = 100) -> List[Dict]:
+        """Get audit log entries"""
+        logs = self._audit_log
+        
+        if key:
+            logs = [log for log in logs if log['key'] == key]
+        
+        return logs[-limit:]
+    
+    def export_for_environment(self) -> Dict[str, str]:
+        """Export secrets as environment variables"""
+        env_vars = {}
+        for key, data in self._secrets_cache.items():
+            env_key = f"SECRET_{key.upper()}"
+            env_vars[env_key] = data['value']
+        return env_vars
+    
+    def verify_integrity(self) -> bool:
+        """Verify integrity of secrets"""
+        try:
+            # Try to decrypt all secrets
+            for key, secret_data in self._secrets_cache.items():
+                if 'value' in secret_data:
+                    # Re-encrypt and compare
+                    encrypted = self._encrypt(secret_data['value'])
+                    decrypted = self._decrypt(encrypted)
+                    if decrypted != secret_data['value']:
+                        logger.error(f"Integrity check failed for {key}")
+                        return False
+            
+            logger.info("Integrity check passed")
+            return True
+        except Exception as e:
+            logger.error(f"Integrity check failed: {e}")
+            return False
+
+# Global instance
+_secrets_manager = None
+_secrets_lock = Lock()
+
+def get_secrets_manager(config_file: str = None) -> SecretsManager:
+    """Get or create global secrets manager instance"""
+    global _secrets_manager
+    
+    with _secrets_lock:
+        if _secrets_manager is None:
+            _secrets_manager = SecretsManager(config_file)
+        return _secrets_manager
+
+def get_secret(key: str, default: Any = None) -> Any:
+    """Convenience function to get a secret"""
+    manager = get_secrets_manager()
+    return manager.get(key, default)
+
+def set_secret(key: str, value: str, metadata: dict = None):
+    """Convenience function to set a secret"""
+    manager = get_secrets_manager()
+    manager.set(key, value, metadata)
+
+# Flask integration
+def init_app(app):
+    """Initialize secrets management for Flask app"""
+    manager = get_secrets_manager()
+    
+    # Load secrets into app config
+    app.config['SECRET_KEY'] = manager.get('FLASK_SECRET_KEY') or app.config.get('SECRET_KEY')
+    app.config['TTS_API_KEY'] = manager.get('TTS_API_KEY') or app.config.get('TTS_API_KEY')
+    
+    # Add secret manager to app
+    app.secrets_manager = manager
+    
+    # Add CLI commands
+    @app.cli.command('secrets-list')
+    def list_secrets_cmd():
+        """List all secrets"""
+        secrets = manager.list_secrets()
+        for secret in secrets:
+            print(f"{secret['key']}: created={secret['created']}, rotated={secret['rotated']}")
+    
+    @app.cli.command('secrets-set')
+    def set_secret_cmd():
+        """Set a secret"""
+        import click
+        key = click.prompt('Secret key')
+        value = click.prompt('Secret value', hide_input=True)
+        manager.set(key, value, user='cli')
+        print(f"Secret {key} set successfully")
+    
+    @app.cli.command('secrets-rotate')
+    def rotate_secret_cmd():
+        """Rotate a secret"""
+        import click
+        key = click.prompt('Secret key to rotate')
+        old_value, new_value = manager.rotate(key, user='cli')
+        print(f"Secret {key} rotated successfully")
+        print(f"New value: {new_value}")
+    
+    @app.cli.command('secrets-check-rotation')
+    def check_rotation_cmd():
+        """Check which secrets need rotation"""
+        needs_rotation = manager.check_rotation_needed()
+        if needs_rotation:
+            print("Secrets needing rotation:")
+            for key in needs_rotation:
+                print(f"  - {key}")
+        else:
+            print("No secrets need rotation")
+    
+    logger.info("Secrets management initialized")
--- a/session_manager.py
+++ b/session_manager.py
@@ -0,0 +1,607 @@
+# Session management system for preventing resource leaks
+import time
+import uuid
+import logging
+from datetime import datetime, timedelta
+from typing import Dict, Any, Optional, List, Tuple
+from dataclasses import dataclass, field
+from threading import Lock, Thread
+import json
+import os
+import tempfile
+import shutil
+from collections import defaultdict
+from functools import wraps
+from flask import session, request, g, current_app
+
+logger = logging.getLogger(__name__)
+
+@dataclass
+class SessionResource:
+    """Represents a resource associated with a session"""
+    resource_id: str
+    resource_type: str  # 'audio_file', 'temp_file', 'websocket', 'stream'
+    path: Optional[str] = None
+    created_at: float = field(default_factory=time.time)
+    last_accessed: float = field(default_factory=time.time)
+    size_bytes: int = 0
+    metadata: Dict[str, Any] = field(default_factory=dict)
+
+@dataclass
+class UserSession:
+    """Represents a user session with associated resources"""
+    session_id: str
+    user_id: Optional[str] = None
+    ip_address: Optional[str] = None
+    user_agent: Optional[str] = None
+    created_at: float = field(default_factory=time.time)
+    last_activity: float = field(default_factory=time.time)
+    resources: Dict[str, SessionResource] = field(default_factory=dict)
+    request_count: int = 0
+    total_bytes_used: int = 0
+    active_streams: int = 0
+    metadata: Dict[str, Any] = field(default_factory=dict)
+
+class SessionManager:
+    """
+    Manages user sessions and associated resources to prevent leaks
+    """
+    def __init__(self, config: Dict[str, Any] = None):
+        self.config = config or {}
+        self.sessions: Dict[str, UserSession] = {}
+        self.lock = Lock()
+        
+        # Configuration
+        self.max_session_duration = self.config.get('max_session_duration', 3600)  # 1 hour
+        self.max_idle_time = self.config.get('max_idle_time', 900)  # 15 minutes
+        self.max_resources_per_session = self.config.get('max_resources_per_session', 100)
+        self.max_bytes_per_session = self.config.get('max_bytes_per_session', 100 * 1024 * 1024)  # 100MB
+        self.cleanup_interval = self.config.get('cleanup_interval', 60)  # 1 minute
+        self.session_storage_path = self.config.get('session_storage_path', 
+                                                   os.path.join(tempfile.gettempdir(), 'talk2me_sessions'))
+        
+        # Statistics
+        self.stats = {
+            'total_sessions_created': 0,
+            'total_sessions_cleaned': 0,
+            'total_resources_cleaned': 0,
+            'total_bytes_cleaned': 0,
+            'active_sessions': 0,
+            'active_resources': 0,
+            'active_bytes': 0
+        }
+        
+        # Resource cleanup handlers
+        self.cleanup_handlers = {
+            'audio_file': self._cleanup_audio_file,
+            'temp_file': self._cleanup_temp_file,
+            'websocket': self._cleanup_websocket,
+            'stream': self._cleanup_stream
+        }
+        
+        # Initialize storage
+        self._init_storage()
+        
+        # Start cleanup thread
+        self.cleanup_thread = Thread(target=self._cleanup_loop, daemon=True)
+        self.cleanup_thread.start()
+        
+        logger.info("Session manager initialized")
+    
+    def _init_storage(self):
+        """Initialize session storage directory"""
+        try:
+            os.makedirs(self.session_storage_path, mode=0o755, exist_ok=True)
+            logger.info(f"Session storage initialized at {self.session_storage_path}")
+        except Exception as e:
+            logger.error(f"Failed to create session storage: {e}")
+    
+    def create_session(self, session_id: str = None, user_id: str = None, 
+                      ip_address: str = None, user_agent: str = None) -> UserSession:
+        """Create a new session"""
+        with self.lock:
+            if not session_id:
+                session_id = str(uuid.uuid4())
+            
+            if session_id in self.sessions:
+                logger.warning(f"Session {session_id} already exists")
+                return self.sessions[session_id]
+            
+            session = UserSession(
+                session_id=session_id,
+                user_id=user_id,
+                ip_address=ip_address,
+                user_agent=user_agent
+            )
+            
+            self.sessions[session_id] = session
+            self.stats['total_sessions_created'] += 1
+            self.stats['active_sessions'] = len(self.sessions)
+            
+            # Create session directory
+            session_dir = os.path.join(self.session_storage_path, session_id)
+            try:
+                os.makedirs(session_dir, mode=0o755, exist_ok=True)
+            except Exception as e:
+                logger.error(f"Failed to create session directory: {e}")
+            
+            logger.info(f"Created session {session_id}")
+            return session
+    
+    def get_session(self, session_id: str) -> Optional[UserSession]:
+        """Get a session by ID"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if session:
+                session.last_activity = time.time()
+            return session
+    
+    def add_resource(self, session_id: str, resource_type: str, 
+                    resource_id: str = None, path: str = None, 
+                    size_bytes: int = 0, metadata: Dict[str, Any] = None) -> Optional[SessionResource]:
+        """Add a resource to a session"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if not session:
+                logger.warning(f"Session {session_id} not found")
+                return None
+            
+            # Check limits
+            if len(session.resources) >= self.max_resources_per_session:
+                logger.warning(f"Session {session_id} reached resource limit")
+                self._cleanup_oldest_resources(session, 1)
+            
+            if session.total_bytes_used + size_bytes > self.max_bytes_per_session:
+                logger.warning(f"Session {session_id} reached size limit")
+                bytes_to_free = (session.total_bytes_used + size_bytes) - self.max_bytes_per_session
+                self._cleanup_resources_by_size(session, bytes_to_free)
+            
+            # Create resource
+            if not resource_id:
+                resource_id = str(uuid.uuid4())
+            
+            resource = SessionResource(
+                resource_id=resource_id,
+                resource_type=resource_type,
+                path=path,
+                size_bytes=size_bytes,
+                metadata=metadata or {}
+            )
+            
+            session.resources[resource_id] = resource
+            session.total_bytes_used += size_bytes
+            session.last_activity = time.time()
+            
+            # Update stats
+            self.stats['active_resources'] += 1
+            self.stats['active_bytes'] += size_bytes
+            
+            logger.debug(f"Added {resource_type} resource {resource_id} to session {session_id}")
+            return resource
+    
+    def remove_resource(self, session_id: str, resource_id: str) -> bool:
+        """Remove a resource from a session"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if not session:
+                return False
+            
+            resource = session.resources.get(resource_id)
+            if not resource:
+                return False
+            
+            # Cleanup resource
+            self._cleanup_resource(resource)
+            
+            # Remove from session
+            del session.resources[resource_id]
+            session.total_bytes_used -= resource.size_bytes
+            
+            # Update stats
+            self.stats['active_resources'] -= 1
+            self.stats['active_bytes'] -= resource.size_bytes
+            self.stats['total_resources_cleaned'] += 1
+            self.stats['total_bytes_cleaned'] += resource.size_bytes
+            
+            logger.debug(f"Removed resource {resource_id} from session {session_id}")
+            return True
+    
+    def update_session_activity(self, session_id: str):
+        """Update session last activity time"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if session:
+                session.last_activity = time.time()
+                session.request_count += 1
+    
+    def cleanup_session(self, session_id: str) -> bool:
+        """Clean up a session and all its resources"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if not session:
+                return False
+            
+            # Cleanup all resources
+            for resource_id in list(session.resources.keys()):
+                self.remove_resource(session_id, resource_id)
+            
+            # Remove session directory
+            session_dir = os.path.join(self.session_storage_path, session_id)
+            try:
+                if os.path.exists(session_dir):
+                    shutil.rmtree(session_dir)
+            except Exception as e:
+                logger.error(f"Failed to remove session directory: {e}")
+            
+            # Remove session
+            del self.sessions[session_id]
+            
+            # Update stats
+            self.stats['active_sessions'] = len(self.sessions)
+            self.stats['total_sessions_cleaned'] += 1
+            
+            logger.info(f"Cleaned up session {session_id}")
+            return True
+    
+    def _cleanup_resource(self, resource: SessionResource):
+        """Clean up a single resource"""
+        handler = self.cleanup_handlers.get(resource.resource_type)
+        if handler:
+            try:
+                handler(resource)
+            except Exception as e:
+                logger.error(f"Failed to cleanup {resource.resource_type} {resource.resource_id}: {e}")
+    
+    def _cleanup_audio_file(self, resource: SessionResource):
+        """Clean up audio file resource"""
+        if resource.path and os.path.exists(resource.path):
+            try:
+                os.remove(resource.path)
+                logger.debug(f"Removed audio file {resource.path}")
+            except Exception as e:
+                logger.error(f"Failed to remove audio file {resource.path}: {e}")
+    
+    def _cleanup_temp_file(self, resource: SessionResource):
+        """Clean up temporary file resource"""
+        if resource.path and os.path.exists(resource.path):
+            try:
+                os.remove(resource.path)
+                logger.debug(f"Removed temp file {resource.path}")
+            except Exception as e:
+                logger.error(f"Failed to remove temp file {resource.path}: {e}")
+    
+    def _cleanup_websocket(self, resource: SessionResource):
+        """Clean up websocket resource"""
+        # Implement websocket cleanup if needed
+        pass
+    
+    def _cleanup_stream(self, resource: SessionResource):
+        """Clean up stream resource"""
+        # Implement stream cleanup if needed
+        if resource.metadata.get('stream_id'):
+            # Close any open streams
+            pass
+    
+    def _cleanup_oldest_resources(self, session: UserSession, count: int):
+        """Clean up oldest resources from a session"""
+        # Sort resources by creation time
+        sorted_resources = sorted(
+            session.resources.items(),
+            key=lambda x: x[1].created_at
+        )
+        
+        # Remove oldest resources
+        for resource_id, _ in sorted_resources[:count]:
+            self.remove_resource(session.session_id, resource_id)
+    
+    def _cleanup_resources_by_size(self, session: UserSession, bytes_to_free: int):
+        """Clean up resources to free up space"""
+        freed_bytes = 0
+        
+        # Sort resources by size (largest first)
+        sorted_resources = sorted(
+            session.resources.items(),
+            key=lambda x: x[1].size_bytes,
+            reverse=True
+        )
+        
+        # Remove resources until we've freed enough space
+        for resource_id, resource in sorted_resources:
+            if freed_bytes >= bytes_to_free:
+                break
+            
+            freed_bytes += resource.size_bytes
+            self.remove_resource(session.session_id, resource_id)
+    
+    def _cleanup_loop(self):
+        """Background cleanup thread"""
+        while True:
+            try:
+                time.sleep(self.cleanup_interval)
+                self.cleanup_expired_sessions()
+                self.cleanup_idle_sessions()
+                self.cleanup_orphaned_files()
+            except Exception as e:
+                logger.error(f"Error in cleanup loop: {e}")
+    
+    def cleanup_expired_sessions(self):
+        """Clean up sessions that have exceeded max duration"""
+        with self.lock:
+            now = time.time()
+            expired_sessions = []
+            
+            for session_id, session in self.sessions.items():
+                if now - session.created_at > self.max_session_duration:
+                    expired_sessions.append(session_id)
+            
+            for session_id in expired_sessions:
+                logger.info(f"Cleaning up expired session {session_id}")
+                self.cleanup_session(session_id)
+    
+    def cleanup_idle_sessions(self):
+        """Clean up sessions that have been idle too long"""
+        with self.lock:
+            now = time.time()
+            idle_sessions = []
+            
+            for session_id, session in self.sessions.items():
+                if now - session.last_activity > self.max_idle_time:
+                    idle_sessions.append(session_id)
+            
+            for session_id in idle_sessions:
+                logger.info(f"Cleaning up idle session {session_id}")
+                self.cleanup_session(session_id)
+    
+    def cleanup_orphaned_files(self):
+        """Clean up orphaned files in session storage"""
+        try:
+            if not os.path.exists(self.session_storage_path):
+                return
+            
+            # Get all session directories
+            session_dirs = set(os.listdir(self.session_storage_path))
+            
+            # Get active session IDs
+            with self.lock:
+                active_sessions = set(self.sessions.keys())
+            
+            # Find orphaned directories
+            orphaned_dirs = session_dirs - active_sessions
+            
+            # Clean up orphaned directories
+            for dir_name in orphaned_dirs:
+                dir_path = os.path.join(self.session_storage_path, dir_name)
+                if os.path.isdir(dir_path):
+                    try:
+                        shutil.rmtree(dir_path)
+                        logger.info(f"Cleaned up orphaned session directory {dir_name}")
+                    except Exception as e:
+                        logger.error(f"Failed to remove orphaned directory {dir_path}: {e}")
+        
+        except Exception as e:
+            logger.error(f"Error cleaning orphaned files: {e}")
+    
+    def get_session_info(self, session_id: str) -> Optional[Dict[str, Any]]:
+        """Get detailed information about a session"""
+        with self.lock:
+            session = self.sessions.get(session_id)
+            if not session:
+                return None
+            
+            return {
+                'session_id': session.session_id,
+                'user_id': session.user_id,
+                'ip_address': session.ip_address,
+                'created_at': datetime.fromtimestamp(session.created_at).isoformat(),
+                'last_activity': datetime.fromtimestamp(session.last_activity).isoformat(),
+                'duration_seconds': int(time.time() - session.created_at),
+                'idle_seconds': int(time.time() - session.last_activity),
+                'request_count': session.request_count,
+                'resource_count': len(session.resources),
+                'total_bytes_used': session.total_bytes_used,
+                'active_streams': session.active_streams,
+                'resources': [
+                    {
+                        'resource_id': r.resource_id,
+                        'resource_type': r.resource_type,
+                        'size_bytes': r.size_bytes,
+                        'created_at': datetime.fromtimestamp(r.created_at).isoformat(),
+                        'last_accessed': datetime.fromtimestamp(r.last_accessed).isoformat()
+                    }
+                    for r in session.resources.values()
+                ]
+            }
+    
+    def get_all_sessions_info(self) -> List[Dict[str, Any]]:
+        """Get information about all active sessions"""
+        with self.lock:
+            return [
+                self.get_session_info(session_id)
+                for session_id in self.sessions.keys()
+            ]
+    
+    def get_stats(self) -> Dict[str, Any]:
+        """Get session manager statistics"""
+        with self.lock:
+            return {
+                **self.stats,
+                'uptime_seconds': int(time.time() - self.stats.get('start_time', time.time())),
+                'avg_session_duration': self._calculate_avg_session_duration(),
+                'avg_resources_per_session': self._calculate_avg_resources_per_session(),
+                'total_storage_used': self._calculate_total_storage_used()
+            }
+    
+    def _calculate_avg_session_duration(self) -> float:
+        """Calculate average session duration"""
+        if not self.sessions:
+            return 0
+        
+        total_duration = sum(
+            time.time() - session.created_at
+            for session in self.sessions.values()
+        )
+        return total_duration / len(self.sessions)
+    
+    def _calculate_avg_resources_per_session(self) -> float:
+        """Calculate average resources per session"""
+        if not self.sessions:
+            return 0
+        
+        total_resources = sum(
+            len(session.resources)
+            for session in self.sessions.values()
+        )
+        return total_resources / len(self.sessions)
+    
+    def _calculate_total_storage_used(self) -> int:
+        """Calculate total storage used"""
+        total = 0
+        try:
+            for root, dirs, files in os.walk(self.session_storage_path):
+                for file in files:
+                    filepath = os.path.join(root, file)
+                    total += os.path.getsize(filepath)
+        except Exception as e:
+            logger.error(f"Error calculating storage used: {e}")
+        return total
+    
+    def export_metrics(self) -> Dict[str, Any]:
+        """Export metrics for monitoring"""
+        with self.lock:
+            return {
+                'sessions': {
+                    'active': self.stats['active_sessions'],
+                    'total_created': self.stats['total_sessions_created'],
+                    'total_cleaned': self.stats['total_sessions_cleaned']
+                },
+                'resources': {
+                    'active': self.stats['active_resources'],
+                    'total_cleaned': self.stats['total_resources_cleaned'],
+                    'active_bytes': self.stats['active_bytes'],
+                    'total_bytes_cleaned': self.stats['total_bytes_cleaned']
+                },
+                'limits': {
+                    'max_session_duration': self.max_session_duration,
+                    'max_idle_time': self.max_idle_time,
+                    'max_resources_per_session': self.max_resources_per_session,
+                    'max_bytes_per_session': self.max_bytes_per_session
+                }
+            }
+
+# Global session manager instance
+_session_manager = None
+_session_lock = Lock()
+
+def get_session_manager(config: Dict[str, Any] = None) -> SessionManager:
+    """Get or create global session manager instance"""
+    global _session_manager
+    
+    with _session_lock:
+        if _session_manager is None:
+            _session_manager = SessionManager(config)
+        return _session_manager
+
+# Flask integration
+def init_app(app):
+    """Initialize session management for Flask app"""
+    config = {
+        'max_session_duration': app.config.get('MAX_SESSION_DURATION', 3600),
+        'max_idle_time': app.config.get('MAX_SESSION_IDLE_TIME', 900),
+        'max_resources_per_session': app.config.get('MAX_RESOURCES_PER_SESSION', 100),
+        'max_bytes_per_session': app.config.get('MAX_BYTES_PER_SESSION', 100 * 1024 * 1024),
+        'cleanup_interval': app.config.get('SESSION_CLEANUP_INTERVAL', 60),
+        'session_storage_path': app.config.get('SESSION_STORAGE_PATH', 
+                                              os.path.join(app.config.get('UPLOAD_FOLDER', tempfile.gettempdir()), 'sessions'))
+    }
+    
+    manager = get_session_manager(config)
+    app.session_manager = manager
+    
+    # Add before_request handler
+    @app.before_request
+    def before_request_session():
+        # Get or create session
+        session_id = session.get('session_id')
+        if not session_id:
+            session_id = str(uuid.uuid4())
+            session['session_id'] = session_id
+            session.permanent = True
+        
+        # Get session from manager
+        user_session = manager.get_session(session_id)
+        if not user_session:
+            user_session = manager.create_session(
+                session_id=session_id,
+                ip_address=request.remote_addr,
+                user_agent=request.headers.get('User-Agent')
+            )
+        
+        # Update activity
+        manager.update_session_activity(session_id)
+        
+        # Store in g for request access
+        g.user_session = user_session
+        g.session_manager = manager
+    
+    # Add CLI commands
+    @app.cli.command('sessions-list')
+    def list_sessions_cmd():
+        """List all active sessions"""
+        sessions = manager.get_all_sessions_info()
+        for session_info in sessions:
+            print(f"\nSession: {session_info['session_id']}")
+            print(f"  Created: {session_info['created_at']}")
+            print(f"  Last activity: {session_info['last_activity']}")
+            print(f"  Resources: {session_info['resource_count']}")
+            print(f"  Bytes used: {session_info['total_bytes_used']}")
+    
+    @app.cli.command('sessions-cleanup')
+    def cleanup_sessions_cmd():
+        """Manual session cleanup"""
+        manager.cleanup_expired_sessions()
+        manager.cleanup_idle_sessions()
+        manager.cleanup_orphaned_files()
+        print("Session cleanup completed")
+    
+    @app.cli.command('sessions-stats')
+    def session_stats_cmd():
+        """Show session statistics"""
+        stats = manager.get_stats()
+        print(json.dumps(stats, indent=2))
+    
+    logger.info("Session management initialized")
+
+# Decorator for session resource tracking
+def track_resource(resource_type: str):
+    """Decorator to track resources for a session"""
+    def decorator(func):
+        @wraps(func)
+        def wrapper(*args, **kwargs):
+            result = func(*args, **kwargs)
+            
+            # Track resource if in request context
+            if hasattr(g, 'user_session') and hasattr(g, 'session_manager'):
+                if isinstance(result, (str, bytes)) or hasattr(result, 'filename'):
+                    # Determine path and size
+                    path = None
+                    size = 0
+                    
+                    if isinstance(result, str) and os.path.exists(result):
+                        path = result
+                        size = os.path.getsize(result)
+                    elif hasattr(result, 'filename'):
+                        path = result.filename
+                        if os.path.exists(path):
+                            size = os.path.getsize(path)
+                    
+                    # Add resource to session
+                    g.session_manager.add_resource(
+                        session_id=g.user_session.session_id,
+                        resource_type=resource_type,
+                        path=path,
+                        size_bytes=size
+                    )
+            
+            return result
+        return wrapper
+    return decorator
--- a/static/css/styles.css
+++ b/static/css/styles.css
@@ -0,0 +1,559 @@
+/* Main styles for Talk2Me application */
+
+/* Loading animations */
+.loading-dots {
+    display: inline-flex;
+    align-items: center;
+    gap: 4px;
+}
+
+.loading-dots span {
+    width: 8px;
+    height: 8px;
+    border-radius: 50%;
+    background-color: #007bff;
+    animation: dotPulse 1.4s infinite ease-in-out both;
+}
+
+.loading-dots span:nth-child(1) {
+    animation-delay: -0.32s;
+}
+
+.loading-dots span:nth-child(2) {
+    animation-delay: -0.16s;
+}
+
+@keyframes dotPulse {
+    0%, 80%, 100% {
+        transform: scale(0);
+        opacity: 0.5;
+    }
+    40% {
+        transform: scale(1);
+        opacity: 1;
+    }
+}
+
+/* Wave animation for recording */
+.recording-wave {
+    position: relative;
+    display: inline-block;
+    width: 40px;
+    height: 40px;
+}
+
+.recording-wave span {
+    position: absolute;
+    bottom: 0;
+    width: 4px;
+    height: 100%;
+    background: #fff;
+    border-radius: 2px;
+    animation: wave 1.2s linear infinite;
+}
+
+.recording-wave span:nth-child(1) {
+    left: 0;
+    animation-delay: 0s;
+}
+
+.recording-wave span:nth-child(2) {
+    left: 8px;
+    animation-delay: -1.1s;
+}
+
+.recording-wave span:nth-child(3) {
+    left: 16px;
+    animation-delay: -1s;
+}
+
+.recording-wave span:nth-child(4) {
+    left: 24px;
+    animation-delay: -0.9s;
+}
+
+.recording-wave span:nth-child(5) {
+    left: 32px;
+    animation-delay: -0.8s;
+}
+
+@keyframes wave {
+    0%, 40%, 100% {
+        transform: scaleY(0.4);
+    }
+    20% {
+        transform: scaleY(1);
+    }
+}
+
+/* Spinner animation */
+.spinner-custom {
+    width: 40px;
+    height: 40px;
+    position: relative;
+    display: inline-block;
+}
+
+.spinner-custom::before {
+    content: '';
+    position: absolute;
+    width: 100%;
+    height: 100%;
+    border-radius: 50%;
+    border: 3px solid rgba(0, 123, 255, 0.2);
+}
+
+.spinner-custom::after {
+    content: '';
+    position: absolute;
+    width: 100%;
+    height: 100%;
+    border-radius: 50%;
+    border: 3px solid transparent;
+    border-top-color: #007bff;
+    animation: spin 0.8s linear infinite;
+}
+
+@keyframes spin {
+    to {
+        transform: rotate(360deg);
+    }
+}
+
+/* Translation animation */
+.translation-animation {
+    position: relative;
+    display: inline-flex;
+    align-items: center;
+    gap: 10px;
+}
+
+.translation-animation .arrow {
+    width: 30px;
+    height: 2px;
+    background: #28a745;
+    position: relative;
+    animation: moveArrow 1.5s infinite;
+}
+
+.translation-animation .arrow::after {
+    content: '';
+    position: absolute;
+    right: -8px;
+    top: -4px;
+    width: 0;
+    height: 0;
+    border-left: 8px solid #28a745;
+    border-top: 5px solid transparent;
+    border-bottom: 5px solid transparent;
+}
+
+@keyframes moveArrow {
+    0%, 100% {
+        transform: translateX(0);
+    }
+    50% {
+        transform: translateX(10px);
+    }
+}
+
+/* Processing text animation */
+.processing-text {
+    display: inline-block;
+    position: relative;
+    font-style: italic;
+    color: #6c757d;
+}
+
+.processing-text::after {
+    content: '';
+    position: absolute;
+    bottom: -2px;
+    left: 0;
+    width: 100%;
+    height: 2px;
+    background: linear-gradient(90deg, 
+        transparent 0%, 
+        #007bff 50%, 
+        transparent 100%);
+    animation: processLine 2s linear infinite;
+}
+
+@keyframes processLine {
+    0% {
+        transform: translateX(-100%);
+    }
+    100% {
+        transform: translateX(100%);
+    }
+}
+
+/* Fade in animation for results */
+.fade-in {
+    animation: fadeIn 0.5s ease-in;
+}
+
+@keyframes fadeIn {
+    from {
+        opacity: 0;
+        transform: translateY(10px);
+    }
+    to {
+        opacity: 1;
+        transform: translateY(0);
+    }
+}
+
+/* Pulse animation for buttons */
+.btn-pulse {
+    animation: pulse 2s infinite;
+}
+
+@keyframes pulse {
+    0% {
+        box-shadow: 0 0 0 0 rgba(0, 123, 255, 0.7);
+    }
+    70% {
+        box-shadow: 0 0 0 10px rgba(0, 123, 255, 0);
+    }
+    100% {
+        box-shadow: 0 0 0 0 rgba(0, 123, 255, 0);
+    }
+}
+
+/* Loading overlay */
+.loading-overlay {
+    position: fixed;
+    top: 0;
+    left: 0;
+    right: 0;
+    bottom: 0;
+    background: rgba(255, 255, 255, 0.9);
+    display: flex;
+    align-items: center;
+    justify-content: center;
+    z-index: 9999;
+    opacity: 0;
+    pointer-events: none;
+    transition: opacity 0.3s ease;
+}
+
+.loading-overlay.active {
+    opacity: 1;
+    pointer-events: all;
+}
+
+.loading-content {
+    text-align: center;
+}
+
+.loading-content .spinner-custom {
+    margin-bottom: 20px;
+}
+
+/* Status indicator animations */
+.status-indicator {
+    transition: all 0.3s ease;
+}
+
+.status-indicator.processing {
+    font-weight: 500;
+    color: #007bff;
+}
+
+.status-indicator.success {
+    color: #28a745;
+}
+
+.status-indicator.error {
+    color: #dc3545;
+}
+
+/* Card loading state */
+.card-loading {
+    position: relative;
+    overflow: hidden;
+}
+
+.card-loading::after {
+    content: '';
+    position: absolute;
+    top: 0;
+    left: -100%;
+    width: 100%;
+    height: 100%;
+    background: linear-gradient(
+        90deg,
+        transparent,
+        rgba(255, 255, 255, 0.4),
+        transparent
+    );
+    animation: shimmer 2s infinite;
+}
+
+@keyframes shimmer {
+    100% {
+        left: 100%;
+    }
+}
+
+/* Text skeleton loader */
+.skeleton-loader {
+    background: #eee;
+    background: linear-gradient(90deg, #eee 25%, #f5f5f5 50%, #eee 75%);
+    background-size: 200% 100%;
+    animation: loading 1.5s infinite;
+    border-radius: 4px;
+    height: 20px;
+    margin: 10px 0;
+}
+
+@keyframes loading {
+    0% {
+        background-position: 200% 0;
+    }
+    100% {
+        background-position: -200% 0;
+    }
+}
+
+/* Audio playing animation */
+.audio-playing {
+    display: inline-flex;
+    align-items: flex-end;
+    gap: 2px;
+    height: 20px;
+}
+
+.audio-playing span {
+    width: 3px;
+    background: #28a745;
+    animation: audioBar 0.5s ease-in-out infinite alternate;
+}
+
+.audio-playing span:nth-child(1) {
+    height: 40%;
+    animation-delay: 0s;
+}
+
+.audio-playing span:nth-child(2) {
+    height: 60%;
+    animation-delay: 0.1s;
+}
+
+.audio-playing span:nth-child(3) {
+    height: 80%;
+    animation-delay: 0.2s;
+}
+
+.audio-playing span:nth-child(4) {
+    height: 60%;
+    animation-delay: 0.3s;
+}
+
+.audio-playing span:nth-child(5) {
+    height: 40%;
+    animation-delay: 0.4s;
+}
+
+@keyframes audioBar {
+    to {
+        height: 100%;
+    }
+}
+
+/* Smooth transitions */
+.btn {
+    transition: all 0.3s ease;
+}
+
+.card {
+    transition: transform 0.3s ease, box-shadow 0.3s ease;
+}
+
+.card:hover {
+    transform: translateY(-2px);
+    box-shadow: 0 6px 12px rgba(0, 0, 0, 0.15);
+}
+
+/* Success notification */
+.success-notification {
+    position: fixed;
+    top: 20px;
+    left: 50%;
+    transform: translateX(-50%);
+    background-color: #28a745;
+    color: white;
+    padding: 12px 24px;
+    border-radius: 8px;
+    box-shadow: 0 4px 12px rgba(0, 0, 0, 0.15);
+    display: flex;
+    align-items: center;
+    gap: 10px;
+    z-index: 9999;
+    opacity: 0;
+    transition: opacity 0.3s ease, transform 0.3s ease;
+    pointer-events: none;
+}
+
+.success-notification.show {
+    opacity: 1;
+    transform: translateX(-50%) translateY(0);
+    pointer-events: all;
+}
+
+.success-notification i {
+    font-size: 18px;
+}
+
+/* Mobile optimizations */
+@media (max-width: 768px) {
+    .loading-overlay {
+        background: rgba(255, 255, 255, 0.95);
+    }
+    
+    .spinner-custom,
+    .recording-wave {
+        transform: scale(0.8);
+    }
+    
+    .success-notification {
+        width: 90%;
+        max-width: 300px;
+        font-size: 14px;
+    }
+}
+
+/* Streaming translation styles */
+.streaming-text {
+    position: relative;
+    min-height: 1.5em;
+}
+
+.streaming-active::after {
+    content: '▊';
+    display: inline-block;
+    animation: cursor-blink 1s infinite;
+    color: #007bff;
+    font-weight: bold;
+}
+
+@keyframes cursor-blink {
+    0%, 49% {
+        opacity: 1;
+    }
+    50%, 100% {
+        opacity: 0;
+    }
+}
+
+/* Smooth text appearance for streaming */
+.streaming-text {
+    transition: all 0.1s ease-out;
+}
+
+/* Multi-speaker styles */
+.speaker-button {
+    position: relative;
+    padding: 8px 16px;
+    border-radius: 20px;
+    border: 2px solid;
+    background-color: white;
+    font-weight: 500;
+    transition: all 0.3s ease;
+    min-width: 120px;
+}
+
+.speaker-button.active {
+    color: white !important;
+    transform: scale(1.05);
+    box-shadow: 0 2px 8px rgba(0,0,0,0.2);
+}
+
+.speaker-avatar {
+    display: inline-flex;
+    align-items: center;
+    justify-content: center;
+    width: 30px;
+    height: 30px;
+    border-radius: 50%;
+    background-color: rgba(255,255,255,0.3);
+    color: inherit;
+    font-weight: bold;
+    font-size: 12px;
+    margin-right: 8px;
+}
+
+.speaker-button.active .speaker-avatar {
+    background-color: rgba(255,255,255,0.3);
+}
+
+.conversation-entry {
+    margin-bottom: 16px;
+    padding: 12px;
+    border-radius: 12px;
+    background-color: #f8f9fa;
+    position: relative;
+    animation: slideIn 0.3s ease-out;
+}
+
+@keyframes slideIn {
+    from {
+        opacity: 0;
+        transform: translateY(10px);
+    }
+    to {
+        opacity: 1;
+        transform: translateY(0);
+    }
+}
+
+.conversation-speaker {
+    display: flex;
+    align-items: center;
+    margin-bottom: 8px;
+    font-weight: 600;
+}
+
+.conversation-speaker-avatar {
+    display: inline-flex;
+    align-items: center;
+    justify-content: center;
+    width: 25px;
+    height: 25px;
+    border-radius: 50%;
+    color: white;
+    font-size: 11px;
+    margin-right: 8px;
+}
+
+.conversation-text {
+    margin-left: 33px;
+    line-height: 1.5;
+}
+
+.conversation-time {
+    font-size: 0.8rem;
+    color: #6c757d;
+    margin-left: auto;
+}
+
+.conversation-translation {
+    font-style: italic;
+    opacity: 0.9;
+}
+
+/* Speaker list responsive */
+@media (max-width: 768px) {
+    .speaker-button {
+        min-width: 100px;
+        padding: 6px 12px;
+        font-size: 0.9rem;
+    }
+    
+    .speaker-avatar {
+        width: 25px;
+        height: 25px;
+        font-size: 10px;
+    }
+}
--- a/static/js/app.js
+++ b/static/js/app.js
--- a/static/js/src/apiClient.ts
+++ b/static/js/src/apiClient.ts
@@ -0,0 +1,155 @@
+// API Client with CORS support
+export interface ApiClientConfig {
+    baseUrl?: string;
+    credentials?: RequestCredentials;
+    headers?: HeadersInit;
+}
+
+export class ApiClient {
+    private static instance: ApiClient;
+    private config: ApiClientConfig;
+    
+    private constructor() {
+        // Default configuration
+        this.config = {
+            baseUrl: '', // Use same origin by default
+            credentials: 'same-origin', // Change to 'include' for cross-origin requests
+            headers: {
+                'X-Requested-With': 'XMLHttpRequest' // Identify as AJAX request
+            }
+        };
+        
+        // Check if we're in a cross-origin context
+        this.detectCrossOrigin();
+    }
+    
+    static getInstance(): ApiClient {
+        if (!ApiClient.instance) {
+            ApiClient.instance = new ApiClient();
+        }
+        return ApiClient.instance;
+    }
+    
+    // Detect if we're making cross-origin requests
+    private detectCrossOrigin(): void {
+        // Check if the app is loaded from a different origin
+        const currentScript = document.currentScript as HTMLScriptElement | null;
+        const scriptSrc = currentScript?.src || '';
+        if (scriptSrc && !scriptSrc.startsWith(window.location.origin)) {
+            // We're likely in a cross-origin context
+            this.config.credentials = 'include';
+            console.log('Cross-origin context detected, enabling credentials');
+        }
+        
+        // Also check for explicit configuration in meta tags
+        const corsOrigin = document.querySelector('meta[name="cors-origin"]');
+        if (corsOrigin) {
+            const origin = corsOrigin.getAttribute('content');
+            if (origin && origin !== window.location.origin) {
+                this.config.baseUrl = origin;
+                this.config.credentials = 'include';
+                console.log(`Using CORS origin: ${origin}`);
+            }
+        }
+    }
+    
+    // Configure the API client
+    configure(config: Partial<ApiClientConfig>): void {
+        this.config = { ...this.config, ...config };
+    }
+    
+    // Make a fetch request with CORS support
+    async fetch(url: string, options: RequestInit = {}): Promise<Response> {
+        // Construct full URL
+        const fullUrl = this.config.baseUrl ? `${this.config.baseUrl}${url}` : url;
+        
+        // Merge headers
+        const headers = new Headers(options.headers);
+        if (this.config.headers) {
+            const configHeaders = new Headers(this.config.headers);
+            configHeaders.forEach((value, key) => {
+                if (!headers.has(key)) {
+                    headers.set(key, value);
+                }
+            });
+        }
+        
+        // Merge options with defaults
+        const fetchOptions: RequestInit = {
+            ...options,
+            headers,
+            credentials: options.credentials || this.config.credentials
+        };
+        
+        // Add CORS mode if cross-origin
+        if (this.config.baseUrl && this.config.baseUrl !== window.location.origin) {
+            fetchOptions.mode = 'cors';
+        }
+        
+        try {
+            const response = await fetch(fullUrl, fetchOptions);
+            
+            // Check for CORS errors
+            if (!response.ok && response.type === 'opaque') {
+                throw new Error('CORS request failed - check server CORS configuration');
+            }
+            
+            return response;
+        } catch (error) {
+            // Enhanced error handling for CORS issues
+            if (error instanceof TypeError && error.message.includes('Failed to fetch')) {
+                console.error('CORS Error: Failed to fetch. Check that:', {
+                    requestedUrl: fullUrl,
+                    origin: window.location.origin,
+                    credentials: fetchOptions.credentials,
+                    mode: fetchOptions.mode
+                });
+                throw new Error('CORS request failed. The server may not allow requests from this origin.');
+            }
+            throw error;
+        }
+    }
+    
+    // Convenience methods
+    async get(url: string, options?: RequestInit): Promise<Response> {
+        return this.fetch(url, { ...options, method: 'GET' });
+    }
+    
+    async post(url: string, body?: any, options?: RequestInit): Promise<Response> {
+        const init: RequestInit = { ...options, method: 'POST' };
+        
+        if (body) {
+            if (body instanceof FormData) {
+                init.body = body;
+            } else {
+                init.headers = {
+                    ...init.headers,
+                    'Content-Type': 'application/json'
+                };
+                init.body = JSON.stringify(body);
+            }
+        }
+        
+        return this.fetch(url, init);
+    }
+    
+    // JSON convenience methods
+    async getJSON<T>(url: string, options?: RequestInit): Promise<T> {
+        const response = await this.get(url, options);
+        if (!response.ok) {
+            throw new Error(`HTTP error! status: ${response.status}`);
+        }
+        return response.json();
+    }
+    
+    async postJSON<T>(url: string, body?: any, options?: RequestInit): Promise<T> {
+        const response = await this.post(url, body, options);
+        if (!response.ok) {
+            throw new Error(`HTTP error! status: ${response.status}`);
+        }
+        return response.json();
+    }
+}
+
+// Export a singleton instance
+export const apiClient = ApiClient.getInstance();
--- a/static/js/src/app.ts
+++ b/static/js/src/app.ts
--- a/static/js/src/connectionManager.ts
+++ b/static/js/src/connectionManager.ts
@@ -0,0 +1,321 @@
+// Connection management with retry logic
+export interface ConnectionConfig {
+    maxRetries: number;
+    initialDelay: number;
+    maxDelay: number;
+    backoffMultiplier: number;
+    timeout: number;
+    onlineCheckInterval: number;
+}
+
+export interface RetryOptions {
+    retries?: number;
+    delay?: number;
+    onRetry?: (attempt: number, error: Error) => void;
+}
+
+export type ConnectionStatus = 'online' | 'offline' | 'connecting' | 'error';
+
+export interface ConnectionState {
+    status: ConnectionStatus;
+    lastError?: Error;
+    retryCount: number;
+    lastOnlineTime?: Date;
+}
+
+export class ConnectionManager {
+    private static instance: ConnectionManager;
+    private config: ConnectionConfig;
+    private state: ConnectionState;
+    private listeners: Map<string, (state: ConnectionState) => void> = new Map();
+    private onlineCheckTimer?: number;
+    private reconnectTimer?: number;
+    
+    private constructor() {
+        this.config = {
+            maxRetries: 3,
+            initialDelay: 1000, // 1 second
+            maxDelay: 30000, // 30 seconds
+            backoffMultiplier: 2,
+            timeout: 10000, // 10 seconds
+            onlineCheckInterval: 5000 // 5 seconds
+        };
+        
+        this.state = {
+            status: navigator.onLine ? 'online' : 'offline',
+            retryCount: 0
+        };
+        
+        this.setupEventListeners();
+        this.startOnlineCheck();
+    }
+    
+    static getInstance(): ConnectionManager {
+        if (!ConnectionManager.instance) {
+            ConnectionManager.instance = new ConnectionManager();
+        }
+        return ConnectionManager.instance;
+    }
+    
+    // Configure connection settings
+    configure(config: Partial<ConnectionConfig>): void {
+        this.config = { ...this.config, ...config };
+    }
+    
+    // Setup browser online/offline event listeners
+    private setupEventListeners(): void {
+        window.addEventListener('online', () => {
+            console.log('Browser online event detected');
+            this.updateState({ status: 'online', retryCount: 0 });
+            this.checkServerConnection();
+        });
+        
+        window.addEventListener('offline', () => {
+            console.log('Browser offline event detected');
+            this.updateState({ status: 'offline' });
+        });
+        
+        // Listen for visibility changes to check connection when tab becomes active
+        document.addEventListener('visibilitychange', () => {
+            if (!document.hidden && this.state.status === 'offline') {
+                this.checkServerConnection();
+            }
+        });
+    }
+    
+    // Start periodic online checking
+    private startOnlineCheck(): void {
+        this.onlineCheckTimer = window.setInterval(() => {
+            if (this.state.status === 'offline' || this.state.status === 'error') {
+                this.checkServerConnection();
+            }
+        }, this.config.onlineCheckInterval);
+    }
+    
+    // Check actual server connection
+    async checkServerConnection(): Promise<boolean> {
+        if (!navigator.onLine) {
+            this.updateState({ status: 'offline' });
+            return false;
+        }
+        
+        this.updateState({ status: 'connecting' });
+        
+        try {
+            const controller = new AbortController();
+            const timeoutId = setTimeout(() => controller.abort(), 5000);
+            
+            const response = await fetch('/health', {
+                method: 'GET',
+                signal: controller.signal,
+                cache: 'no-cache'
+            });
+            
+            clearTimeout(timeoutId);
+            
+            if (response.ok) {
+                this.updateState({ 
+                    status: 'online', 
+                    retryCount: 0,
+                    lastOnlineTime: new Date()
+                });
+                return true;
+            } else {
+                throw new Error(`Server returned status ${response.status}`);
+            }
+        } catch (error) {
+            this.updateState({ 
+                status: 'error',
+                lastError: error as Error
+            });
+            return false;
+        }
+    }
+    
+    // Retry a failed request with exponential backoff
+    async retryRequest<T>(
+        request: () => Promise<T>,
+        options: RetryOptions = {}
+    ): Promise<T> {
+        const {
+            retries = this.config.maxRetries,
+            delay = this.config.initialDelay,
+            onRetry
+        } = options;
+        
+        let lastError: Error;
+        
+        for (let attempt = 0; attempt <= retries; attempt++) {
+            try {
+                // Check if we're online before attempting
+                if (!navigator.onLine) {
+                    throw new Error('No internet connection');
+                }
+                
+                // Add timeout to request
+                const result = await this.withTimeout(request(), this.config.timeout);
+                
+                // Success - reset retry count
+                if (this.state.retryCount > 0) {
+                    this.updateState({ retryCount: 0 });
+                }
+                
+                return result;
+            } catch (error) {
+                lastError = error as Error;
+                
+                // Don't retry if offline
+                if (!navigator.onLine) {
+                    this.updateState({ status: 'offline' });
+                    throw new Error('Request failed: No internet connection');
+                }
+                
+                // Don't retry on client errors (4xx)
+                if (this.isClientError(error)) {
+                    throw error;
+                }
+                
+                // Call retry callback if provided
+                if (onRetry && attempt < retries) {
+                    onRetry(attempt + 1, lastError);
+                }
+                
+                // If we have retries left, wait and try again
+                if (attempt < retries) {
+                    const backoffDelay = Math.min(
+                        delay * Math.pow(this.config.backoffMultiplier, attempt),
+                        this.config.maxDelay
+                    );
+                    
+                    console.log(`Retry attempt ${attempt + 1}/${retries} after ${backoffDelay}ms`);
+                    
+                    // Update retry count in state
+                    this.updateState({ retryCount: attempt + 1 });
+                    
+                    await this.delay(backoffDelay);
+                }
+            }
+        }
+        
+        // All retries exhausted
+        this.updateState({ 
+            status: 'error',
+            lastError: lastError!
+        });
+        
+        throw new Error(`Request failed after ${retries} retries: ${lastError!.message}`);
+    }
+    
+    // Add timeout to a promise
+    private withTimeout<T>(promise: Promise<T>, timeout: number): Promise<T> {
+        return Promise.race([
+            promise,
+            new Promise<T>((_, reject) => {
+                setTimeout(() => reject(new Error('Request timeout')), timeout);
+            })
+        ]);
+    }
+    
+    // Check if error is a client error (4xx)
+    private isClientError(error: any): boolean {
+        if (error.response && error.response.status >= 400 && error.response.status < 500) {
+            return true;
+        }
+        
+        // Check for specific error messages that shouldn't be retried
+        const message = error.message?.toLowerCase() || '';
+        const noRetryErrors = ['unauthorized', 'forbidden', 'bad request', 'not found'];
+        
+        return noRetryErrors.some(e => message.includes(e));
+    }
+    
+    // Delay helper
+    private delay(ms: number): Promise<void> {
+        return new Promise(resolve => setTimeout(resolve, ms));
+    }
+    
+    // Update connection state
+    private updateState(updates: Partial<ConnectionState>): void {
+        this.state = { ...this.state, ...updates };
+        this.notifyListeners();
+    }
+    
+    // Subscribe to connection state changes
+    subscribe(id: string, callback: (state: ConnectionState) => void): void {
+        this.listeners.set(id, callback);
+        // Immediately call with current state
+        callback(this.state);
+    }
+    
+    // Unsubscribe from connection state changes
+    unsubscribe(id: string): void {
+        this.listeners.delete(id);
+    }
+    
+    // Notify all listeners of state change
+    private notifyListeners(): void {
+        this.listeners.forEach(callback => callback(this.state));
+    }
+    
+    // Get current connection state
+    getState(): ConnectionState {
+        return { ...this.state };
+    }
+    
+    // Check if currently online
+    isOnline(): boolean {
+        return this.state.status === 'online';
+    }
+    
+    // Manual reconnect attempt
+    async reconnect(): Promise<boolean> {
+        console.log('Manual reconnect requested');
+        return this.checkServerConnection();
+    }
+    
+    // Cleanup
+    destroy(): void {
+        if (this.onlineCheckTimer) {
+            clearInterval(this.onlineCheckTimer);
+        }
+        if (this.reconnectTimer) {
+            clearTimeout(this.reconnectTimer);
+        }
+        this.listeners.clear();
+    }
+}
+
+// Helper function for retrying fetch requests
+export async function fetchWithRetry(
+    url: string,
+    options: RequestInit = {},
+    retryOptions: RetryOptions = {}
+): Promise<Response> {
+    const connectionManager = ConnectionManager.getInstance();
+    
+    return connectionManager.retryRequest(async () => {
+        const response = await fetch(url, options);
+        
+        if (!response.ok && response.status >= 500) {
+            // Server error - throw to trigger retry
+            throw new Error(`Server error: ${response.status}`);
+        }
+        
+        return response;
+    }, retryOptions);
+}
+
+// Helper function for retrying JSON requests
+export async function fetchJSONWithRetry<T>(
+    url: string,
+    options: RequestInit = {},
+    retryOptions: RetryOptions = {}
+): Promise<T> {
+    const response = await fetchWithRetry(url, options, retryOptions);
+    
+    if (!response.ok) {
+        throw new Error(`HTTP error! status: ${response.status}`);
+    }
+    
+    return response.json();
+}
--- a/static/js/src/connectionUI.ts
+++ b/static/js/src/connectionUI.ts
@@ -0,0 +1,325 @@
+// Connection status UI component
+import { ConnectionManager, ConnectionState } from './connectionManager';
+import { RequestQueueManager } from './requestQueue';
+
+export class ConnectionUI {
+    private static instance: ConnectionUI;
+    private connectionManager: ConnectionManager;
+    private queueManager: RequestQueueManager;
+    private statusElement: HTMLElement | null = null;
+    private retryButton: HTMLButtonElement | null = null;
+    private offlineMessage: HTMLElement | null = null;
+    
+    private constructor() {
+        this.connectionManager = ConnectionManager.getInstance();
+        this.queueManager = RequestQueueManager.getInstance();
+        this.createUI();
+        this.subscribeToConnectionChanges();
+    }
+    
+    static getInstance(): ConnectionUI {
+        if (!ConnectionUI.instance) {
+            ConnectionUI.instance = new ConnectionUI();
+        }
+        return ConnectionUI.instance;
+    }
+    
+    private createUI(): void {
+        // Create connection status indicator
+        this.statusElement = document.createElement('div');
+        this.statusElement.id = 'connectionStatus';
+        this.statusElement.className = 'connection-status';
+        this.statusElement.innerHTML = `
+            <span class="connection-icon"></span>
+            <span class="connection-text">Checking connection...</span>
+        `;
+        
+        // Create offline message banner
+        this.offlineMessage = document.createElement('div');
+        this.offlineMessage.id = 'offlineMessage';
+        this.offlineMessage.className = 'offline-message';
+        this.offlineMessage.innerHTML = `
+            <div class="offline-content">
+                <i class="fas fa-wifi-slash"></i>
+                <span class="offline-text">You're offline. Some features may be limited.</span>
+                <button class="btn btn-sm btn-outline-light retry-connection">
+                    <i class="fas fa-sync"></i> Retry
+                </button>
+                <div class="queued-info" style="display: none;">
+                    <small class="queued-count"></small>
+                </div>
+            </div>
+        `;
+        this.offlineMessage.style.display = 'none';
+        
+        // Add to page
+        document.body.appendChild(this.statusElement);
+        document.body.appendChild(this.offlineMessage);
+        
+        // Get retry button reference
+        this.retryButton = this.offlineMessage.querySelector('.retry-connection') as HTMLButtonElement;
+        this.retryButton?.addEventListener('click', () => this.handleRetry());
+        
+        // Add CSS if not already present
+        if (!document.getElementById('connection-ui-styles')) {
+            const style = document.createElement('style');
+            style.id = 'connection-ui-styles';
+            style.textContent = `
+                .connection-status {
+                    position: fixed;
+                    bottom: 20px;
+                    right: 20px;
+                    background: rgba(0, 0, 0, 0.8);
+                    color: white;
+                    padding: 8px 16px;
+                    border-radius: 20px;
+                    display: flex;
+                    align-items: center;
+                    gap: 8px;
+                    font-size: 14px;
+                    z-index: 1000;
+                    transition: all 0.3s ease;
+                    opacity: 0;
+                    transform: translateY(10px);
+                }
+                
+                .connection-status.visible {
+                    opacity: 1;
+                    transform: translateY(0);
+                }
+                
+                .connection-status.online {
+                    background: rgba(40, 167, 69, 0.9);
+                }
+                
+                .connection-status.offline {
+                    background: rgba(220, 53, 69, 0.9);
+                }
+                
+                .connection-status.connecting {
+                    background: rgba(255, 193, 7, 0.9);
+                }
+                
+                .connection-icon::before {
+                    content: '●';
+                    display: inline-block;
+                    animation: pulse 2s infinite;
+                }
+                
+                .connection-status.connecting .connection-icon::before {
+                    animation: spin 1s linear infinite;
+                    content: '↻';
+                }
+                
+                @keyframes pulse {
+                    0%, 100% { opacity: 1; }
+                    50% { opacity: 0.5; }
+                }
+                
+                @keyframes spin {
+                    from { transform: rotate(0deg); }
+                    to { transform: rotate(360deg); }
+                }
+                
+                .offline-message {
+                    position: fixed;
+                    top: 0;
+                    left: 0;
+                    right: 0;
+                    background: #dc3545;
+                    color: white;
+                    padding: 12px;
+                    text-align: center;
+                    z-index: 1001;
+                    transform: translateY(-100%);
+                    transition: transform 0.3s ease;
+                }
+                
+                .offline-message.show {
+                    transform: translateY(0);
+                }
+                
+                .offline-content {
+                    display: flex;
+                    align-items: center;
+                    justify-content: center;
+                    gap: 12px;
+                    flex-wrap: wrap;
+                }
+                
+                .offline-content i {
+                    font-size: 20px;
+                }
+                
+                .retry-connection {
+                    border-color: white;
+                    color: white;
+                }
+                
+                .retry-connection:hover {
+                    background: white;
+                    color: #dc3545;
+                }
+                
+                .queued-info {
+                    margin-left: 12px;
+                }
+                
+                .queued-count {
+                    opacity: 0.9;
+                }
+                
+                @media (max-width: 768px) {
+                    .connection-status {
+                        bottom: 10px;
+                        right: 10px;
+                        font-size: 12px;
+                        padding: 6px 12px;
+                    }
+                    
+                    .offline-content {
+                        font-size: 14px;
+                    }
+                }
+            `;
+            document.head.appendChild(style);
+        }
+    }
+    
+    private subscribeToConnectionChanges(): void {
+        this.connectionManager.subscribe('connection-ui', (state: ConnectionState) => {
+            this.updateUI(state);
+        });
+    }
+    
+    private updateUI(state: ConnectionState): void {
+        if (!this.statusElement || !this.offlineMessage) return;
+        
+        const statusText = this.statusElement.querySelector('.connection-text') as HTMLElement;
+        
+        // Update status element
+        this.statusElement.className = `connection-status visible ${state.status}`;
+        
+        switch (state.status) {
+            case 'online':
+                statusText.textContent = 'Connected';
+                this.hideOfflineMessage();
+                // Hide status after 3 seconds when online
+                setTimeout(() => {
+                    if (this.connectionManager.getState().status === 'online') {
+                        this.statusElement?.classList.remove('visible');
+                    }
+                }, 3000);
+                break;
+                
+            case 'offline':
+                statusText.textContent = 'Offline';
+                this.showOfflineMessage();
+                this.updateQueuedInfo();
+                break;
+                
+            case 'connecting':
+                statusText.textContent = 'Reconnecting...';
+                if (this.retryButton) {
+                    this.retryButton.disabled = true;
+                    this.retryButton.innerHTML = '<i class="fas fa-spinner fa-spin"></i> Connecting...';
+                }
+                break;
+                
+            case 'error':
+                statusText.textContent = `Connection error${state.retryCount > 0 ? ` (Retry ${state.retryCount})` : ''}`;
+                this.showOfflineMessage();
+                this.updateQueuedInfo();
+                if (this.retryButton) {
+                    this.retryButton.disabled = false;
+                    this.retryButton.innerHTML = '<i class="fas fa-sync"></i> Retry';
+                }
+                break;
+        }
+    }
+    
+    private showOfflineMessage(): void {
+        if (this.offlineMessage) {
+            this.offlineMessage.style.display = 'block';
+            setTimeout(() => {
+                this.offlineMessage?.classList.add('show');
+            }, 10);
+        }
+    }
+    
+    private hideOfflineMessage(): void {
+        if (this.offlineMessage) {
+            this.offlineMessage.classList.remove('show');
+            setTimeout(() => {
+                if (this.offlineMessage) {
+                    this.offlineMessage.style.display = 'none';
+                }
+            }, 300);
+        }
+    }
+    
+    private updateQueuedInfo(): void {
+        const queueStatus = this.queueManager.getStatus();
+        const queuedByType = this.queueManager.getQueuedByType();
+        const queuedInfo = this.offlineMessage?.querySelector('.queued-info') as HTMLElement;
+        const queuedCount = this.offlineMessage?.querySelector('.queued-count') as HTMLElement;
+        
+        if (queuedInfo && queuedCount) {
+            const totalQueued = queueStatus.queueLength + queueStatus.activeRequests;
+            
+            if (totalQueued > 0) {
+                queuedInfo.style.display = 'block';
+                const parts = [];
+                
+                if (queuedByType.transcribe > 0) {
+                    parts.push(`${queuedByType.transcribe} transcription${queuedByType.transcribe > 1 ? 's' : ''}`);
+                }
+                if (queuedByType.translate > 0) {
+                    parts.push(`${queuedByType.translate} translation${queuedByType.translate > 1 ? 's' : ''}`);
+                }
+                if (queuedByType.tts > 0) {
+                    parts.push(`${queuedByType.tts} audio generation${queuedByType.tts > 1 ? 's' : ''}`);
+                }
+                
+                queuedCount.textContent = `${totalQueued} request${totalQueued > 1 ? 's' : ''} queued${parts.length > 0 ? ': ' + parts.join(', ') : ''}`;
+            } else {
+                queuedInfo.style.display = 'none';
+            }
+        }
+    }
+    
+    private async handleRetry(): Promise<void> {
+        if (this.retryButton) {
+            this.retryButton.disabled = true;
+            this.retryButton.innerHTML = '<i class="fas fa-spinner fa-spin"></i> Connecting...';
+        }
+        
+        const success = await this.connectionManager.reconnect();
+        
+        if (!success && this.retryButton) {
+            this.retryButton.disabled = false;
+            this.retryButton.innerHTML = '<i class="fas fa-sync"></i> Retry';
+        }
+    }
+    
+    // Public method to show temporary connection message
+    showTemporaryMessage(message: string, type: 'success' | 'error' | 'warning' = 'success'): void {
+        if (!this.statusElement) return;
+        
+        const statusText = this.statusElement.querySelector('.connection-text') as HTMLElement;
+        const originalClass = this.statusElement.className;
+        const originalText = statusText.textContent;
+        
+        // Update appearance based on type
+        this.statusElement.className = `connection-status visible ${type === 'success' ? 'online' : type === 'error' ? 'offline' : 'connecting'}`;
+        statusText.textContent = message;
+        
+        // Reset after 3 seconds
+        setTimeout(() => {
+            if (this.statusElement && statusText) {
+                this.statusElement.className = originalClass;
+                statusText.textContent = originalText || '';
+            }
+        }, 3000);
+    }
+}
--- a/static/js/src/errorBoundary.ts
+++ b/static/js/src/errorBoundary.ts
@@ -0,0 +1,286 @@
+// Error boundary implementation for better error handling
+export interface ErrorInfo {
+    message: string;
+    stack?: string;
+    component?: string;
+    timestamp: number;
+    userAgent: string;
+    url: string;
+}
+
+export class ErrorBoundary {
+    private static instance: ErrorBoundary;
+    private errorLog: ErrorInfo[] = [];
+    private maxErrorLog = 50;
+    private errorHandlers: Map<string, (error: Error, errorInfo: ErrorInfo) => void> = new Map();
+    private globalErrorHandler: ((error: Error, errorInfo: ErrorInfo) => void) | null = null;
+
+    private constructor() {
+        this.setupGlobalErrorHandlers();
+    }
+
+    static getInstance(): ErrorBoundary {
+        if (!ErrorBoundary.instance) {
+            ErrorBoundary.instance = new ErrorBoundary();
+        }
+        return ErrorBoundary.instance;
+    }
+
+    private setupGlobalErrorHandlers(): void {
+        // Handle unhandled errors
+        window.addEventListener('error', (event: ErrorEvent) => {
+            const errorInfo: ErrorInfo = {
+                message: event.message,
+                stack: event.error?.stack,
+                timestamp: Date.now(),
+                userAgent: navigator.userAgent,
+                url: window.location.href,
+                component: 'global'
+            };
+
+            this.logError(event.error || new Error(event.message), errorInfo);
+            this.handleError(event.error || new Error(event.message), errorInfo);
+
+            // Prevent default error handling
+            event.preventDefault();
+        });
+
+        // Handle unhandled promise rejections
+        window.addEventListener('unhandledrejection', (event: PromiseRejectionEvent) => {
+            const error = new Error(event.reason?.message || 'Unhandled Promise Rejection');
+            const errorInfo: ErrorInfo = {
+                message: error.message,
+                stack: event.reason?.stack,
+                timestamp: Date.now(),
+                userAgent: navigator.userAgent,
+                url: window.location.href,
+                component: 'promise'
+            };
+
+            this.logError(error, errorInfo);
+            this.handleError(error, errorInfo);
+
+            // Prevent default error handling
+            event.preventDefault();
+        });
+    }
+
+    // Wrap a function with error boundary
+    wrap<T extends (...args: any[]) => any>(
+        fn: T,
+        component: string,
+        fallback?: (...args: Parameters<T>) => ReturnType<T>
+    ): T {
+        return ((...args: Parameters<T>): ReturnType<T> => {
+            try {
+                const result = fn(...args);
+                
+                // Handle async functions
+                if (result instanceof Promise) {
+                    return result.catch((error: Error) => {
+                        const errorInfo: ErrorInfo = {
+                            message: error.message,
+                            stack: error.stack,
+                            component,
+                            timestamp: Date.now(),
+                            userAgent: navigator.userAgent,
+                            url: window.location.href
+                        };
+
+                        this.logError(error, errorInfo);
+                        this.handleError(error, errorInfo);
+
+                        if (fallback) {
+                            return fallback(...args) as ReturnType<T>;
+                        }
+                        throw error;
+                    }) as ReturnType<T>;
+                }
+
+                return result;
+            } catch (error: any) {
+                const errorInfo: ErrorInfo = {
+                    message: error.message,
+                    stack: error.stack,
+                    component,
+                    timestamp: Date.now(),
+                    userAgent: navigator.userAgent,
+                    url: window.location.href
+                };
+
+                this.logError(error, errorInfo);
+                this.handleError(error, errorInfo);
+
+                if (fallback) {
+                    return fallback(...args);
+                }
+                throw error;
+            }
+        }) as T;
+    }
+
+    // Wrap async functions specifically
+    wrapAsync<T extends (...args: any[]) => Promise<any>>(
+        fn: T,
+        component: string,
+        fallback?: (...args: Parameters<T>) => ReturnType<T>
+    ): T {
+        return (async (...args: Parameters<T>) => {
+            try {
+                return await fn(...args);
+            } catch (error: any) {
+                const errorInfo: ErrorInfo = {
+                    message: error.message,
+                    stack: error.stack,
+                    component,
+                    timestamp: Date.now(),
+                    userAgent: navigator.userAgent,
+                    url: window.location.href
+                };
+
+                this.logError(error, errorInfo);
+                this.handleError(error, errorInfo);
+
+                if (fallback) {
+                    return fallback(...args);
+                }
+                throw error;
+            }
+        }) as T;
+    }
+
+    // Register component-specific error handler
+    registerErrorHandler(component: string, handler: (error: Error, errorInfo: ErrorInfo) => void): void {
+        this.errorHandlers.set(component, handler);
+    }
+
+    // Set global error handler
+    setGlobalErrorHandler(handler: (error: Error, errorInfo: ErrorInfo) => void): void {
+        this.globalErrorHandler = handler;
+    }
+
+    private logError(error: Error, errorInfo: ErrorInfo): void {
+        // Add to error log
+        this.errorLog.push(errorInfo);
+
+        // Keep only recent errors
+        if (this.errorLog.length > this.maxErrorLog) {
+            this.errorLog.shift();
+        }
+
+        // Log to console in development
+        console.error(`[${errorInfo.component}] Error:`, error);
+        console.error('Error Info:', errorInfo);
+
+        // Send to monitoring service if available
+        this.sendToMonitoring(error, errorInfo);
+    }
+
+    private handleError(error: Error, errorInfo: ErrorInfo): void {
+        // Check for component-specific handler
+        const componentHandler = this.errorHandlers.get(errorInfo.component || '');
+        if (componentHandler) {
+            componentHandler(error, errorInfo);
+            return;
+        }
+
+        // Use global handler if set
+        if (this.globalErrorHandler) {
+            this.globalErrorHandler(error, errorInfo);
+            return;
+        }
+
+        // Default error handling
+        this.showErrorNotification(error, errorInfo);
+    }
+
+    private showErrorNotification(error: Error, errorInfo: ErrorInfo): void {
+        // Create error notification
+        const notification = document.createElement('div');
+        notification.className = 'alert alert-danger alert-dismissible fade show position-fixed bottom-0 end-0 m-3';
+        notification.style.zIndex = '9999';
+        notification.style.maxWidth = '400px';
+        
+        const isUserFacing = this.isUserFacingError(error);
+        const message = isUserFacing ? error.message : 'An unexpected error occurred. Please try again.';
+        
+        notification.innerHTML = `
+            <strong><i class="fas fa-exclamation-circle"></i> Error${errorInfo.component ? ` in ${errorInfo.component}` : ''}</strong>
+            <p class="mb-0">${message}</p>
+            ${!isUserFacing ? '<small class="text-muted">The error has been logged for investigation.</small>' : ''}
+            <button type="button" class="btn-close" data-bs-dismiss="alert"></button>
+        `;
+
+        document.body.appendChild(notification);
+
+        // Auto-dismiss after 10 seconds
+        setTimeout(() => {
+            if (notification.parentNode) {
+                notification.remove();
+            }
+        }, 10000);
+    }
+
+    private isUserFacingError(error: Error): boolean {
+        // Determine if error should be shown to user as-is
+        const userFacingMessages = [
+            'rate limit',
+            'network',
+            'offline',
+            'not found',
+            'unauthorized',
+            'forbidden',
+            'timeout',
+            'invalid'
+        ];
+
+        const message = error.message.toLowerCase();
+        return userFacingMessages.some(msg => message.includes(msg));
+    }
+
+    private async sendToMonitoring(error: Error, errorInfo: ErrorInfo): Promise<void> {
+        // Only send errors in production
+        if (window.location.hostname === 'localhost' || window.location.hostname === '127.0.0.1') {
+            return;
+        }
+
+        try {
+            // Send error to backend monitoring endpoint
+            await fetch('/api/log-error', {
+                method: 'POST',
+                headers: {
+                    'Content-Type': 'application/json'
+                },
+                body: JSON.stringify({
+                    error: {
+                        message: error.message,
+                        stack: error.stack,
+                        name: error.name
+                    },
+                    errorInfo
+                })
+            });
+        } catch (monitoringError) {
+            // Fail silently - don't create error loop
+            console.error('Failed to send error to monitoring:', monitoringError);
+        }
+    }
+
+    // Get error log for debugging
+    getErrorLog(): ErrorInfo[] {
+        return [...this.errorLog];
+    }
+
+    // Clear error log
+    clearErrorLog(): void {
+        this.errorLog = [];
+    }
+
+    // Check if component has recent errors
+    hasRecentErrors(component: string, timeWindow: number = 60000): boolean {
+        const cutoff = Date.now() - timeWindow;
+        return this.errorLog.some(
+            error => error.component === component && error.timestamp > cutoff
+        );
+    }
+}
--- a/static/js/src/memoryManager.ts
+++ b/static/js/src/memoryManager.ts
@@ -0,0 +1,309 @@
+/**
+ * Memory management utilities for preventing leaks in audio handling
+ */
+
+export class MemoryManager {
+    private static instance: MemoryManager;
+    private audioContexts: Set<AudioContext> = new Set();
+    private objectURLs: Set<string> = new Set();
+    private mediaStreams: Set<MediaStream> = new Set();
+    private intervals: Set<number> = new Set();
+    private timeouts: Set<number> = new Set();
+
+    private constructor() {
+        // Set up periodic cleanup
+        this.startPeriodicCleanup();
+        
+        // Clean up on page unload
+        window.addEventListener('beforeunload', () => this.cleanup());
+    }
+
+    static getInstance(): MemoryManager {
+        if (!MemoryManager.instance) {
+            MemoryManager.instance = new MemoryManager();
+        }
+        return MemoryManager.instance;
+    }
+
+    /**
+     * Register an AudioContext for cleanup
+     */
+    registerAudioContext(context: AudioContext): void {
+        this.audioContexts.add(context);
+    }
+
+    /**
+     * Register an object URL for cleanup
+     */
+    registerObjectURL(url: string): void {
+        this.objectURLs.add(url);
+    }
+
+    /**
+     * Register a MediaStream for cleanup
+     */
+    registerMediaStream(stream: MediaStream): void {
+        this.mediaStreams.add(stream);
+    }
+
+    /**
+     * Register an interval for cleanup
+     */
+    registerInterval(id: number): void {
+        this.intervals.add(id);
+    }
+
+    /**
+     * Register a timeout for cleanup
+     */
+    registerTimeout(id: number): void {
+        this.timeouts.add(id);
+    }
+
+    /**
+     * Clean up a specific AudioContext
+     */
+    cleanupAudioContext(context: AudioContext): void {
+        if (context.state !== 'closed') {
+            context.close().catch(console.error);
+        }
+        this.audioContexts.delete(context);
+    }
+
+    /**
+     * Clean up a specific object URL
+     */
+    cleanupObjectURL(url: string): void {
+        URL.revokeObjectURL(url);
+        this.objectURLs.delete(url);
+    }
+
+    /**
+     * Clean up a specific MediaStream
+     */
+    cleanupMediaStream(stream: MediaStream): void {
+        stream.getTracks().forEach(track => {
+            track.stop();
+        });
+        this.mediaStreams.delete(stream);
+    }
+
+    /**
+     * Clean up all resources
+     */
+    cleanup(): void {
+        // Clean up audio contexts
+        this.audioContexts.forEach(context => {
+            if (context.state !== 'closed') {
+                context.close().catch(console.error);
+            }
+        });
+        this.audioContexts.clear();
+
+        // Clean up object URLs
+        this.objectURLs.forEach(url => {
+            URL.revokeObjectURL(url);
+        });
+        this.objectURLs.clear();
+
+        // Clean up media streams
+        this.mediaStreams.forEach(stream => {
+            stream.getTracks().forEach(track => {
+                track.stop();
+            });
+        });
+        this.mediaStreams.clear();
+
+        // Clear intervals and timeouts
+        this.intervals.forEach(id => clearInterval(id));
+        this.intervals.clear();
+        
+        this.timeouts.forEach(id => clearTimeout(id));
+        this.timeouts.clear();
+
+        console.log('Memory cleanup completed');
+    }
+
+    /**
+     * Get memory usage statistics
+     */
+    getStats(): MemoryStats {
+        return {
+            audioContexts: this.audioContexts.size,
+            objectURLs: this.objectURLs.size,
+            mediaStreams: this.mediaStreams.size,
+            intervals: this.intervals.size,
+            timeouts: this.timeouts.size
+        };
+    }
+
+    /**
+     * Start periodic cleanup of orphaned resources
+     */
+    private startPeriodicCleanup(): void {
+        setInterval(() => {
+            // Clean up closed audio contexts
+            this.audioContexts.forEach(context => {
+                if (context.state === 'closed') {
+                    this.audioContexts.delete(context);
+                }
+            });
+
+            // Clean up stopped media streams
+            this.mediaStreams.forEach(stream => {
+                const activeTracks = stream.getTracks().filter(track => track.readyState === 'live');
+                if (activeTracks.length === 0) {
+                    this.mediaStreams.delete(stream);
+                }
+            });
+
+            // Log stats in development
+            if (process.env.NODE_ENV === 'development') {
+                const stats = this.getStats();
+                if (Object.values(stats).some(v => v > 0)) {
+                    console.log('Memory manager stats:', stats);
+                }
+            }
+        }, 30000); // Every 30 seconds
+
+        // Don't track this interval to avoid self-reference
+        // It will be cleared on page unload
+    }
+}
+
+interface MemoryStats {
+    audioContexts: number;
+    objectURLs: number;
+    mediaStreams: number;
+    intervals: number;
+    timeouts: number;
+}
+
+/**
+ * Wrapper for safe audio blob handling
+ */
+export class AudioBlobHandler {
+    private blob: Blob;
+    private objectURL?: string;
+    private memoryManager: MemoryManager;
+
+    constructor(blob: Blob) {
+        this.blob = blob;
+        this.memoryManager = MemoryManager.getInstance();
+    }
+
+    /**
+     * Get object URL (creates one if needed)
+     */
+    getObjectURL(): string {
+        if (!this.objectURL) {
+            this.objectURL = URL.createObjectURL(this.blob);
+            this.memoryManager.registerObjectURL(this.objectURL);
+        }
+        return this.objectURL;
+    }
+
+    /**
+     * Get the blob
+     */
+    getBlob(): Blob {
+        return this.blob;
+    }
+
+    /**
+     * Clean up resources
+     */
+    cleanup(): void {
+        if (this.objectURL) {
+            this.memoryManager.cleanupObjectURL(this.objectURL);
+            this.objectURL = undefined;
+        }
+        // Help garbage collection
+        (this.blob as any) = null;
+    }
+}
+
+/**
+ * Safe MediaRecorder wrapper
+ */
+export class SafeMediaRecorder {
+    private mediaRecorder?: MediaRecorder;
+    private stream?: MediaStream;
+    private chunks: Blob[] = [];
+    private memoryManager: MemoryManager;
+
+    constructor() {
+        this.memoryManager = MemoryManager.getInstance();
+    }
+
+    async start(constraints: MediaStreamConstraints = { audio: true }): Promise<void> {
+        // Clean up any existing recorder
+        this.cleanup();
+
+        this.stream = await navigator.mediaDevices.getUserMedia(constraints);
+        this.memoryManager.registerMediaStream(this.stream);
+
+        const options = {
+            mimeType: MediaRecorder.isTypeSupported('audio/webm;codecs=opus') 
+                ? 'audio/webm;codecs=opus' 
+                : 'audio/webm'
+        };
+
+        this.mediaRecorder = new MediaRecorder(this.stream, options);
+        this.chunks = [];
+
+        this.mediaRecorder.ondataavailable = (event) => {
+            if (event.data.size > 0) {
+                this.chunks.push(event.data);
+            }
+        };
+
+        this.mediaRecorder.start();
+    }
+
+    stop(): Promise<Blob> {
+        return new Promise((resolve, reject) => {
+            if (!this.mediaRecorder) {
+                reject(new Error('MediaRecorder not initialized'));
+                return;
+            }
+
+            this.mediaRecorder.onstop = () => {
+                const blob = new Blob(this.chunks, { 
+                    type: this.mediaRecorder?.mimeType || 'audio/webm' 
+                });
+                resolve(blob);
+                
+                // Clean up after delivering the blob
+                setTimeout(() => this.cleanup(), 100);
+            };
+
+            this.mediaRecorder.stop();
+        });
+    }
+
+    cleanup(): void {
+        if (this.stream) {
+            this.memoryManager.cleanupMediaStream(this.stream);
+            this.stream = undefined;
+        }
+
+        if (this.mediaRecorder) {
+            if (this.mediaRecorder.state !== 'inactive') {
+                try {
+                    this.mediaRecorder.stop();
+                } catch (e) {
+                    // Ignore errors
+                }
+            }
+            this.mediaRecorder = undefined;
+        }
+
+        // Clear chunks
+        this.chunks = [];
+    }
+
+    isRecording(): boolean {
+        return this.mediaRecorder?.state === 'recording';
+    }
+}
--- a/static/js/src/performanceMonitor.ts
+++ b/static/js/src/performanceMonitor.ts
@@ -0,0 +1,147 @@
+// Performance monitoring for translation latency
+export class PerformanceMonitor {
+    private static instance: PerformanceMonitor;
+    private metrics: Map<string, number[]> = new Map();
+    private timers: Map<string, number> = new Map();
+    
+    private constructor() {}
+    
+    static getInstance(): PerformanceMonitor {
+        if (!PerformanceMonitor.instance) {
+            PerformanceMonitor.instance = new PerformanceMonitor();
+        }
+        return PerformanceMonitor.instance;
+    }
+    
+    // Start timing an operation
+    startTimer(operation: string): void {
+        this.timers.set(operation, performance.now());
+    }
+    
+    // End timing and record the duration
+    endTimer(operation: string): number {
+        const startTime = this.timers.get(operation);
+        if (!startTime) {
+            console.warn(`No start time found for operation: ${operation}`);
+            return 0;
+        }
+        
+        const duration = performance.now() - startTime;
+        this.recordMetric(operation, duration);
+        this.timers.delete(operation);
+        
+        return duration;
+    }
+    
+    // Record a metric value
+    recordMetric(name: string, value: number): void {
+        if (!this.metrics.has(name)) {
+            this.metrics.set(name, []);
+        }
+        
+        const values = this.metrics.get(name)!;
+        values.push(value);
+        
+        // Keep only last 100 values
+        if (values.length > 100) {
+            values.shift();
+        }
+    }
+    
+    // Get average metric value
+    getAverageMetric(name: string): number {
+        const values = this.metrics.get(name);
+        if (!values || values.length === 0) {
+            return 0;
+        }
+        
+        const sum = values.reduce((a, b) => a + b, 0);
+        return sum / values.length;
+    }
+    
+    // Get time to first byte (TTFB) for streaming
+    measureTTFB(operation: string, firstByteTime: number): number {
+        const startTime = this.timers.get(operation);
+        if (!startTime) {
+            return 0;
+        }
+        
+        const ttfb = firstByteTime - startTime;
+        this.recordMetric(`${operation}_ttfb`, ttfb);
+        return ttfb;
+    }
+    
+    // Get performance summary
+    getPerformanceSummary(): {
+        streaming: {
+            avgTotalTime: number;
+            avgTTFB: number;
+            count: number;
+        };
+        regular: {
+            avgTotalTime: number;
+            count: number;
+        };
+        improvement: {
+            ttfbReduction: number;
+            perceivedLatencyReduction: number;
+        };
+    } {
+        const streamingTotal = this.getAverageMetric('streaming_translation');
+        const streamingTTFB = this.getAverageMetric('streaming_translation_ttfb');
+        const streamingCount = this.metrics.get('streaming_translation')?.length || 0;
+        
+        const regularTotal = this.getAverageMetric('regular_translation');
+        const regularCount = this.metrics.get('regular_translation')?.length || 0;
+        
+        // Calculate improvements
+        const ttfbReduction = regularTotal > 0 && streamingTTFB > 0
+            ? ((regularTotal - streamingTTFB) / regularTotal) * 100
+            : 0;
+            
+        // Perceived latency is based on TTFB for streaming vs total time for regular
+        const perceivedLatencyReduction = ttfbReduction;
+        
+        return {
+            streaming: {
+                avgTotalTime: streamingTotal,
+                avgTTFB: streamingTTFB,
+                count: streamingCount
+            },
+            regular: {
+                avgTotalTime: regularTotal,
+                count: regularCount
+            },
+            improvement: {
+                ttfbReduction: Math.round(ttfbReduction),
+                perceivedLatencyReduction: Math.round(perceivedLatencyReduction)
+            }
+        };
+    }
+    
+    // Log performance stats to console
+    logPerformanceStats(): void {
+        const summary = this.getPerformanceSummary();
+        
+        console.group('Translation Performance Stats');
+        console.log('Streaming Translation:');
+        console.log(`  Average Total Time: ${summary.streaming.avgTotalTime.toFixed(2)}ms`);
+        console.log(`  Average TTFB: ${summary.streaming.avgTTFB.toFixed(2)}ms`);
+        console.log(`  Sample Count: ${summary.streaming.count}`);
+        
+        console.log('Regular Translation:');
+        console.log(`  Average Total Time: ${summary.regular.avgTotalTime.toFixed(2)}ms`);
+        console.log(`  Sample Count: ${summary.regular.count}`);
+        
+        console.log('Improvements:');
+        console.log(`  TTFB Reduction: ${summary.improvement.ttfbReduction}%`);
+        console.log(`  Perceived Latency Reduction: ${summary.improvement.perceivedLatencyReduction}%`);
+        console.groupEnd();
+    }
+    
+    // Clear all metrics
+    clearMetrics(): void {
+        this.metrics.clear();
+        this.timers.clear();
+    }
+}
--- a/static/js/src/requestQueue.ts
+++ b/static/js/src/requestQueue.ts
@@ -0,0 +1,333 @@
+// Request queue and throttling manager
+import { ConnectionManager, ConnectionState } from './connectionManager';
+
+export interface QueuedRequest {
+    id: string;
+    type: 'transcribe' | 'translate' | 'tts';
+    request: () => Promise<any>;
+    resolve: (value: any) => void;
+    reject: (reason?: any) => void;
+    retryCount: number;
+    priority: number;
+    timestamp: number;
+}
+
+export class RequestQueueManager {
+    private static instance: RequestQueueManager;
+    private queue: QueuedRequest[] = [];
+    private activeRequests: Map<string, QueuedRequest> = new Map();
+    private maxConcurrent = 2; // Maximum concurrent requests
+    private maxRetries = 3;
+    private retryDelay = 1000; // Base retry delay in ms
+    private isProcessing = false;
+    private connectionManager: ConnectionManager;
+    private isPaused = false;
+
+    // Rate limiting
+    private requestHistory: number[] = [];
+    private maxRequestsPerMinute = 30;
+    private maxRequestsPerSecond = 2;
+
+    private constructor() {
+        this.connectionManager = ConnectionManager.getInstance();
+        
+        // Subscribe to connection state changes
+        this.connectionManager.subscribe('request-queue', (state: ConnectionState) => {
+            this.handleConnectionStateChange(state);
+        });
+        
+        // Start processing queue
+        this.startProcessing();
+    }
+
+    static getInstance(): RequestQueueManager {
+        if (!RequestQueueManager.instance) {
+            RequestQueueManager.instance = new RequestQueueManager();
+        }
+        return RequestQueueManager.instance;
+    }
+
+    // Add request to queue
+    async enqueue<T>(
+        type: 'transcribe' | 'translate' | 'tts',
+        request: () => Promise<T>,
+        priority: number = 5
+    ): Promise<T> {
+        // Check rate limits
+        if (!this.checkRateLimits()) {
+            throw new Error('Rate limit exceeded. Please slow down.');
+        }
+
+        return new Promise((resolve, reject) => {
+            const id = this.generateId();
+            const queuedRequest: QueuedRequest = {
+                id,
+                type,
+                request,
+                resolve,
+                reject,
+                retryCount: 0,
+                priority,
+                timestamp: Date.now()
+            };
+
+            // Add to queue based on priority
+            this.addToQueue(queuedRequest);
+            
+            // Log queue status
+            console.log(`Request queued: ${type}, Queue size: ${this.queue.length}, Active: ${this.activeRequests.size}`);
+        });
+    }
+
+    private addToQueue(request: QueuedRequest): void {
+        // Insert based on priority (higher priority first)
+        const insertIndex = this.queue.findIndex(item => item.priority < request.priority);
+        if (insertIndex === -1) {
+            this.queue.push(request);
+        } else {
+            this.queue.splice(insertIndex, 0, request);
+        }
+    }
+
+    private checkRateLimits(): boolean {
+        const now = Date.now();
+        
+        // Clean old entries
+        this.requestHistory = this.requestHistory.filter(
+            time => now - time < 60000 // Keep last minute
+        );
+
+        // Check per-second limit
+        const lastSecond = this.requestHistory.filter(
+            time => now - time < 1000
+        ).length;
+        
+        if (lastSecond >= this.maxRequestsPerSecond) {
+            console.warn('Per-second rate limit reached');
+            return false;
+        }
+
+        // Check per-minute limit
+        if (this.requestHistory.length >= this.maxRequestsPerMinute) {
+            console.warn('Per-minute rate limit reached');
+            return false;
+        }
+
+        // Record this request
+        this.requestHistory.push(now);
+        return true;
+    }
+
+    private async startProcessing(): Promise<void> {
+        if (this.isProcessing) return;
+        this.isProcessing = true;
+
+        while (true) {
+            await this.processQueue();
+            await this.delay(100); // Check queue every 100ms
+        }
+    }
+
+    private async processQueue(): Promise<void> {
+        // Check if we're paused or can't process more requests
+        if (this.isPaused || this.activeRequests.size >= this.maxConcurrent || this.queue.length === 0) {
+            return;
+        }
+
+        // Check if we're online
+        if (!this.connectionManager.isOnline()) {
+            console.log('Queue processing paused - offline');
+            return;
+        }
+
+        // Get next request
+        const request = this.queue.shift();
+        if (!request) return;
+
+        // Mark as active
+        this.activeRequests.set(request.id, request);
+
+        try {
+            // Execute request with connection manager retry logic
+            const result = await this.connectionManager.retryRequest(
+                request.request,
+                {
+                    retries: this.maxRetries - request.retryCount,
+                    delay: this.calculateRetryDelay(request.retryCount + 1),
+                    onRetry: (attempt, error) => {
+                        console.log(`Retry ${attempt} for ${request.type}: ${error.message}`);
+                    }
+                }
+            );
+            
+            request.resolve(result);
+            console.log(`Request completed: ${request.type}`);
+        } catch (error) {
+            console.error(`Request failed after retries: ${request.type}`, error);
+            
+            // Check if it's a connection error and we should queue for later
+            if (this.isConnectionError(error) && request.retryCount < this.maxRetries) {
+                request.retryCount++;
+                console.log(`Re-queuing ${request.type} due to connection error`);
+                
+                // Re-queue with higher priority
+                request.priority = Math.max(request.priority + 1, 10);
+                this.addToQueue(request);
+            } else {
+                // Non-recoverable error or max retries reached
+                request.reject(error);
+            }
+        } finally {
+            // Remove from active
+            this.activeRequests.delete(request.id);
+        }
+    }
+
+    // Note: shouldRetry logic is now handled by ConnectionManager
+    // Keeping for reference but not used directly
+
+    private calculateRetryDelay(retryCount: number): number {
+        // Exponential backoff with jitter
+        const baseDelay = this.retryDelay * Math.pow(2, retryCount - 1);
+        const jitter = Math.random() * 0.3 * baseDelay; // 30% jitter
+        return Math.min(baseDelay + jitter, 30000); // Max 30 seconds
+    }
+
+    private generateId(): string {
+        return `${Date.now()}-${Math.random().toString(36).substr(2, 9)}`;
+    }
+
+    private delay(ms: number): Promise<void> {
+        return new Promise(resolve => setTimeout(resolve, ms));
+    }
+
+    // Get queue status
+    getStatus(): {
+        queueLength: number;
+        activeRequests: number;
+        requestsPerMinute: number;
+    } {
+        const now = Date.now();
+        const recentRequests = this.requestHistory.filter(
+            time => now - time < 60000
+        ).length;
+
+        return {
+            queueLength: this.queue.length,
+            activeRequests: this.activeRequests.size,
+            requestsPerMinute: recentRequests
+        };
+    }
+
+    // Clear queue (for emergency use)
+    clearQueue(): void {
+        this.queue.forEach(request => {
+            request.reject(new Error('Queue cleared'));
+        });
+        this.queue = [];
+    }
+    
+    // Clear stuck requests (requests older than 60 seconds)
+    clearStuckRequests(): void {
+        const now = Date.now();
+        const stuckThreshold = 60000; // 60 seconds
+        
+        // Clear stuck active requests
+        this.activeRequests.forEach((request, id) => {
+            if (now - request.timestamp > stuckThreshold) {
+                console.warn(`Clearing stuck active request: ${request.type}`);
+                request.reject(new Error('Request timeout - cleared by recovery'));
+                this.activeRequests.delete(id);
+            }
+        });
+        
+        // Clear old queued requests
+        this.queue = this.queue.filter(request => {
+            if (now - request.timestamp > stuckThreshold) {
+                console.warn(`Clearing stuck queued request: ${request.type}`);
+                request.reject(new Error('Request timeout - cleared by recovery'));
+                return false;
+            }
+            return true;
+        });
+    }
+
+    // Update settings
+    updateSettings(settings: {
+        maxConcurrent?: number;
+        maxRequestsPerMinute?: number;
+        maxRequestsPerSecond?: number;
+    }): void {
+        if (settings.maxConcurrent !== undefined) {
+            this.maxConcurrent = settings.maxConcurrent;
+        }
+        if (settings.maxRequestsPerMinute !== undefined) {
+            this.maxRequestsPerMinute = settings.maxRequestsPerMinute;
+        }
+        if (settings.maxRequestsPerSecond !== undefined) {
+            this.maxRequestsPerSecond = settings.maxRequestsPerSecond;
+        }
+    }
+    
+    // Handle connection state changes
+    private handleConnectionStateChange(state: ConnectionState): void {
+        console.log(`Connection state changed: ${state.status}`);
+        
+        if (state.status === 'offline' || state.status === 'error') {
+            // Pause processing when offline
+            this.isPaused = true;
+            
+            // Notify queued requests about offline status
+            if (this.queue.length > 0) {
+                console.log(`${this.queue.length} requests queued while offline`);
+            }
+        } else if (state.status === 'online') {
+            // Resume processing when back online
+            this.isPaused = false;
+            console.log('Connection restored, resuming queue processing');
+            
+            // Process any queued requests
+            if (this.queue.length > 0) {
+                console.log(`Processing ${this.queue.length} queued requests`);
+            }
+        }
+    }
+    
+    // Check if error is connection-related
+    private isConnectionError(error: any): boolean {
+        const errorMessage = error.message?.toLowerCase() || '';
+        const connectionErrors = [
+            'network',
+            'fetch',
+            'connection',
+            'timeout',
+            'offline',
+            'cors'
+        ];
+        
+        return connectionErrors.some(e => errorMessage.includes(e));
+    }
+    
+    // Pause queue processing
+    pause(): void {
+        this.isPaused = true;
+        console.log('Request queue paused');
+    }
+    
+    // Resume queue processing
+    resume(): void {
+        this.isPaused = false;
+        console.log('Request queue resumed');
+    }
+    
+    // Get number of queued requests by type
+    getQueuedByType(): { transcribe: number; translate: number; tts: number } {
+        const counts = { transcribe: 0, translate: 0, tts: 0 };
+        
+        this.queue.forEach(request => {
+            counts[request.type]++;
+        });
+        
+        return counts;
+    }
+}
--- a/static/js/src/speakerManager.ts
+++ b/static/js/src/speakerManager.ts
@@ -0,0 +1,270 @@
+// Speaker management for multi-speaker support
+export interface Speaker {
+    id: string;
+    name: string;
+    language: string;
+    color: string;
+    avatar?: string;
+    isActive: boolean;
+    lastActiveTime?: number;
+}
+
+export interface SpeakerTranscription {
+    speakerId: string;
+    text: string;
+    language: string;
+    timestamp: number;
+}
+
+export interface ConversationEntry {
+    id: string;
+    speakerId: string;
+    originalText: string;
+    originalLanguage: string;
+    translations: Map<string, string>; // languageCode -> translatedText
+    timestamp: number;
+    audioUrl?: string;
+}
+
+export class SpeakerManager {
+    private static instance: SpeakerManager;
+    private speakers: Map<string, Speaker> = new Map();
+    private conversation: ConversationEntry[] = [];
+    private activeSpeakerId: string | null = null;
+    private maxConversationLength = 100;
+    
+    // Predefined colors for speakers
+    private speakerColors = [
+        '#007bff', '#28a745', '#dc3545', '#ffc107', 
+        '#17a2b8', '#6f42c1', '#e83e8c', '#fd7e14'
+    ];
+    
+    private constructor() {
+        this.loadFromLocalStorage();
+    }
+    
+    static getInstance(): SpeakerManager {
+        if (!SpeakerManager.instance) {
+            SpeakerManager.instance = new SpeakerManager();
+        }
+        return SpeakerManager.instance;
+    }
+    
+    // Add a new speaker
+    addSpeaker(name: string, language: string): Speaker {
+        const id = this.generateSpeakerId();
+        const colorIndex = this.speakers.size % this.speakerColors.length;
+        
+        const speaker: Speaker = {
+            id,
+            name,
+            language,
+            color: this.speakerColors[colorIndex],
+            isActive: false,
+            avatar: this.generateAvatar(name)
+        };
+        
+        this.speakers.set(id, speaker);
+        this.saveToLocalStorage();
+        
+        return speaker;
+    }
+    
+    // Update speaker
+    updateSpeaker(id: string, updates: Partial<Speaker>): void {
+        const speaker = this.speakers.get(id);
+        if (speaker) {
+            Object.assign(speaker, updates);
+            this.saveToLocalStorage();
+        }
+    }
+    
+    // Remove speaker
+    removeSpeaker(id: string): void {
+        this.speakers.delete(id);
+        if (this.activeSpeakerId === id) {
+            this.activeSpeakerId = null;
+        }
+        this.saveToLocalStorage();
+    }
+    
+    // Get all speakers
+    getAllSpeakers(): Speaker[] {
+        return Array.from(this.speakers.values());
+    }
+    
+    // Get speaker by ID
+    getSpeaker(id: string): Speaker | undefined {
+        return this.speakers.get(id);
+    }
+    
+    // Set active speaker
+    setActiveSpeaker(id: string | null): void {
+        // Deactivate all speakers
+        this.speakers.forEach(speaker => {
+            speaker.isActive = false;
+        });
+        
+        // Activate selected speaker
+        if (id && this.speakers.has(id)) {
+            const speaker = this.speakers.get(id)!;
+            speaker.isActive = true;
+            speaker.lastActiveTime = Date.now();
+            this.activeSpeakerId = id;
+        } else {
+            this.activeSpeakerId = null;
+        }
+        
+        this.saveToLocalStorage();
+    }
+    
+    // Get active speaker
+    getActiveSpeaker(): Speaker | null {
+        return this.activeSpeakerId ? this.speakers.get(this.activeSpeakerId) || null : null;
+    }
+    
+    // Add conversation entry
+    addConversationEntry(
+        speakerId: string,
+        originalText: string,
+        originalLanguage: string
+    ): ConversationEntry {
+        const entry: ConversationEntry = {
+            id: this.generateEntryId(),
+            speakerId,
+            originalText,
+            originalLanguage,
+            translations: new Map(),
+            timestamp: Date.now()
+        };
+        
+        this.conversation.push(entry);
+        
+        // Limit conversation length
+        if (this.conversation.length > this.maxConversationLength) {
+            this.conversation.shift();
+        }
+        
+        this.saveToLocalStorage();
+        return entry;
+    }
+    
+    // Add translation to conversation entry
+    addTranslation(entryId: string, language: string, translatedText: string): void {
+        const entry = this.conversation.find(e => e.id === entryId);
+        if (entry) {
+            entry.translations.set(language, translatedText);
+            this.saveToLocalStorage();
+        }
+    }
+    
+    // Get conversation for a specific language
+    getConversationInLanguage(language: string): Array<{
+        speakerId: string;
+        speakerName: string;
+        speakerColor: string;
+        text: string;
+        timestamp: number;
+        isOriginal: boolean;
+    }> {
+        return this.conversation.map(entry => {
+            const speaker = this.speakers.get(entry.speakerId);
+            const isOriginal = entry.originalLanguage === language;
+            const text = isOriginal ? 
+                entry.originalText : 
+                entry.translations.get(language) || `[Translating from ${entry.originalLanguage}...]`;
+            
+            return {
+                speakerId: entry.speakerId,
+                speakerName: speaker?.name || 'Unknown',
+                speakerColor: speaker?.color || '#666',
+                text,
+                timestamp: entry.timestamp,
+                isOriginal
+            };
+        });
+    }
+    
+    // Get full conversation history
+    getFullConversation(): ConversationEntry[] {
+        return [...this.conversation];
+    }
+    
+    // Clear conversation
+    clearConversation(): void {
+        this.conversation = [];
+        this.saveToLocalStorage();
+    }
+    
+    // Generate unique speaker ID
+    private generateSpeakerId(): string {
+        return `speaker_${Date.now()}_${Math.random().toString(36).substr(2, 9)}`;
+    }
+    
+    // Generate unique entry ID
+    private generateEntryId(): string {
+        return `entry_${Date.now()}_${Math.random().toString(36).substr(2, 9)}`;
+    }
+    
+    // Generate avatar initials
+    private generateAvatar(name: string): string {
+        const parts = name.trim().split(' ');
+        if (parts.length >= 2) {
+            return parts[0][0].toUpperCase() + parts[1][0].toUpperCase();
+        }
+        return name.substr(0, 2).toUpperCase();
+    }
+    
+    // Save to localStorage
+    private saveToLocalStorage(): void {
+        try {
+            const data = {
+                speakers: Array.from(this.speakers.entries()),
+                conversation: this.conversation.map(entry => ({
+                    ...entry,
+                    translations: Array.from(entry.translations.entries())
+                })),
+                activeSpeakerId: this.activeSpeakerId
+            };
+            localStorage.setItem('speakerData', JSON.stringify(data));
+        } catch (error) {
+            console.error('Failed to save speaker data:', error);
+        }
+    }
+    
+    // Load from localStorage
+    private loadFromLocalStorage(): void {
+        try {
+            const saved = localStorage.getItem('speakerData');
+            if (saved) {
+                const data = JSON.parse(saved);
+                
+                // Restore speakers
+                if (data.speakers) {
+                    this.speakers = new Map(data.speakers);
+                }
+                
+                // Restore conversation with Map translations
+                if (data.conversation) {
+                    this.conversation = data.conversation.map((entry: any) => ({
+                        ...entry,
+                        translations: new Map(entry.translations || [])
+                    }));
+                }
+                
+                // Restore active speaker
+                this.activeSpeakerId = data.activeSpeakerId || null;
+            }
+        } catch (error) {
+            console.error('Failed to load speaker data:', error);
+        }
+    }
+    
+    // Export conversation as text
+    exportConversation(language: string): string {
+        const entries = this.getConversationInLanguage(language);
+        return entries.map(entry => 
+            `[${new Date(entry.timestamp).toLocaleTimeString()}] ${entry.speakerName}: ${entry.text}`
+        ).join('\n');
+    }
+}
--- a/static/js/src/streamingTranslation.ts
+++ b/static/js/src/streamingTranslation.ts
@@ -0,0 +1,250 @@
+// Streaming translation implementation for reduced latency
+import { Validator } from './validator';
+import { PerformanceMonitor } from './performanceMonitor';
+
+export interface StreamChunk {
+    type: 'start' | 'chunk' | 'complete' | 'error';
+    text?: string;
+    full_text?: string;
+    error?: string;
+    source_lang?: string;
+    target_lang?: string;
+}
+
+export class StreamingTranslation {
+    private eventSource: EventSource | null = null;
+    private abortController: AbortController | null = null;
+    private performanceMonitor = PerformanceMonitor.getInstance();
+    private firstChunkReceived = false;
+    
+    constructor(
+        private onChunk: (text: string) => void,
+        private onComplete: (fullText: string) => void,
+        private onError: (error: string) => void,
+        private onStart?: () => void
+    ) {}
+    
+    async startStreaming(
+        text: string,
+        sourceLang: string,
+        targetLang: string,
+        useStreaming: boolean = true
+    ): Promise<void> {
+        // Cancel any existing stream
+        this.cancel();
+        
+        // Validate inputs
+        const sanitizedText = Validator.sanitizeText(text);
+        if (!sanitizedText) {
+            this.onError('No text to translate');
+            return;
+        }
+        
+        if (!useStreaming) {
+            // Fall back to regular translation
+            await this.fallbackToRegularTranslation(sanitizedText, sourceLang, targetLang);
+            return;
+        }
+        
+        try {
+            // Check if browser supports EventSource
+            if (!window.EventSource) {
+                console.warn('EventSource not supported, falling back to regular translation');
+                await this.fallbackToRegularTranslation(sanitizedText, sourceLang, targetLang);
+                return;
+            }
+            
+            // Notify start
+            if (this.onStart) {
+                this.onStart();
+            }
+            
+            // Start performance timing
+            this.performanceMonitor.startTimer('streaming_translation');
+            this.firstChunkReceived = false;
+            
+            // Create abort controller for cleanup
+            this.abortController = new AbortController();
+            
+            // Start streaming request
+            const response = await fetch('/translate/stream', {
+                method: 'POST',
+                headers: {
+                    'Content-Type': 'application/json',
+                },
+                body: JSON.stringify({
+                    text: sanitizedText,
+                    source_lang: sourceLang,
+                    target_lang: targetLang
+                }),
+                signal: this.abortController.signal
+            });
+            
+            if (!response.ok) {
+                throw new Error(`HTTP error! status: ${response.status}`);
+            }
+            
+            // Check if response is event-stream
+            const contentType = response.headers.get('content-type');
+            if (!contentType || !contentType.includes('text/event-stream')) {
+                throw new Error('Server does not support streaming');
+            }
+            
+            // Process the stream
+            await this.processStream(response);
+            
+        } catch (error: any) {
+            if (error.name === 'AbortError') {
+                console.log('Stream cancelled');
+                return;
+            }
+            
+            console.error('Streaming error:', error);
+            
+            // Fall back to regular translation on error
+            await this.fallbackToRegularTranslation(sanitizedText, sourceLang, targetLang);
+        }
+    }
+    
+    private async processStream(response: Response): Promise<void> {
+        const reader = response.body?.getReader();
+        if (!reader) {
+            throw new Error('No response body');
+        }
+        
+        const decoder = new TextDecoder();
+        let buffer = '';
+        
+        try {
+            while (true) {
+                const { done, value } = await reader.read();
+                
+                if (done) {
+                    break;
+                }
+                
+                buffer += decoder.decode(value, { stream: true });
+                
+                // Process complete SSE messages
+                const lines = buffer.split('\n');
+                buffer = lines.pop() || ''; // Keep incomplete line in buffer
+                
+                for (const line of lines) {
+                    if (line.startsWith('data: ')) {
+                        try {
+                            const data = JSON.parse(line.slice(6)) as StreamChunk;
+                            this.handleStreamChunk(data);
+                        } catch (e) {
+                            console.error('Failed to parse SSE data:', e);
+                        }
+                    }
+                }
+            }
+        } finally {
+            reader.releaseLock();
+        }
+    }
+    
+    private handleStreamChunk(chunk: StreamChunk): void {
+        switch (chunk.type) {
+            case 'start':
+                console.log('Translation started:', chunk.source_lang, '->', chunk.target_lang);
+                break;
+                
+            case 'chunk':
+                if (chunk.text) {
+                    // Record time to first byte
+                    if (!this.firstChunkReceived) {
+                        this.firstChunkReceived = true;
+                        this.performanceMonitor.measureTTFB('streaming_translation', performance.now());
+                    }
+                    this.onChunk(chunk.text);
+                }
+                break;
+                
+            case 'complete':
+                if (chunk.full_text) {
+                    // End performance timing
+                    this.performanceMonitor.endTimer('streaming_translation');
+                    this.onComplete(chunk.full_text);
+                    
+                    // Log performance stats periodically
+                    if (Math.random() < 0.1) { // 10% of the time
+                        this.performanceMonitor.logPerformanceStats();
+                    }
+                }
+                break;
+                
+            case 'error':
+                this.onError(chunk.error || 'Unknown streaming error');
+                break;
+        }
+    }
+    
+    private async fallbackToRegularTranslation(
+        text: string,
+        sourceLang: string,
+        targetLang: string
+    ): Promise<void> {
+        try {
+            const response = await fetch('/translate', {
+                method: 'POST',
+                headers: {
+                    'Content-Type': 'application/json',
+                },
+                body: JSON.stringify({
+                    text: text,
+                    source_lang: sourceLang,
+                    target_lang: targetLang
+                })
+            });
+            
+            if (!response.ok) {
+                throw new Error(`HTTP error! status: ${response.status}`);
+            }
+            
+            const data = await response.json();
+            
+            if (data.success && data.translation) {
+                // Simulate streaming by showing text progressively
+                this.simulateStreaming(data.translation);
+            } else {
+                this.onError(data.error || 'Translation failed');
+            }
+        } catch (error: any) {
+            this.onError(error.message || 'Translation failed');
+        }
+    }
+    
+    private simulateStreaming(text: string): void {
+        // Simulate streaming for better UX even with non-streaming response
+        const words = text.split(' ');
+        let index = 0;
+        let accumulated = '';
+        
+        const interval = setInterval(() => {
+            if (index >= words.length) {
+                clearInterval(interval);
+                this.onComplete(accumulated.trim());
+                return;
+            }
+            
+            const chunk = words[index] + (index < words.length - 1 ? ' ' : '');
+            accumulated += chunk;
+            this.onChunk(chunk);
+            index++;
+        }, 50); // 50ms between words for smooth appearance
+    }
+    
+    cancel(): void {
+        if (this.abortController) {
+            this.abortController.abort();
+            this.abortController = null;
+        }
+        
+        if (this.eventSource) {
+            this.eventSource.close();
+            this.eventSource = null;
+        }
+    }
+}
--- a/static/js/src/translationCache.ts
+++ b/static/js/src/translationCache.ts
@@ -0,0 +1,243 @@
+// Translation cache management for offline support
+import { TranslationCacheEntry, CacheStats } from './types';
+import { Validator } from './validator';
+
+export class TranslationCache {
+    private static DB_NAME = 'VoiceTranslatorDB';
+    private static DB_VERSION = 2; // Increment version for cache store
+    private static CACHE_STORE = 'translationCache';
+    // private static MAX_CACHE_SIZE = 50 * 1024 * 1024; // 50MB limit - Reserved for future use
+    private static MAX_ENTRIES = 1000; // Maximum number of cached translations
+    private static CACHE_EXPIRY_DAYS = 30; // Expire entries after 30 days
+
+    // Generate cache key from input parameters
+    static generateCacheKey(text: string, sourceLang: string, targetLang: string): string {
+        // Normalize and sanitize text to create a consistent key
+        const normalizedText = text.trim().toLowerCase();
+        const sanitized = Validator.sanitizeCacheKey(normalizedText);
+        return `${sourceLang}:${targetLang}:${sanitized}`;
+    }
+
+    // Open or create the cache database
+    static async openDB(): Promise<IDBDatabase> {
+        return new Promise((resolve, reject) => {
+            const request = indexedDB.open(this.DB_NAME, this.DB_VERSION);
+            
+            request.onupgradeneeded = (event: IDBVersionChangeEvent) => {
+                const db = (event.target as IDBOpenDBRequest).result;
+                
+                // Create cache store if it doesn't exist
+                if (!db.objectStoreNames.contains(this.CACHE_STORE)) {
+                    const store = db.createObjectStore(this.CACHE_STORE, { keyPath: 'key' });
+                    store.createIndex('timestamp', 'timestamp', { unique: false });
+                    store.createIndex('lastAccessed', 'lastAccessed', { unique: false });
+                    store.createIndex('sourceLanguage', 'sourceLanguage', { unique: false });
+                    store.createIndex('targetLanguage', 'targetLanguage', { unique: false });
+                }
+            };
+            
+            request.onsuccess = (event: Event) => {
+                resolve((event.target as IDBOpenDBRequest).result);
+            };
+            
+            request.onerror = () => {
+                reject('Failed to open translation cache database');
+            };
+        });
+    }
+
+    // Get cached translation
+    static async getCachedTranslation(
+        text: string, 
+        sourceLang: string, 
+        targetLang: string
+    ): Promise<string | null> {
+        try {
+            const db = await this.openDB();
+            const transaction = db.transaction([this.CACHE_STORE], 'readwrite');
+            const store = transaction.objectStore(this.CACHE_STORE);
+            
+            const key = this.generateCacheKey(text, sourceLang, targetLang);
+            const request = store.get(key);
+            
+            return new Promise((resolve) => {
+                request.onsuccess = (event: Event) => {
+                    const entry = (event.target as IDBRequest).result as TranslationCacheEntry;
+                    
+                    if (entry) {
+                        // Check if entry is not expired
+                        const expiryTime = entry.timestamp + (this.CACHE_EXPIRY_DAYS * 24 * 60 * 60 * 1000);
+                        if (Date.now() < expiryTime) {
+                            // Update access count and last accessed time
+                            entry.accessCount++;
+                            entry.lastAccessed = Date.now();
+                            store.put(entry);
+                            
+                            console.log(`Cache hit for translation: ${sourceLang} -> ${targetLang}`);
+                            resolve(entry.targetText);
+                        } else {
+                            // Entry expired, delete it
+                            store.delete(key);
+                            resolve(null);
+                        }
+                    } else {
+                        resolve(null);
+                    }
+                };
+                
+                request.onerror = () => {
+                    console.error('Failed to get cached translation');
+                    resolve(null);
+                };
+            });
+        } catch (error) {
+            console.error('Cache lookup error:', error);
+            return null;
+        }
+    }
+
+    // Save translation to cache
+    static async cacheTranslation(
+        sourceText: string,
+        sourceLang: string,
+        targetText: string,
+        targetLang: string
+    ): Promise<void> {
+        try {
+            const db = await this.openDB();
+            const transaction = db.transaction([this.CACHE_STORE], 'readwrite');
+            const store = transaction.objectStore(this.CACHE_STORE);
+            
+            const key = this.generateCacheKey(sourceText, sourceLang, targetLang);
+            const entry: TranslationCacheEntry = {
+                key,
+                sourceText,
+                sourceLanguage: sourceLang,
+                targetText,
+                targetLanguage: targetLang,
+                timestamp: Date.now(),
+                accessCount: 1,
+                lastAccessed: Date.now()
+            };
+            
+            // Check cache size before adding
+            await this.ensureCacheSize(db);
+            
+            store.put(entry);
+            console.log(`Cached translation: ${sourceLang} -> ${targetLang}`);
+        } catch (error) {
+            console.error('Failed to cache translation:', error);
+        }
+    }
+
+    // Ensure cache doesn't exceed size limits
+    static async ensureCacheSize(db: IDBDatabase): Promise<void> {
+        const transaction = db.transaction([this.CACHE_STORE], 'readwrite');
+        const store = transaction.objectStore(this.CACHE_STORE);
+        
+        // Count entries
+        const countRequest = store.count();
+        
+        countRequest.onsuccess = async () => {
+            const count = countRequest.result;
+            
+            if (count >= this.MAX_ENTRIES) {
+                // Delete least recently accessed entries
+                const index = store.index('lastAccessed');
+                const cursor = index.openCursor();
+                let deleted = 0;
+                const toDelete = Math.floor(count * 0.2); // Delete 20% of entries
+                
+                cursor.onsuccess = (event: Event) => {
+                    const cursor = (event.target as IDBRequest).result;
+                    if (cursor && deleted < toDelete) {
+                        cursor.delete();
+                        deleted++;
+                        cursor.continue();
+                    }
+                };
+            }
+        };
+    }
+
+    // Get cache statistics
+    static async getCacheStats(): Promise<CacheStats> {
+        try {
+            const db = await this.openDB();
+            const transaction = db.transaction([this.CACHE_STORE], 'readonly');
+            const store = transaction.objectStore(this.CACHE_STORE);
+            
+            return new Promise((resolve) => {
+                const stats: CacheStats = {
+                    totalEntries: 0,
+                    totalSize: 0,
+                    oldestEntry: Date.now(),
+                    newestEntry: 0
+                };
+                
+                const countRequest = store.count();
+                countRequest.onsuccess = () => {
+                    stats.totalEntries = countRequest.result;
+                };
+                
+                const cursor = store.openCursor();
+                cursor.onsuccess = (event: Event) => {
+                    const cursor = (event.target as IDBRequest).result;
+                    if (cursor) {
+                        const entry = cursor.value as TranslationCacheEntry;
+                        // Estimate size (rough calculation)
+                        stats.totalSize += (entry.sourceText.length + entry.targetText.length) * 2;
+                        stats.oldestEntry = Math.min(stats.oldestEntry, entry.timestamp);
+                        stats.newestEntry = Math.max(stats.newestEntry, entry.timestamp);
+                        cursor.continue();
+                    } else {
+                        resolve(stats);
+                    }
+                };
+            });
+        } catch (error) {
+            console.error('Failed to get cache stats:', error);
+            return {
+                totalEntries: 0,
+                totalSize: 0,
+                oldestEntry: 0,
+                newestEntry: 0
+            };
+        }
+    }
+
+    // Clear all cache
+    static async clearCache(): Promise<void> {
+        try {
+            const db = await this.openDB();
+            const transaction = db.transaction([this.CACHE_STORE], 'readwrite');
+            const store = transaction.objectStore(this.CACHE_STORE);
+            store.clear();
+            console.log('Translation cache cleared');
+        } catch (error) {
+            console.error('Failed to clear cache:', error);
+        }
+    }
+
+    // Export cache for backup
+    static async exportCache(): Promise<TranslationCacheEntry[]> {
+        try {
+            const db = await this.openDB();
+            const transaction = db.transaction([this.CACHE_STORE], 'readonly');
+            const store = transaction.objectStore(this.CACHE_STORE);
+            const request = store.getAll();
+            
+            return new Promise((resolve) => {
+                request.onsuccess = () => {
+                    resolve(request.result);
+                };
+                request.onerror = () => {
+                    resolve([]);
+                };
+            });
+        } catch (error) {
+            console.error('Failed to export cache:', error);
+            return [];
+        }
+    }
+}
--- a/static/js/src/types.ts
+++ b/static/js/src/types.ts
@@ -0,0 +1,109 @@
+// Type definitions for Talk2Me application
+
+export interface TranscriptionResponse {
+  success: boolean;
+  text?: string;
+  error?: string;
+  detected_language?: string;
+}
+
+export interface TranslationResponse {
+  success: boolean;
+  translation?: string;
+  error?: string;
+}
+
+export interface TTSResponse {
+  success: boolean;
+  audio_url?: string;
+  error?: string;
+}
+
+export interface TTSServerStatus {
+  status: 'online' | 'error' | 'auth_error';
+  message: string;
+  url: string;
+  code?: number;
+}
+
+export interface TTSConfigUpdate {
+  server_url?: string;
+  api_key?: string;
+}
+
+export interface TTSConfigResponse {
+  success: boolean;
+  message?: string;
+  url?: string;
+  error?: string;
+}
+
+export interface TranslationRequest {
+  text: string;
+  source_lang: string;
+  target_lang: string;
+}
+
+export interface TTSRequest {
+  text: string;
+  language: string;
+}
+
+export interface PushPublicKeyResponse {
+  publicKey: string;
+}
+
+export interface IndexedDBRecord {
+  timestamp: string;
+}
+
+export interface TranscriptionRecord extends IndexedDBRecord {
+  text: string;
+  language: string;
+}
+
+export interface TranslationRecord extends IndexedDBRecord {
+  sourceText: string;
+  sourceLanguage: string;
+  targetText: string;
+  targetLanguage: string;
+}
+
+export interface TranslationCacheEntry {
+  key: string;
+  sourceText: string;
+  sourceLanguage: string;
+  targetText: string;
+  targetLanguage: string;
+  timestamp: number;
+  accessCount: number;
+  lastAccessed: number;
+}
+
+export interface CacheStats {
+  totalEntries: number;
+  totalSize: number;
+  oldestEntry: number;
+  newestEntry: number;
+}
+
+// Service Worker types
+export interface PeriodicSyncManager {
+  register(tag: string, options?: { minInterval: number }): Promise<void>;
+}
+
+export interface ServiceWorkerRegistrationExtended extends ServiceWorkerRegistration {
+  periodicSync?: PeriodicSyncManager;
+}
+
+// Extend window interface for PWA features
+declare global {
+  interface Window {
+    deferredPrompt?: BeforeInstallPromptEvent;
+  }
+}
+
+export interface BeforeInstallPromptEvent extends Event {
+  prompt(): Promise<void>;
+  userChoice: Promise<{ outcome: 'accepted' | 'dismissed' }>;
+}
--- a/static/js/src/validator.ts
+++ b/static/js/src/validator.ts
@@ -0,0 +1,259 @@
+// Input validation and sanitization utilities
+export class Validator {
+    // Sanitize HTML to prevent XSS attacks
+    static sanitizeHTML(input: string): string {
+        // Create a temporary div element
+        const temp = document.createElement('div');
+        temp.textContent = input;
+        return temp.innerHTML;
+    }
+
+    // Validate and sanitize text input
+    static sanitizeText(input: string, maxLength: number = 10000): string {
+        if (typeof input !== 'string') {
+            return '';
+        }
+        
+        // Trim and limit length
+        let sanitized = input.trim().substring(0, maxLength);
+        
+        // Remove null bytes
+        sanitized = sanitized.replace(/\0/g, '');
+        
+        // Remove control characters except newlines and tabs
+        sanitized = sanitized.replace(/[\x00-\x08\x0B\x0C\x0E-\x1F\x7F]/g, '');
+        
+        return sanitized;
+    }
+
+    // Validate language code
+    static validateLanguageCode(code: string, allowedLanguages: string[]): string | null {
+        if (!code || typeof code !== 'string') {
+            return null;
+        }
+        
+        const sanitized = code.trim().toLowerCase();
+        
+        // Check if it's in the allowed list
+        if (allowedLanguages.includes(sanitized) || sanitized === 'auto') {
+            return sanitized;
+        }
+        
+        return null;
+    }
+
+    // Validate file upload
+    static validateAudioFile(file: File): { valid: boolean; error?: string } {
+        // Check if file exists
+        if (!file) {
+            return { valid: false, error: 'No file provided' };
+        }
+
+        // Check file size (max 25MB)
+        const maxSize = 25 * 1024 * 1024;
+        if (file.size > maxSize) {
+            return { valid: false, error: 'File size exceeds 25MB limit' };
+        }
+
+        // Check file type
+        const allowedTypes = [
+            'audio/webm',
+            'audio/ogg',
+            'audio/wav',
+            'audio/mp3',
+            'audio/mpeg',
+            'audio/mp4',
+            'audio/x-m4a',
+            'audio/x-wav'
+        ];
+        
+        if (!allowedTypes.includes(file.type)) {
+            // Check by extension as fallback
+            const ext = file.name.toLowerCase().split('.').pop();
+            const allowedExtensions = ['webm', 'ogg', 'wav', 'mp3', 'mp4', 'm4a'];
+            
+            if (!ext || !allowedExtensions.includes(ext)) {
+                return { valid: false, error: 'Invalid audio file type' };
+            }
+        }
+
+        return { valid: true };
+    }
+
+    // Validate URL
+    static validateURL(url: string): string | null {
+        if (!url || typeof url !== 'string') {
+            return null;
+        }
+
+        try {
+            const parsed = new URL(url);
+            
+            // Only allow http and https
+            if (!['http:', 'https:'].includes(parsed.protocol)) {
+                return null;
+            }
+            
+            // Prevent localhost in production
+            if (window.location.hostname !== 'localhost' && 
+                (parsed.hostname === 'localhost' || parsed.hostname === '127.0.0.1')) {
+                return null;
+            }
+            
+            return parsed.toString();
+        } catch (e) {
+            return null;
+        }
+    }
+
+    // Validate API key (basic format check)
+    static validateAPIKey(key: string): string | null {
+        if (!key || typeof key !== 'string') {
+            return null;
+        }
+        
+        // Trim whitespace
+        const trimmed = key.trim();
+        
+        // Check length (most API keys are 20-128 characters)
+        if (trimmed.length < 20 || trimmed.length > 128) {
+            return null;
+        }
+        
+        // Only allow alphanumeric, dash, and underscore
+        if (!/^[a-zA-Z0-9\-_]+$/.test(trimmed)) {
+            return null;
+        }
+        
+        return trimmed;
+    }
+
+    // Validate request body size
+    static validateRequestSize(data: any, maxSizeKB: number = 1024): boolean {
+        try {
+            const jsonString = JSON.stringify(data);
+            const sizeInBytes = new Blob([jsonString]).size;
+            return sizeInBytes <= maxSizeKB * 1024;
+        } catch (e) {
+            return false;
+        }
+    }
+
+    // Sanitize filename
+    static sanitizeFilename(filename: string): string {
+        if (!filename || typeof filename !== 'string') {
+            return 'file';
+        }
+        
+        // Remove path components
+        let name = filename.split(/[/\\]/).pop() || 'file';
+        
+        // Remove dangerous characters
+        name = name.replace(/[^a-zA-Z0-9.\-_]/g, '_');
+        
+        // Limit length
+        if (name.length > 255) {
+            const ext = name.split('.').pop();
+            const base = name.substring(0, 250 - (ext ? ext.length + 1 : 0));
+            name = ext ? `${base}.${ext}` : base;
+        }
+        
+        return name;
+    }
+
+    // Validate settings object
+    static validateSettings(settings: any): { valid: boolean; sanitized?: any; errors?: string[] } {
+        const errors: string[] = [];
+        const sanitized: any = {};
+
+        // Validate notification settings
+        if (settings.notificationsEnabled !== undefined) {
+            sanitized.notificationsEnabled = Boolean(settings.notificationsEnabled);
+        }
+
+        if (settings.notifyTranscription !== undefined) {
+            sanitized.notifyTranscription = Boolean(settings.notifyTranscription);
+        }
+
+        if (settings.notifyTranslation !== undefined) {
+            sanitized.notifyTranslation = Boolean(settings.notifyTranslation);
+        }
+
+        if (settings.notifyErrors !== undefined) {
+            sanitized.notifyErrors = Boolean(settings.notifyErrors);
+        }
+
+        // Validate offline mode
+        if (settings.offlineMode !== undefined) {
+            sanitized.offlineMode = Boolean(settings.offlineMode);
+        }
+
+        // Validate TTS settings
+        if (settings.ttsServerUrl !== undefined) {
+            const url = this.validateURL(settings.ttsServerUrl);
+            if (settings.ttsServerUrl && !url) {
+                errors.push('Invalid TTS server URL');
+            } else {
+                sanitized.ttsServerUrl = url;
+            }
+        }
+
+        if (settings.ttsApiKey !== undefined) {
+            const key = this.validateAPIKey(settings.ttsApiKey);
+            if (settings.ttsApiKey && !key) {
+                errors.push('Invalid API key format');
+            } else {
+                sanitized.ttsApiKey = key;
+            }
+        }
+
+        return {
+            valid: errors.length === 0,
+            sanitized: errors.length === 0 ? sanitized : undefined,
+            errors: errors.length > 0 ? errors : undefined
+        };
+    }
+
+    // Rate limiting check
+    private static requestCounts: Map<string, number[]> = new Map();
+    
+    static checkRateLimit(
+        action: string, 
+        maxRequests: number = 10, 
+        windowMs: number = 60000
+    ): boolean {
+        const now = Date.now();
+        const key = action;
+        
+        if (!this.requestCounts.has(key)) {
+            this.requestCounts.set(key, []);
+        }
+        
+        const timestamps = this.requestCounts.get(key)!;
+        
+        // Remove old timestamps
+        const cutoff = now - windowMs;
+        const recent = timestamps.filter(t => t > cutoff);
+        
+        // Check if limit exceeded
+        if (recent.length >= maxRequests) {
+            return false;
+        }
+        
+        // Add current timestamp
+        recent.push(now);
+        this.requestCounts.set(key, recent);
+        
+        return true;
+    }
+
+    // Validate translation cache key
+    static sanitizeCacheKey(key: string): string {
+        if (!key || typeof key !== 'string') {
+            return '';
+        }
+        
+        // Remove special characters that might cause issues
+        return key.replace(/[^\w\s-]/gi, '').substring(0, 500);
+    }
+}
--- a/static/manifest.json
+++ b/static/manifest.json
@@ -1,5 +1,5 @@
 {
-  "name": "Voice Language Translator",
+  "name": "Talk2Me",
  "short_name": "Translator",
  "description": "Translate spoken language between multiple languages with speech input and output",
  "start_url": "/",
--- a/static/service-worker.js
+++ b/static/service-worker.js
@@ -1,13 +1,15 @@
-// Service Worker for Voice Language Translator PWA
+// Service Worker for Talk2Me PWA

 const CACHE_NAME = 'voice-translator-v1';
 const ASSETS_TO_CACHE = [
  '/',
  '/static/css/styles.css',
-  '/static/js/app.js',
+  '/static/js/dist/app.js',
  '/static/icons/icon-192x192.png',
  '/static/icons/icon-512x512.png',
-  '/static/icons/favicon.ico'
+  '/static/icons/favicon.ico',
+  'https://cdn.jsdelivr.net/npm/bootstrap@5.3.0-alpha1/dist/css/bootstrap.min.css',
+  'https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0/css/all.min.css'
 ];

 // Install event - cache essential assets
@@ -90,15 +92,34 @@ self.addEventListener('fetch', (event) => {

 // Handle push notifications
 self.addEventListener('push', (event) => {
+  if (!event.data) {
+    return;
+  }
+  
  const data = event.data.json();
  const options = {
    body: data.body || 'New translation available',
-    icon: '/static/icons/icon-192x192.png',
-    badge: '/static/icons/badge-72x72.png',
+    icon: data.icon || '/static/icons/icon-192x192.png',
+    badge: data.badge || '/static/icons/icon-192x192.png',
    vibrate: [100, 50, 100],
+    tag: data.tag || 'talk2me-notification',
+    requireInteraction: false,
+    silent: false,
    data: {
-      url: data.url || '/'
+      url: data.url || '/',
+      ...data.data
+    },
+    actions: [
+      {
+        action: 'view',
+        title: 'View',
+        icon: '/static/icons/icon-192x192.png'
+      },
+      {
+        action: 'close',
+        title: 'Close'
      }
+    ]
  };

  event.waitUntil(
@@ -109,7 +130,55 @@ self.addEventListener('push', (event) => {
 // Handle notification click
 self.addEventListener('notificationclick', (event) => {
  event.notification.close();
+  
+  if (event.action === 'close') {
+    return;
+  }
+  
+  const urlToOpen = event.notification.data.url || '/';
+  
  event.waitUntil(
-    clients.openWindow(event.notification.data.url)
+    clients.matchAll({
+      type: 'window',
+      includeUncontrolled: true
+    }).then((windowClients) => {
+      // Check if there's already a window/tab with the app open
+      for (let client of windowClients) {
+        if (client.url === urlToOpen && 'focus' in client) {
+          return client.focus();
+        }
+      }
+      // If not, open a new window/tab
+      if (clients.openWindow) {
+        return clients.openWindow(urlToOpen);
+      }
+    })
  );
 });
+
+// Handle periodic background sync
+self.addEventListener('periodicsync', (event) => {
+  if (event.tag === 'translation-updates') {
+    event.waitUntil(checkForUpdates());
+  }
+});
+
+async function checkForUpdates() {
+  // Check for app updates or send usage statistics
+  try {
+    const response = await fetch('/api/check-updates');
+    if (response.ok) {
+      const data = await response.json();
+      if (data.hasUpdate) {
+        self.registration.showNotification('Update Available', {
+          body: 'A new version of Voice Translator is available!',
+          icon: '/static/icons/icon-192x192.png',
+          badge: '/static/icons/icon-192x192.png',
+          tag: 'update-notification'
+        });
+      }
+    }
+  } catch (error) {
+    console.error('Failed to check for updates:', error);
+  }
+}
--- a/talk2me.service
+++ b/talk2me.service
@@ -0,0 +1,66 @@
+[Unit]
+Description=Talk2Me Real-time Translation Service
+Documentation=https://github.com/your-repo/talk2me
+After=network.target
+
+[Service]
+Type=notify
+User=talk2me
+Group=talk2me
+WorkingDirectory=/opt/talk2me
+Environment="PATH=/opt/talk2me/venv/bin"
+Environment="FLASK_ENV=production"
+Environment="PYTHONUNBUFFERED=1"
+
+# Production environment variables
+EnvironmentFile=-/opt/talk2me/.env
+
+# Gunicorn command with production settings
+ExecStart=/opt/talk2me/venv/bin/gunicorn \
+    --config /opt/talk2me/gunicorn_config.py \
+    --error-logfile /var/log/talk2me/gunicorn-error.log \
+    --access-logfile /var/log/talk2me/gunicorn-access.log \
+    --log-level info \
+    wsgi:application
+
+# Reload via SIGHUP
+ExecReload=/bin/kill -s HUP $MAINPID
+
+# Graceful stop
+KillMode=mixed
+TimeoutStopSec=30
+
+# Restart policy
+Restart=always
+RestartSec=10
+StartLimitBurst=3
+StartLimitInterval=60
+
+# Security settings
+NoNewPrivileges=true
+PrivateTmp=true
+ProtectSystem=strict
+ProtectHome=true
+ProtectKernelTunables=true
+ProtectKernelModules=true
+ProtectControlGroups=true
+RestrictRealtime=true
+RestrictSUIDSGID=true
+LockPersonality=true
+
+# Allow writing to specific directories
+ReadWritePaths=/var/log/talk2me /tmp/talk2me_uploads
+
+# Resource limits
+LimitNOFILE=65536
+LimitNPROC=4096
+
+# Memory limits (adjust based on your system)
+MemoryLimit=4G
+MemoryHigh=3G
+
+# CPU limits (optional)
+# CPUQuota=200%
+
+[Install]
+WantedBy=multi-user.target
--- a/templates/index.html
+++ b/templates/index.html
@@ -3,7 +3,7 @@
 <head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=no">
-    <title>Voice Language Translator</title>
+    <title>Talk2Me</title>
    <link href="https://cdn.jsdelivr.net/npm/bootstrap@5.3.0-alpha1/dist/css/bootstrap.min.css" rel="stylesheet">
    <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0/css/all.min.css">
    <link rel="icon" href="/favicon.ico" sizes="any">
@@ -74,6 +74,7 @@
            background-color: #f8f9fa;
            border-radius: 10px;
            margin-bottom: 15px;
+            position: relative;
        }
        .btn-action {
            border-radius: 10px;
@@ -121,9 +122,28 @@
 </head>
 <body>
    <div class="container">
-        <h1 class="text-center mb-4">Voice Language Translator</h1>
+        <h1 class="text-center mb-4">Talk2Me</h1>
        <!--<p class="text-center text-muted">Powered by Gemma 3, Whisper & Edge TTS</p>-->
        
+        <!-- Multi-speaker toolbar -->
+        <div id="speakerToolbar" class="card mb-3" style="display: none;">
+            <div class="card-body p-2">
+                <div class="d-flex align-items-center justify-content-between flex-wrap">
+                    <div class="d-flex align-items-center gap-2 mb-2 mb-md-0">
+                        <button id="addSpeakerBtn" class="btn btn-sm btn-outline-primary">
+                            <i class="fas fa-user-plus"></i> Add Speaker
+                        </button>
+                        <button id="toggleMultiSpeaker" class="btn btn-sm btn-secondary">
+                            <i class="fas fa-users"></i> Multi-Speaker: <span id="multiSpeakerStatus">OFF</span>
+                        </button>
+                    </div>
+                    <div id="speakerList" class="d-flex gap-2 flex-wrap">
+                        <!-- Speaker buttons will be added here dynamically -->
+                    </div>
+                </div>
+            </div>
+        </div>
+
        <div class="row">
            <div class="col-md-6 mb-3">
                <div class="card">
@@ -132,6 +152,7 @@
                    </div>
                    <div class="card-body">
                        <select id="sourceLanguage" class="form-select language-select mb-3">
+                            <option value="auto">Auto-detect</option>
                            {% for language in languages %}
                            <option value="{{ language }}">{{ language }}</option>
                            {% endfor %}
@@ -183,6 +204,13 @@
                <i class="fas fa-microphone"></i>
            </button>
            <p class="status-indicator" id="statusIndicator">Click to start recording</p>
+            <!-- Queue Status Indicator -->
+            <div id="queueStatus" class="text-center mt-2" style="display: none;">
+                <small class="text-muted">
+                    <i class="fas fa-list"></i> Queue: <span id="queueLength">0</span> | 
+                    <i class="fas fa-sync"></i> Active: <span id="activeRequests">0</span>
+                </small>
+            </div>
        </div>

        <div class="text-center mt-3">
@@ -197,284 +225,180 @@
            </div>
        </div>
        
+        <!-- Multi-speaker conversation view -->
+        <div id="conversationView" class="card mt-4" style="display: none;">
+            <div class="card-header bg-info text-white d-flex justify-content-between align-items-center">
+                <h5 class="mb-0">Conversation</h5>
+                <div>
+                    <button id="exportConversation" class="btn btn-sm btn-light">
+                        <i class="fas fa-download"></i> Export
+                    </button>
+                    <button id="clearConversation" class="btn btn-sm btn-light">
+                        <i class="fas fa-trash"></i> Clear
+                    </button>
+                </div>
+            </div>
+            <div class="card-body" style="max-height: 400px; overflow-y: auto;">
+                <div id="conversationContent">
+                    <!-- Conversation entries will be added here -->
+                </div>
+            </div>
+        </div>
+
        <audio id="audioPlayer" style="display: none;"></audio>
+        
+        <!-- TTS Server Configuration Alert -->
+        <div id="ttsServerAlert" class="alert alert-warning d-none" role="alert">
+            <strong>TTS Server Status:</strong> <span id="ttsServerMessage">Checking...</span>
+            <div class="mt-2">
+                <input type="text" id="ttsServerUrl" class="form-control mb-2" placeholder="TTS Server URL">
+                <input type="password" id="ttsApiKey" class="form-control mb-2" placeholder="API Key">
+                <button id="updateTtsServer" class="btn btn-sm btn-primary">Update Configuration</button>
+            </div>
+        </div>
+        
+        <!-- Loading Overlay -->
+        <div id="loadingOverlay" class="loading-overlay">
+            <div class="loading-content">
+                <div class="spinner-custom"></div>
+                <p id="loadingText" class="mt-3">Processing...</p>
+            </div>
+        </div>
+        
+        <!-- Notification Settings -->
+        <div class="position-fixed bottom-0 end-0 p-3" style="z-index: 5">
+            <div id="notificationPrompt" class="toast" role="alert" aria-live="assertive" aria-atomic="true">
+                <div class="toast-header">
+                    <i class="fas fa-bell text-primary me-2"></i>
+                    <strong class="me-auto">Enable Notifications</strong>
+                    <button type="button" class="btn-close" data-bs-dismiss="toast" aria-label="Close"></button>
+                </div>
+                <div class="toast-body">
+                    Get notified when translations are complete!
+                    <div class="mt-2">
+                        <button type="button" class="btn btn-sm btn-primary" id="enableNotifications">Enable</button>
+                        <button type="button" class="btn btn-sm btn-secondary" data-bs-dismiss="toast">Not now</button>
+                    </div>
+                </div>
+            </div>
+            
+            <!-- Success Toast -->
+            <div id="successToast" class="toast align-items-center text-white bg-success border-0" role="alert" aria-live="assertive" aria-atomic="true">
+                <div class="d-flex">
+                    <div class="toast-body">
+                        <i class="fas fa-check-circle me-2"></i>
+                        <span id="successMessage">Settings saved successfully!</span>
+                    </div>
+                    <button type="button" class="btn-close btn-close-white me-2 m-auto" data-bs-dismiss="toast" aria-label="Close"></button>
+                </div>
+            </div>
+        </div>
+        
+        <!-- Settings Modal -->
+        <div class="modal fade" id="settingsModal" tabindex="-1" aria-labelledby="settingsModalLabel" aria-hidden="true">
+            <div class="modal-dialog">
+                <div class="modal-content">
+                    <div class="modal-header">
+                        <h5 class="modal-title" id="settingsModalLabel">Settings</h5>
+                        <button type="button" class="btn-close" data-bs-dismiss="modal" aria-label="Close"></button>
+                    </div>
+                    <div class="modal-body">
+                        <h6>Notifications</h6>
+                        <div class="form-check form-switch">
+                            <input class="form-check-input" type="checkbox" id="notificationToggle">
+                            <label class="form-check-label" for="notificationToggle">
+                                Enable push notifications
+                            </label>
+                        </div>
+                        <p class="text-muted small mt-2">Get notified when transcriptions and translations complete</p>
+                        
+                        <hr>
+                        
+                        <h6>Notification Types</h6>
+                        <div class="form-check">
+                            <input class="form-check-input" type="checkbox" id="notifyTranscription" checked>
+                            <label class="form-check-label" for="notifyTranscription">
+                                Transcription complete
+                            </label>
+                        </div>
+                        <div class="form-check">
+                            <input class="form-check-input" type="checkbox" id="notifyTranslation" checked>
+                            <label class="form-check-label" for="notifyTranslation">
+                                Translation complete
+                            </label>
+                        </div>
+                        <div class="form-check">
+                            <input class="form-check-input" type="checkbox" id="notifyErrors">
+                            <label class="form-check-label" for="notifyErrors">
+                                Error notifications
+                            </label>
+                        </div>
+                        
+                        <hr>
+                        
+                        <h6 class="mb-3">Translation Settings</h6>
+                        <div class="form-check form-switch mb-3">
+                            <input class="form-check-input" type="checkbox" id="streamingTranslation" checked>
+                            <label class="form-check-label" for="streamingTranslation">
+                                Enable streaming translation
+                                <small class="text-muted d-block">Shows translation as it's generated for faster feedback</small>
+                            </label>
+                        </div>
+                        <div class="form-check form-switch mb-3">
+                            <input class="form-check-input" type="checkbox" id="multiSpeakerMode">
+                            <label class="form-check-label" for="multiSpeakerMode">
+                                Enable multi-speaker mode
+                                <small class="text-muted d-block">Track multiple speakers in conversations</small>
+                            </label>
+                        </div>
+                        
+                        <hr>
+                        
+                        <h6>Offline Cache</h6>
+                        <div class="mb-3">
+                            <div class="d-flex justify-content-between align-items-center mb-2">
+                                <span>Cached translations:</span>
+                                <span id="cacheCount" class="badge bg-primary">0</span>
+                            </div>
+                            <div class="d-flex justify-content-between align-items-center mb-2">
+                                <span>Cache size:</span>
+                                <span id="cacheSize" class="badge bg-secondary">0 KB</span>
+                            </div>
+                            <div class="form-check form-switch mb-2">
+                                <input class="form-check-input" type="checkbox" id="offlineMode" checked>
+                                <label class="form-check-label" for="offlineMode">
+                                    Enable offline caching
+                                </label>
+                            </div>
+                            <button type="button" class="btn btn-sm btn-outline-danger" id="clearCache">
+                                <i class="fas fa-trash"></i> Clear Cache
+                            </button>
+                        </div>
+                    </div>
+                    <div class="modal-footer">
+                        <div id="settingsSaveStatus" class="text-success me-auto" style="display: none;">
+                            <i class="fas fa-check-circle"></i> Saved!
+                        </div>
+                        <button type="button" class="btn btn-secondary" data-bs-dismiss="modal">Close</button>
+                        <button type="button" class="btn btn-primary" id="saveSettings">Save settings</button>
+                    </div>
+                </div>
+            </div>
+        </div>
+        
+        <!-- Settings Button -->
+        <button type="button" class="btn btn-outline-secondary position-fixed top-0 end-0 m-3" data-bs-toggle="modal" data-bs-target="#settingsModal">
+            <i class="fas fa-cog"></i>
+        </button>
+        
+        <!-- Simple Success Notification -->
+        <div id="successNotification" class="success-notification">
+            <i class="fas fa-check-circle"></i>
+            <span id="successText">Settings saved successfully!</span>
+        </div>
    </div>

    <script src="https://cdn.jsdelivr.net/npm/bootstrap@5.3.0-alpha1/dist/js/bootstrap.bundle.min.js"></script>
-    <script>
-        document.addEventListener('DOMContentLoaded', function() {
-            // DOM elements
-            const recordBtn = document.getElementById('recordBtn');
-            const translateBtn = document.getElementById('translateBtn');
-            const sourceText = document.getElementById('sourceText');
-            const translatedText = document.getElementById('translatedText');
-            const sourceLanguage = document.getElementById('sourceLanguage');
-            const targetLanguage = document.getElementById('targetLanguage');
-            const playSource = document.getElementById('playSource');
-            const playTranslation = document.getElementById('playTranslation');
-            const clearSource = document.getElementById('clearSource');
-            const clearTranslation = document.getElementById('clearTranslation');
-            const statusIndicator = document.getElementById('statusIndicator');
-            const progressContainer = document.getElementById('progressContainer');
-            const progressBar = document.getElementById('progressBar');
-            const audioPlayer = document.getElementById('audioPlayer');
-
-            // Set initial values
-            let isRecording = false;
-            let mediaRecorder = null;
-            let audioChunks = [];
-            let currentSourceText = '';
-            let currentTranslationText = '';
-
-            // Make sure target language is different from source
-            if (targetLanguage.options[0].value === sourceLanguage.value) {
-                targetLanguage.selectedIndex = 1;
-            }
-
-            // Event listeners for language selection
-            sourceLanguage.addEventListener('change', function() {
-                if (targetLanguage.value === sourceLanguage.value) {
-                    for (let i = 0; i < targetLanguage.options.length; i++) {
-                        if (targetLanguage.options[i].value !== sourceLanguage.value) {
-                            targetLanguage.selectedIndex = i;
-                            break;
-                        }
-                    }
-                }
-            });
-
-            targetLanguage.addEventListener('change', function() {
-                if (targetLanguage.value === sourceLanguage.value) {
-                    for (let i = 0; i < sourceLanguage.options.length; i++) {
-                        if (sourceLanguage.options[i].value !== targetLanguage.value) {
-                            sourceLanguage.selectedIndex = i;
-                            break;
-                        }
-                    }
-                }
-            });
-
-            // Record button click event
-            recordBtn.addEventListener('click', function() {
-                if (isRecording) {
-                    stopRecording();
-                } else {
-                    startRecording();
-                }
-            });
-
-            // Function to start recording
-            function startRecording() {
-                navigator.mediaDevices.getUserMedia({ audio: true })
-                    .then(stream => {
-                        mediaRecorder = new MediaRecorder(stream);
-                        audioChunks = [];
-
-                        mediaRecorder.addEventListener('dataavailable', event => {
-                            audioChunks.push(event.data);
-                        });
-
-                        mediaRecorder.addEventListener('stop', () => {
-                            const audioBlob = new Blob(audioChunks, { type: 'audio/wav' });
-                            transcribeAudio(audioBlob);
-                        });
-
-                        mediaRecorder.start();
-                        isRecording = true;
-                        recordBtn.classList.add('recording');
-                        recordBtn.classList.replace('btn-primary', 'btn-danger');
-                        recordBtn.innerHTML = '<i class="fas fa-stop"></i>';
-                        statusIndicator.textContent = 'Recording... Click to stop';
-                    })
-                    .catch(error => {
-                        console.error('Error accessing microphone:', error);
-                        alert('Error accessing microphone. Please make sure you have given permission for microphone access.');
-                    });
-            }
-
-            // Function to stop recording
-            function stopRecording() {
-                mediaRecorder.stop();
-                isRecording = false;
-                recordBtn.classList.remove('recording');
-                recordBtn.classList.replace('btn-danger', 'btn-primary');
-                recordBtn.innerHTML = '<i class="fas fa-microphone"></i>';
-                statusIndicator.textContent = 'Processing audio...';
-                
-                // Stop all audio tracks
-                mediaRecorder.stream.getTracks().forEach(track => track.stop());
-            }
-
-            // Function to transcribe audio
-            function transcribeAudio(audioBlob) {
-                const formData = new FormData();
-                formData.append('audio', audioBlob);
-                formData.append('source_lang', sourceLanguage.value);
-
-                showProgress();
-                
-                fetch('/transcribe', {
-                    method: 'POST',
-                    body: formData
-                })
-                .then(response => response.json())
-                .then(data => {
-                    hideProgress();
-                    
-                    if (data.success) {
-                        currentSourceText = data.text;
-                        sourceText.innerHTML = `<p>${data.text}</p>`;
-                        playSource.disabled = false;
-                        translateBtn.disabled = false;
-                        statusIndicator.textContent = 'Transcription complete';
-                    } else {
-                        sourceText.innerHTML = `<p class="text-danger">Error: ${data.error}</p>`;
-                        statusIndicator.textContent = 'Transcription failed';
-                    }
-                })
-                .catch(error => {
-                    hideProgress();
-                    console.error('Transcription error:', error);
-                    sourceText.innerHTML = `<p class="text-danger">Failed to transcribe audio. Please try again.</p>`;
-                    statusIndicator.textContent = 'Transcription failed';
-                });
-            }
-
-            // Translate button click event
-            translateBtn.addEventListener('click', function() {
-                if (!currentSourceText) {
-                    return;
-                }
-
-                statusIndicator.textContent = 'Translating...';
-                showProgress();
-                
-                fetch('/translate', {
-                    method: 'POST',
-                    headers: {
-                        'Content-Type': 'application/json'
-                    },
-                    body: JSON.stringify({
-                        text: currentSourceText,
-                        source_lang: sourceLanguage.value,
-                        target_lang: targetLanguage.value
-                    })
-                })
-                .then(response => response.json())
-                .then(data => {
-                    hideProgress();
-                    
-                    if (data.success) {
-                        currentTranslationText = data.translation;
-                        translatedText.innerHTML = `<p>${data.translation}</p>`;
-                        playTranslation.disabled = false;
-                        statusIndicator.textContent = 'Translation complete';
-                    } else {
-                        translatedText.innerHTML = `<p class="text-danger">Error: ${data.error}</p>`;
-                        statusIndicator.textContent = 'Translation failed';
-                    }
-                })
-                .catch(error => {
-                    hideProgress();
-                    console.error('Translation error:', error);
-                    translatedText.innerHTML = `<p class="text-danger">Failed to translate. Please try again.</p>`;
-                    statusIndicator.textContent = 'Translation failed';
-                });
-            });
-
-            // Play source text
-            playSource.addEventListener('click', function() {
-                if (!currentSourceText) return;
-                
-                playAudio(currentSourceText, sourceLanguage.value);
-                statusIndicator.textContent = 'Playing source audio...';
-            });
-
-            // Play translation
-            playTranslation.addEventListener('click', function() {
-                if (!currentTranslationText) return;
-                
-                playAudio(currentTranslationText, targetLanguage.value);
-                statusIndicator.textContent = 'Playing translation audio...';
-            });
-
-            // Function to play audio via TTS
-            function playAudio(text, language) {
-                showProgress();
-                
-                fetch('/speak', {
-                    method: 'POST',
-                    headers: {
-                        'Content-Type': 'application/json'
-                    },
-                    body: JSON.stringify({
-                        text: text,
-                        language: language
-                    })
-                })
-                .then(response => response.json())
-                .then(data => {
-                    hideProgress();
-                    
-                    if (data.success) {
-                        audioPlayer.src = data.audio_url;
-                        audioPlayer.onended = function() {
-                            statusIndicator.textContent = 'Ready';
-                        };
-                        audioPlayer.play();
-                    } else {
-                        statusIndicator.textContent = 'TTS failed';
-                        alert('Failed to play audio: ' + data.error);
-                    }
-                })
-                .catch(error => {
-                    hideProgress();
-                    console.error('TTS error:', error);
-                    statusIndicator.textContent = 'TTS failed';
-                });
-            }
-
-            // Clear buttons
-            clearSource.addEventListener('click', function() {
-                sourceText.innerHTML = '<p class="text-muted">Your transcribed text will appear here...</p>';
-                currentSourceText = '';
-                playSource.disabled = true;
-                translateBtn.disabled = true;
-            });
-
-            clearTranslation.addEventListener('click', function() {
-                translatedText.innerHTML = '<p class="text-muted">Translation will appear here...</p>';
-                currentTranslationText = '';
-                playTranslation.disabled = true;
-            });
-
-            // Progress indicator functions
-            function showProgress() {
-                progressContainer.classList.remove('d-none');
-                let progress = 0;
-                const interval = setInterval(() => {
-                    progress += 5;
-                    if (progress > 90) {
-                        clearInterval(interval);
-                    }
-                    progressBar.style.width = `${progress}%`;
-                }, 100);
-                progressBar.dataset.interval = interval;
-            }
-
-            function hideProgress() {
-                const interval = progressBar.dataset.interval;
-                if (interval) {
-                    clearInterval(Number(interval));
-                }
-                progressBar.style.width = '100%';
-                setTimeout(() => {
-                    progressContainer.classList.add('d-none');
-                    progressBar.style.width = '0%';
-                }, 500);
-            }
-        });
-    </script>
-    <script src="/static/js/app.js"></script>
+    <script src="/static/js/dist/app.js"></script>
 </body>
 </html>
--- a/test-cors.html
+++ b/test-cors.html
@@ -0,0 +1,228 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+    <meta charset="UTF-8">
+    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+    <title>CORS Test for Talk2Me</title>
+    <style>
+        body {
+            font-family: Arial, sans-serif;
+            max-width: 800px;
+            margin: 50px auto;
+            padding: 20px;
+        }
+        .test-result {
+            margin: 10px 0;
+            padding: 10px;
+            border-radius: 5px;
+        }
+        .success {
+            background-color: #d4edda;
+            color: #155724;
+            border: 1px solid #c3e6cb;
+        }
+        .error {
+            background-color: #f8d7da;
+            color: #721c24;
+            border: 1px solid #f5c6cb;
+        }
+        button {
+            background-color: #007bff;
+            color: white;
+            padding: 10px 20px;
+            border: none;
+            border-radius: 5px;
+            cursor: pointer;
+            margin: 5px;
+        }
+        button:hover {
+            background-color: #0056b3;
+        }
+        input {
+            width: 100%;
+            padding: 10px;
+            margin: 10px 0;
+            border: 1px solid #ddd;
+            border-radius: 5px;
+        }
+        #results {
+            margin-top: 20px;
+        }
+        pre {
+            background-color: #f8f9fa;
+            padding: 10px;
+            border-radius: 5px;
+            overflow-x: auto;
+        }
+    </style>
+</head>
+<body>
+    <h1>CORS Test for Talk2Me API</h1>
+    
+    <p>This page tests CORS configuration for the Talk2Me API. Open this file from a different origin (e.g., file:// or a different port) to test cross-origin requests.</p>
+    
+    <div>
+        <label for="apiUrl">API Base URL:</label>
+        <input type="text" id="apiUrl" placeholder="http://localhost:5005" value="http://localhost:5005">
+    </div>
+    
+    <h2>Tests:</h2>
+    
+    <button onclick="testHealthEndpoint()">Test Health Endpoint</button>
+    <button onclick="testPreflightRequest()">Test Preflight Request</button>
+    <button onclick="testTranscribeEndpoint()">Test Transcribe Endpoint (OPTIONS)</button>
+    <button onclick="testWithCredentials()">Test With Credentials</button>
+    
+    <div id="results"></div>
+    
+    <script>
+        function addResult(test, success, message, details = null) {
+            const resultsDiv = document.getElementById('results');
+            const resultDiv = document.createElement('div');
+            resultDiv.className = `test-result ${success ? 'success' : 'error'}`;
+            
+            let html = `<strong>${test}:</strong> ${message}`;
+            if (details) {
+                html += `<pre>${JSON.stringify(details, null, 2)}</pre>`;
+            }
+            resultDiv.innerHTML = html;
+            resultsDiv.appendChild(resultDiv);
+        }
+        
+        function getApiUrl() {
+            return document.getElementById('apiUrl').value.trim();
+        }
+        
+        async function testHealthEndpoint() {
+            const apiUrl = getApiUrl();
+            try {
+                const response = await fetch(`${apiUrl}/health`, {
+                    method: 'GET',
+                    mode: 'cors',
+                    headers: {
+                        'Origin': window.location.origin
+                    }
+                });
+                
+                const data = await response.json();
+                
+                // Check CORS headers
+                const corsHeaders = {
+                    'Access-Control-Allow-Origin': response.headers.get('Access-Control-Allow-Origin'),
+                    'Access-Control-Allow-Credentials': response.headers.get('Access-Control-Allow-Credentials')
+                };
+                
+                addResult('Health Endpoint GET', true, 'Request successful', {
+                    status: response.status,
+                    data: data,
+                    corsHeaders: corsHeaders
+                });
+            } catch (error) {
+                addResult('Health Endpoint GET', false, error.message);
+            }
+        }
+        
+        async function testPreflightRequest() {
+            const apiUrl = getApiUrl();
+            try {
+                const response = await fetch(`${apiUrl}/api/push-public-key`, {
+                    method: 'OPTIONS',
+                    mode: 'cors',
+                    headers: {
+                        'Origin': window.location.origin,
+                        'Access-Control-Request-Method': 'GET',
+                        'Access-Control-Request-Headers': 'content-type'
+                    }
+                });
+                
+                const corsHeaders = {
+                    'Access-Control-Allow-Origin': response.headers.get('Access-Control-Allow-Origin'),
+                    'Access-Control-Allow-Methods': response.headers.get('Access-Control-Allow-Methods'),
+                    'Access-Control-Allow-Headers': response.headers.get('Access-Control-Allow-Headers'),
+                    'Access-Control-Max-Age': response.headers.get('Access-Control-Max-Age')
+                };
+                
+                addResult('Preflight Request', response.ok, `Status: ${response.status}`, corsHeaders);
+            } catch (error) {
+                addResult('Preflight Request', false, error.message);
+            }
+        }
+        
+        async function testTranscribeEndpoint() {
+            const apiUrl = getApiUrl();
+            try {
+                const response = await fetch(`${apiUrl}/transcribe`, {
+                    method: 'OPTIONS',
+                    mode: 'cors',
+                    headers: {
+                        'Origin': window.location.origin,
+                        'Access-Control-Request-Method': 'POST',
+                        'Access-Control-Request-Headers': 'content-type'
+                    }
+                });
+                
+                const corsHeaders = {
+                    'Access-Control-Allow-Origin': response.headers.get('Access-Control-Allow-Origin'),
+                    'Access-Control-Allow-Methods': response.headers.get('Access-Control-Allow-Methods'),
+                    'Access-Control-Allow-Headers': response.headers.get('Access-Control-Allow-Headers'),
+                    'Access-Control-Allow-Credentials': response.headers.get('Access-Control-Allow-Credentials')
+                };
+                
+                addResult('Transcribe Endpoint OPTIONS', response.ok, `Status: ${response.status}`, corsHeaders);
+            } catch (error) {
+                addResult('Transcribe Endpoint OPTIONS', false, error.message);
+            }
+        }
+        
+        async function testWithCredentials() {
+            const apiUrl = getApiUrl();
+            try {
+                const response = await fetch(`${apiUrl}/health`, {
+                    method: 'GET',
+                    mode: 'cors',
+                    credentials: 'include',
+                    headers: {
+                        'Origin': window.location.origin
+                    }
+                });
+                
+                const data = await response.json();
+                
+                addResult('Request with Credentials', true, 'Request successful', {
+                    status: response.status,
+                    credentialsIncluded: true,
+                    data: data
+                });
+            } catch (error) {
+                addResult('Request with Credentials', false, error.message);
+            }
+        }
+        
+        // Clear results before running new tests
+        function clearResults() {
+            document.getElementById('results').innerHTML = '';
+        }
+        
+        // Add event listeners
+        document.querySelectorAll('button').forEach(button => {
+            button.addEventListener('click', (e) => {
+                if (!e.target.textContent.includes('Test')) return;
+                clearResults();
+            });
+        });
+        
+        // Show current origin
+        window.addEventListener('load', () => {
+            const info = document.createElement('div');
+            info.style.marginBottom = '20px';
+            info.style.padding = '10px';
+            info.style.backgroundColor = '#e9ecef';
+            info.style.borderRadius = '5px';
+            info.innerHTML = `<strong>Current Origin:</strong> ${window.location.origin}<br>
+                             <strong>Protocol:</strong> ${window.location.protocol}<br>
+                             <strong>Note:</strong> For effective CORS testing, open this file from a different origin than your API server.`;
+            document.body.insertBefore(info, document.querySelector('h2'));
+        });
+    </script>
+</body>
+</html>
--- a/test_error_logging.py
+++ b/test_error_logging.py
@@ -0,0 +1,168 @@
+#!/usr/bin/env python3
+"""
+Test script for error logging system
+"""
+import logging
+import json
+import os
+import time
+from error_logger import ErrorLogger, log_errors, log_performance, get_logger
+
+def test_basic_logging():
+    """Test basic logging functionality"""
+    print("\n=== Testing Basic Logging ===")
+    
+    # Get logger
+    logger = get_logger('test')
+    
+    # Test different log levels
+    logger.debug("This is a debug message")
+    logger.info("This is an info message")
+    logger.warning("This is a warning message")
+    logger.error("This is an error message")
+    
+    print("✓ Basic logging test completed")
+
+def test_error_logging():
+    """Test error logging with exceptions"""
+    print("\n=== Testing Error Logging ===")
+    
+    @log_errors('test.functions')
+    def failing_function():
+        raise ValueError("This is a test error")
+    
+    try:
+        failing_function()
+    except ValueError:
+        print("✓ Error was logged")
+    
+    # Check if error log exists
+    if os.path.exists('logs/errors.log'):
+        print("✓ Error log file created")
+        
+        # Read last line
+        with open('logs/errors.log', 'r') as f:
+            lines = f.readlines()
+            if lines:
+                try:
+                    error_entry = json.loads(lines[-1])
+                    print(f"✓ Error logged with level: {error_entry.get('level')}")
+                    print(f"✓ Error type: {error_entry.get('exception', {}).get('type')}")
+                except json.JSONDecodeError:
+                    print("✗ Error log entry is not valid JSON")
+    else:
+        print("✗ Error log file not created")
+
+def test_performance_logging():
+    """Test performance logging"""
+    print("\n=== Testing Performance Logging ===")
+    
+    @log_performance('test_operation')
+    def slow_function():
+        time.sleep(0.1)  # Simulate slow operation
+        return "result"
+    
+    result = slow_function()
+    print(f"✓ Function returned: {result}")
+    
+    # Check performance log
+    if os.path.exists('logs/performance.log'):
+        print("✓ Performance log file created")
+        
+        # Read last line
+        with open('logs/performance.log', 'r') as f:
+            lines = f.readlines()
+            if lines:
+                try:
+                    perf_entry = json.loads(lines[-1])
+                    duration = perf_entry.get('extra_fields', {}).get('duration_ms', 0)
+                    print(f"✓ Performance logged with duration: {duration}ms")
+                except json.JSONDecodeError:
+                    print("✗ Performance log entry is not valid JSON")
+    else:
+        print("✗ Performance log file not created")
+
+def test_structured_logging():
+    """Test structured logging format"""
+    print("\n=== Testing Structured Logging ===")
+    
+    logger = get_logger('test.structured')
+    
+    # Log with extra fields
+    logger.info("Structured log test", extra={
+        'extra_fields': {
+            'user_id': 123,
+            'action': 'test_action',
+            'metadata': {'key': 'value'}
+        }
+    })
+    
+    # Check main log
+    if os.path.exists('logs/talk2me.log'):
+        with open('logs/talk2me.log', 'r') as f:
+            lines = f.readlines()
+            if lines:
+                try:
+                    # Find our test entry
+                    for line in reversed(lines):
+                        entry = json.loads(line)
+                        if entry.get('message') == 'Structured log test':
+                            print("✓ Structured log entry found")
+                            print(f"✓ Contains timestamp: {'timestamp' in entry}")
+                            print(f"✓ Contains hostname: {'hostname' in entry}")
+                            print(f"✓ Contains extra fields: {'user_id' in entry}")
+                            break
+                except json.JSONDecodeError:
+                    print("✗ Log entry is not valid JSON")
+
+def test_log_rotation():
+    """Test log rotation settings"""
+    print("\n=== Testing Log Rotation ===")
+    
+    # Check if log files exist and their sizes
+    log_files = {
+        'talk2me.log': 'logs/talk2me.log',
+        'errors.log': 'logs/errors.log',
+        'access.log': 'logs/access.log',
+        'security.log': 'logs/security.log',
+        'performance.log': 'logs/performance.log'
+    }
+    
+    for name, path in log_files.items():
+        if os.path.exists(path):
+            size = os.path.getsize(path)
+            print(f"✓ {name}: {size} bytes")
+        else:
+            print(f"- {name}: not created yet")
+
+def main():
+    """Run all tests"""
+    print("Error Logging System Tests")
+    print("==========================")
+    
+    # Create a test Flask app
+    from flask import Flask
+    app = Flask(__name__)
+    app.config['LOG_LEVEL'] = 'DEBUG'
+    app.config['FLASK_ENV'] = 'testing'
+    
+    # Initialize error logger
+    error_logger = ErrorLogger(app)
+    
+    # Run tests
+    test_basic_logging()
+    test_error_logging()
+    test_performance_logging()
+    test_structured_logging()
+    test_log_rotation()
+    
+    print("\n✅ All tests completed!")
+    print("\nCheck the logs directory for generated log files:")
+    print("- logs/talk2me.log - Main application log")
+    print("- logs/errors.log - Error log with stack traces")
+    print("- logs/performance.log - Performance metrics")
+    print("- logs/access.log - HTTP access log")
+    print("- logs/security.log - Security events")
+
+if __name__ == "__main__":
+    main()
--- a/test_session_manager.py
+++ b/test_session_manager.py
@@ -0,0 +1,264 @@
+#!/usr/bin/env python3
+"""
+Unit tests for session management system
+"""
+import unittest
+import tempfile
+import shutil
+import time
+import os
+from session_manager import SessionManager, UserSession, SessionResource
+from flask import Flask, g, session
+
+class TestSessionManager(unittest.TestCase):
+    def setUp(self):
+        """Set up test fixtures"""
+        self.temp_dir = tempfile.mkdtemp()
+        self.config = {
+            'max_session_duration': 3600,
+            'max_idle_time': 900,
+            'max_resources_per_session': 5,  # Small limit for testing
+            'max_bytes_per_session': 1024 * 1024,  # 1MB for testing
+            'cleanup_interval': 1,  # 1 second for faster testing
+            'session_storage_path': self.temp_dir
+        }
+        self.manager = SessionManager(self.config)
+    
+    def tearDown(self):
+        """Clean up test fixtures"""
+        shutil.rmtree(self.temp_dir, ignore_errors=True)
+    
+    def test_create_session(self):
+        """Test session creation"""
+        session = self.manager.create_session(
+            session_id='test-123',
+            user_id='user-1',
+            ip_address='127.0.0.1',
+            user_agent='Test Agent'
+        )
+        
+        self.assertEqual(session.session_id, 'test-123')
+        self.assertEqual(session.user_id, 'user-1')
+        self.assertEqual(session.ip_address, '127.0.0.1')
+        self.assertEqual(session.user_agent, 'Test Agent')
+        self.assertEqual(len(session.resources), 0)
+    
+    def test_get_session(self):
+        """Test session retrieval"""
+        self.manager.create_session(session_id='test-456')
+        session = self.manager.get_session('test-456')
+        
+        self.assertIsNotNone(session)
+        self.assertEqual(session.session_id, 'test-456')
+        
+        # Non-existent session
+        session = self.manager.get_session('non-existent')
+        self.assertIsNone(session)
+    
+    def test_add_resource(self):
+        """Test adding resources to session"""
+        self.manager.create_session(session_id='test-789')
+        
+        # Add a resource
+        resource = self.manager.add_resource(
+            session_id='test-789',
+            resource_type='audio_file',
+            resource_id='audio-1',
+            path='/tmp/test.wav',
+            size_bytes=1024,
+            metadata={'format': 'wav'}
+        )
+        
+        self.assertIsNotNone(resource)
+        self.assertEqual(resource.resource_id, 'audio-1')
+        self.assertEqual(resource.resource_type, 'audio_file')
+        self.assertEqual(resource.size_bytes, 1024)
+        
+        # Check session updated
+        session = self.manager.get_session('test-789')
+        self.assertEqual(len(session.resources), 1)
+        self.assertEqual(session.total_bytes_used, 1024)
+    
+    def test_resource_limits(self):
+        """Test resource limit enforcement"""
+        self.manager.create_session(session_id='test-limits')
+        
+        # Add resources up to limit
+        for i in range(5):
+            self.manager.add_resource(
+                session_id='test-limits',
+                resource_type='temp_file',
+                resource_id=f'file-{i}',
+                size_bytes=100
+            )
+        
+        session = self.manager.get_session('test-limits')
+        self.assertEqual(len(session.resources), 5)
+        
+        # Add one more - should remove oldest
+        self.manager.add_resource(
+            session_id='test-limits',
+            resource_type='temp_file',
+            resource_id='file-new',
+            size_bytes=100
+        )
+        
+        session = self.manager.get_session('test-limits')
+        self.assertEqual(len(session.resources), 5)  # Still 5
+        self.assertNotIn('file-0', session.resources)  # Oldest removed
+        self.assertIn('file-new', session.resources)  # New one added
+    
+    def test_size_limits(self):
+        """Test size limit enforcement"""
+        self.manager.create_session(session_id='test-size')
+        
+        # Add a large resource
+        self.manager.add_resource(
+            session_id='test-size',
+            resource_type='audio_file',
+            resource_id='large-1',
+            size_bytes=500 * 1024  # 500KB
+        )
+        
+        # Add another large resource
+        self.manager.add_resource(
+            session_id='test-size',
+            resource_type='audio_file',
+            resource_id='large-2',
+            size_bytes=600 * 1024  # 600KB - would exceed 1MB limit
+        )
+        
+        session = self.manager.get_session('test-size')
+        # First resource should be removed to make space
+        self.assertNotIn('large-1', session.resources)
+        self.assertIn('large-2', session.resources)
+        self.assertLessEqual(session.total_bytes_used, 1024 * 1024)
+    
+    def test_remove_resource(self):
+        """Test resource removal"""
+        self.manager.create_session(session_id='test-remove')
+        self.manager.add_resource(
+            session_id='test-remove',
+            resource_type='temp_file',
+            resource_id='to-remove',
+            size_bytes=1000
+        )
+        
+        # Remove resource
+        success = self.manager.remove_resource('test-remove', 'to-remove')
+        self.assertTrue(success)
+        
+        # Check it's gone
+        session = self.manager.get_session('test-remove')
+        self.assertEqual(len(session.resources), 0)
+        self.assertEqual(session.total_bytes_used, 0)
+    
+    def test_cleanup_session(self):
+        """Test session cleanup"""
+        # Create session with resources
+        self.manager.create_session(session_id='test-cleanup')
+        
+        # Create actual temp file
+        temp_file = os.path.join(self.temp_dir, 'test-file.txt')
+        with open(temp_file, 'w') as f:
+            f.write('test content')
+        
+        self.manager.add_resource(
+            session_id='test-cleanup',
+            resource_type='temp_file',
+            path=temp_file,
+            size_bytes=12
+        )
+        
+        # Cleanup session
+        success = self.manager.cleanup_session('test-cleanup')
+        self.assertTrue(success)
+        
+        # Check session is gone
+        session = self.manager.get_session('test-cleanup')
+        self.assertIsNone(session)
+        
+        # Check file is deleted
+        self.assertFalse(os.path.exists(temp_file))
+    
+    def test_session_info(self):
+        """Test session info retrieval"""
+        self.manager.create_session(
+            session_id='test-info',
+            ip_address='192.168.1.1'
+        )
+        
+        self.manager.add_resource(
+            session_id='test-info',
+            resource_type='audio_file',
+            size_bytes=2048
+        )
+        
+        info = self.manager.get_session_info('test-info')
+        self.assertIsNotNone(info)
+        self.assertEqual(info['session_id'], 'test-info')
+        self.assertEqual(info['ip_address'], '192.168.1.1')
+        self.assertEqual(info['resource_count'], 1)
+        self.assertEqual(info['total_bytes_used'], 2048)
+    
+    def test_stats(self):
+        """Test statistics calculation"""
+        # Create multiple sessions
+        for i in range(3):
+            self.manager.create_session(session_id=f'test-stats-{i}')
+            self.manager.add_resource(
+                session_id=f'test-stats-{i}',
+                resource_type='temp_file',
+                size_bytes=1000
+            )
+        
+        stats = self.manager.get_stats()
+        self.assertEqual(stats['active_sessions'], 3)
+        self.assertEqual(stats['active_resources'], 3)
+        self.assertEqual(stats['active_bytes'], 3000)
+        self.assertEqual(stats['total_sessions_created'], 3)
+    
+    def test_metrics_export(self):
+        """Test metrics export"""
+        self.manager.create_session(session_id='test-metrics')
+        metrics = self.manager.export_metrics()
+        
+        self.assertIn('sessions', metrics)
+        self.assertIn('resources', metrics)
+        self.assertIn('limits', metrics)
+        self.assertEqual(metrics['sessions']['active'], 1)
+
+class TestFlaskIntegration(unittest.TestCase):
+    def setUp(self):
+        """Set up Flask app for testing"""
+        self.app = Flask(__name__)
+        self.app.config['TESTING'] = True
+        self.app.config['SECRET_KEY'] = 'test-secret'
+        self.temp_dir = tempfile.mkdtemp()
+        self.app.config['UPLOAD_FOLDER'] = self.temp_dir
+        
+        # Initialize session manager
+        from session_manager import init_app
+        init_app(self.app)
+        
+        self.client = self.app.test_client()
+        self.ctx = self.app.test_request_context()
+        self.ctx.push()
+    
+    def tearDown(self):
+        """Clean up"""
+        self.ctx.pop()
+        shutil.rmtree(self.temp_dir, ignore_errors=True)
+    
+    def test_before_request_handler(self):
+        """Test Flask before_request integration"""
+        with self.client:
+            # Make a request
+            response = self.client.get('/')
+            
+            # Session should be created
+            with self.client.session_transaction() as sess:
+                self.assertIn('session_id', sess)
+
+if __name__ == '__main__':
+    unittest.main()
--- a/test_size_limits.py
+++ b/test_size_limits.py
@@ -0,0 +1,146 @@
+#!/usr/bin/env python3
+"""
+Test script for request size limits
+"""
+import requests
+import json
+import io
+import os
+
+BASE_URL = "http://localhost:5005"
+
+def test_json_size_limit():
+    """Test JSON payload size limit"""
+    print("\n=== Testing JSON Size Limit ===")
+    
+    # Create a large JSON payload (over 1MB)
+    large_data = {
+        "text": "x" * (2 * 1024 * 1024),  # 2MB of text
+        "source_lang": "English",
+        "target_lang": "Spanish"
+    }
+    
+    try:
+        response = requests.post(f"{BASE_URL}/translate", json=large_data)
+        print(f"Status: {response.status_code}")
+        if response.status_code == 413:
+            print(f"✓ Correctly rejected large JSON: {response.json()}")
+        else:
+            print(f"✗ Should have rejected large JSON")
+    except Exception as e:
+        print(f"Error: {e}")
+
+def test_audio_size_limit():
+    """Test audio file size limit"""
+    print("\n=== Testing Audio Size Limit ===")
+    
+    # Create a fake large audio file (over 25MB)
+    large_audio = io.BytesIO(b"x" * (30 * 1024 * 1024))  # 30MB
+    
+    files = {
+        'audio': ('large_audio.wav', large_audio, 'audio/wav')
+    }
+    data = {
+        'source_lang': 'English'
+    }
+    
+    try:
+        response = requests.post(f"{BASE_URL}/transcribe", files=files, data=data)
+        print(f"Status: {response.status_code}")
+        if response.status_code == 413:
+            print(f"✓ Correctly rejected large audio: {response.json()}")
+        else:
+            print(f"✗ Should have rejected large audio")
+    except Exception as e:
+        print(f"Error: {e}")
+
+def test_valid_requests():
+    """Test that valid-sized requests are accepted"""
+    print("\n=== Testing Valid Size Requests ===")
+    
+    # Small JSON payload
+    small_data = {
+        "text": "Hello world",
+        "source_lang": "English", 
+        "target_lang": "Spanish"
+    }
+    
+    try:
+        response = requests.post(f"{BASE_URL}/translate", json=small_data)
+        print(f"Small JSON - Status: {response.status_code}")
+        if response.status_code != 413:
+            print("✓ Small JSON accepted")
+        else:
+            print("✗ Small JSON should be accepted")
+    except Exception as e:
+        print(f"Error: {e}")
+    
+    # Small audio file
+    small_audio = io.BytesIO(b"RIFF" + b"x" * 1000)  # 1KB fake WAV
+    files = {
+        'audio': ('small_audio.wav', small_audio, 'audio/wav')
+    }
+    data = {
+        'source_lang': 'English'
+    }
+    
+    try:
+        response = requests.post(f"{BASE_URL}/transcribe", files=files, data=data)
+        print(f"Small audio - Status: {response.status_code}")
+        if response.status_code != 413:
+            print("✓ Small audio accepted")
+        else:
+            print("✗ Small audio should be accepted")
+    except Exception as e:
+        print(f"Error: {e}")
+
+def test_admin_endpoints():
+    """Test admin endpoints for size limits"""
+    print("\n=== Testing Admin Endpoints ===")
+    
+    headers = {'X-Admin-Token': os.environ.get('ADMIN_TOKEN', 'default-admin-token')}
+    
+    # Get current limits
+    try:
+        response = requests.get(f"{BASE_URL}/admin/size-limits", headers=headers)
+        print(f"Get limits - Status: {response.status_code}")
+        if response.status_code == 200:
+            limits = response.json()
+            print(f"✓ Current limits: {limits['limits_human']}")
+        else:
+            print(f"✗ Failed to get limits: {response.text}")
+    except Exception as e:
+        print(f"Error: {e}")
+    
+    # Update limits
+    new_limits = {
+        "max_audio_size": "30MB",
+        "max_json_size": 2097152  # 2MB in bytes
+    }
+    
+    try:
+        response = requests.post(f"{BASE_URL}/admin/size-limits", 
+                               json=new_limits, headers=headers)
+        print(f"\nUpdate limits - Status: {response.status_code}")
+        if response.status_code == 200:
+            result = response.json()
+            print(f"✓ Updated limits: {result['new_limits_human']}")
+        else:
+            print(f"✗ Failed to update limits: {response.text}")
+    except Exception as e:
+        print(f"Error: {e}")
+
+if __name__ == "__main__":
+    print("Request Size Limit Tests")
+    print("========================")
+    print(f"Testing against: {BASE_URL}")
+    print("\nMake sure the Flask app is running on port 5005")
+    
+    input("\nPress Enter to start tests...")
+    
+    test_valid_requests()
+    test_json_size_limit()
+    test_audio_size_limit()
+    test_admin_endpoints()
+    
+    print("\n✅ All tests completed!")
--- a/tsconfig.json
+++ b/tsconfig.json
@@ -0,0 +1,41 @@
+{
+  "compilerOptions": {
+    "target": "ES2020",
+    "module": "ES2020",
+    "lib": ["ES2020", "DOM", "DOM.Iterable"],
+    "outDir": "./static/js/dist",
+    "rootDir": "./static/js/src",
+    "strict": true,
+    "esModuleInterop": true,
+    "skipLibCheck": true,
+    "forceConsistentCasingInFileNames": true,
+    "moduleResolution": "node",
+    "resolveJsonModule": true,
+    "declaration": true,
+    "declarationMap": true,
+    "sourceMap": true,
+    "removeComments": false,
+    "noEmitOnError": true,
+    "noImplicitAny": true,
+    "noImplicitThis": true,
+    "noUnusedLocals": true,
+    "noUnusedParameters": true,
+    "noImplicitReturns": true,
+    "noFallthroughCasesInSwitch": true,
+    "strictNullChecks": true,
+    "strictFunctionTypes": true,
+    "strictBindCallApply": true,
+    "strictPropertyInitialization": true,
+    "allowJs": false,
+    "types": [
+      "node"
+    ]
+  },
+  "include": [
+    "static/js/src/**/*"
+  ],
+  "exclude": [
+    "node_modules",
+    "static/js/dist"
+  ]
+}
--- a/validators.py
+++ b/validators.py
@@ -0,0 +1,243 @@
+"""
+Input validation and sanitization for the Talk2Me application
+"""
+import re
+import html
+from typing import Optional, Dict, Any, Tuple
+import os
+
+class Validators:
+    # Maximum sizes
+    MAX_TEXT_LENGTH = 10000
+    MAX_AUDIO_SIZE = 25 * 1024 * 1024  # 25MB
+    MAX_URL_LENGTH = 2048
+    MAX_API_KEY_LENGTH = 128
+    
+    # Allowed audio formats
+    ALLOWED_AUDIO_EXTENSIONS = {'.webm', '.ogg', '.wav', '.mp3', '.mp4', '.m4a'}
+    ALLOWED_AUDIO_MIMETYPES = {
+        'audio/webm', 'audio/ogg', 'audio/wav', 'audio/mp3', 
+        'audio/mpeg', 'audio/mp4', 'audio/x-m4a', 'audio/x-wav'
+    }
+    
+    @staticmethod
+    def sanitize_text(text: str, max_length: int = None) -> str:
+        """Sanitize text input by removing dangerous characters"""
+        if not isinstance(text, str):
+            return ""
+        
+        if max_length is None:
+            max_length = Validators.MAX_TEXT_LENGTH
+            
+        # Trim and limit length
+        text = text.strip()[:max_length]
+        
+        # Remove null bytes
+        text = text.replace('\x00', '')
+        
+        # Remove control characters except newlines and tabs
+        text = re.sub(r'[\x00-\x08\x0B\x0C\x0E-\x1F\x7F]', '', text)
+        
+        return text
+    
+    @staticmethod
+    def sanitize_html(text: str) -> str:
+        """Escape HTML to prevent XSS"""
+        if not isinstance(text, str):
+            return ""
+        return html.escape(text)
+    
+    @staticmethod
+    def validate_language_code(code: str, allowed_languages: set) -> Optional[str]:
+        """Validate language code against allowed list"""
+        if not code or not isinstance(code, str):
+            return None
+            
+        code = code.strip().lower()
+        
+        # Check if it's in the allowed list or is 'auto'
+        if code in allowed_languages or code == 'auto':
+            return code
+            
+        return None
+    
+    @staticmethod
+    def validate_audio_file(file_storage) -> Tuple[bool, Optional[str]]:
+        """Validate uploaded audio file"""
+        if not file_storage:
+            return False, "No file provided"
+        
+        # Check file size
+        file_storage.seek(0, os.SEEK_END)
+        size = file_storage.tell()
+        file_storage.seek(0)
+        
+        if size > Validators.MAX_AUDIO_SIZE:
+            return False, f"File size exceeds {Validators.MAX_AUDIO_SIZE // (1024*1024)}MB limit"
+        
+        # Check file extension
+        if file_storage.filename:
+            ext = os.path.splitext(file_storage.filename.lower())[1]
+            if ext not in Validators.ALLOWED_AUDIO_EXTENSIONS:
+                return False, "Invalid audio file type"
+        
+        # Check MIME type if available
+        if hasattr(file_storage, 'content_type') and file_storage.content_type:
+            if file_storage.content_type not in Validators.ALLOWED_AUDIO_MIMETYPES:
+                # Allow generic application/octet-stream as browsers sometimes use this
+                if file_storage.content_type != 'application/octet-stream':
+                    return False, "Invalid audio MIME type"
+        
+        return True, None
+    
+    @staticmethod
+    def validate_url(url: str) -> Optional[str]:
+        """Validate and sanitize URL"""
+        if not url or not isinstance(url, str):
+            return None
+        
+        url = url.strip()
+        
+        # Check length
+        if len(url) > Validators.MAX_URL_LENGTH:
+            return None
+        
+        # Basic URL pattern check
+        url_pattern = re.compile(
+            r'^https?://'  # http:// or https://
+            r'(?:(?:[A-Z0-9](?:[A-Z0-9-]{0,61}[A-Z0-9])?\.)+[A-Z]{2,6}\.?|'  # domain...
+            r'localhost|'  # localhost...
+            r'\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}|'  # ...or ipv4
+            r'\[?[A-F0-9]*:[A-F0-9:]+\]?)'  # ...or ipv6
+            r'(?::\d+)?'  # optional port
+            r'(?:/?|[/?]\S+)$', re.IGNORECASE)
+        
+        if not url_pattern.match(url):
+            return None
+        
+        # Prevent some common injection attempts
+        dangerous_patterns = [
+            'javascript:', 'data:', 'vbscript:', 'file:', 'about:', 'chrome:'
+        ]
+        if any(pattern in url.lower() for pattern in dangerous_patterns):
+            return None
+        
+        return url
+    
+    @staticmethod
+    def validate_api_key(key: str) -> Optional[str]:
+        """Validate API key format"""
+        if not key or not isinstance(key, str):
+            return None
+        
+        key = key.strip()
+        
+        # Check length
+        if len(key) < 20 or len(key) > Validators.MAX_API_KEY_LENGTH:
+            return None
+        
+        # Only allow alphanumeric, dash, and underscore
+        if not re.match(r'^[a-zA-Z0-9\-_]+$', key):
+            return None
+        
+        return key
+    
+    @staticmethod
+    def sanitize_filename(filename: str) -> str:
+        """Sanitize filename to prevent directory traversal"""
+        if not filename or not isinstance(filename, str):
+            return "file"
+        
+        # Remove any path components
+        filename = os.path.basename(filename)
+        
+        # Remove dangerous characters
+        filename = re.sub(r'[^a-zA-Z0-9.\-_]', '_', filename)
+        
+        # Limit length
+        if len(filename) > 255:
+            name, ext = os.path.splitext(filename)
+            max_name_length = 255 - len(ext)
+            filename = name[:max_name_length] + ext
+        
+        # Don't allow hidden files
+        if filename.startswith('.'):
+            filename = '_' + filename[1:]
+        
+        return filename or "file"
+    
+    @staticmethod
+    def validate_json_size(data: Dict[str, Any], max_size_kb: int = 1024) -> bool:
+        """Check if JSON data size is within limits"""
+        try:
+            import json
+            json_str = json.dumps(data)
+            size_kb = len(json_str.encode('utf-8')) / 1024
+            return size_kb <= max_size_kb
+        except:
+            return False
+    
+    @staticmethod
+    def validate_settings(settings: Dict[str, Any]) -> Tuple[bool, Dict[str, Any], list]:
+        """Validate settings object"""
+        errors = []
+        sanitized = {}
+        
+        # Boolean settings
+        bool_settings = [
+            'notificationsEnabled', 'notifyTranscription', 
+            'notifyTranslation', 'notifyErrors', 'offlineMode'
+        ]
+        
+        for setting in bool_settings:
+            if setting in settings:
+                sanitized[setting] = bool(settings[setting])
+        
+        # URL validation
+        if 'ttsServerUrl' in settings and settings['ttsServerUrl']:
+            url = Validators.validate_url(settings['ttsServerUrl'])
+            if not url:
+                errors.append('Invalid TTS server URL')
+            else:
+                sanitized['ttsServerUrl'] = url
+        
+        # API key validation
+        if 'ttsApiKey' in settings and settings['ttsApiKey']:
+            key = Validators.validate_api_key(settings['ttsApiKey'])
+            if not key:
+                errors.append('Invalid API key format')
+            else:
+                sanitized['ttsApiKey'] = key
+        
+        return len(errors) == 0, sanitized, errors
+    
+    @staticmethod
+    def rate_limit_check(identifier: str, action: str, max_requests: int = 10, 
+                        window_seconds: int = 60, storage: Dict = None) -> bool:
+        """
+        Simple rate limiting check
+        Returns True if request is allowed, False if rate limited
+        """
+        import time
+        
+        if storage is None:
+            return True  # Can't track without storage
+        
+        key = f"{identifier}:{action}"
+        current_time = time.time()
+        window_start = current_time - window_seconds
+        
+        # Get or create request list
+        if key not in storage:
+            storage[key] = []
+        
+        # Remove old requests outside the window
+        storage[key] = [t for t in storage[key] if t > window_start]
+        
+        # Check if limit exceeded
+        if len(storage[key]) >= max_requests:
+            return False
+        
+        # Add current request
+        storage[key].append(current_time)
+        return True
--- a/whisper_config.py
+++ b/whisper_config.py
@@ -0,0 +1,39 @@
+"""
+Whisper Model Configuration and Optimization Settings
+"""
+
+# Model selection based on available resources
+# Available models: tiny, base, small, medium, large
+MODEL_SIZE = "base"  # ~140MB, good balance of speed and accuracy
+
+# GPU Optimization Settings
+GPU_OPTIMIZATIONS = {
+    "enable_tf32": True,  # TensorFloat-32 for Ampere GPUs
+    "enable_cudnn_benchmark": True,  # Auto-tune convolution algorithms
+    "use_fp16": True,  # Half precision for faster inference
+    "pre_allocate_memory": True,  # Reduce memory fragmentation
+    "warm_up_gpu": True  # Cache CUDA kernels on startup
+}
+
+# Transcription Settings for Speed
+TRANSCRIBE_OPTIONS = {
+    "task": "transcribe",
+    "temperature": 0,  # Disable sampling
+    "best_of": 1,  # No beam search
+    "beam_size": 1,  # Single beam
+    "condition_on_previous_text": False,  # Faster inference
+    "compression_ratio_threshold": 2.4,
+    "logprob_threshold": -1.0,
+    "no_speech_threshold": 0.6,
+    "word_timestamps": False  # Disable if not needed
+}
+
+# Memory Management
+MEMORY_SETTINGS = {
+    "clear_cache_after_transcribe": True,
+    "force_garbage_collection": True,
+    "max_concurrent_transcriptions": 1  # Prevent memory overflow
+}
+
+# Performance Monitoring
+ENABLE_PERFORMANCE_LOGGING = True
--- a/wsgi.py
+++ b/wsgi.py
@@ -0,0 +1,34 @@
+#!/usr/bin/env python3
+"""
+WSGI entry point for production deployment
+"""
+import os
+import sys
+from pathlib import Path
+
+# Add the project directory to the Python path
+project_root = Path(__file__).parent.absolute()
+sys.path.insert(0, str(project_root))
+
+# Set production environment
+os.environ['FLASK_ENV'] = 'production'
+
+# Import and configure the Flask app
+from app import app
+
+# Production configuration overrides
+app.config.update(
+    DEBUG=False,
+    TESTING=False,
+    # Ensure proper secret key is set in production
+    SECRET_KEY=os.environ.get('SECRET_KEY', app.config.get('SECRET_KEY'))
+)
+
+# Create the WSGI application
+application = app
+
+if __name__ == '__main__':
+    # This is only for development/testing
+    # In production, use: gunicorn wsgi:application
+    print("Warning: Running WSGI directly. Use a proper WSGI server in production!")
+    application.run(host='0.0.0.0', port=5005)