adelorenzo/talk2me

Adolfo Delorenzo a4ef775731 Implement comprehensive rate limiting to protect against DoS attacks

- Add token bucket rate limiter with sliding window algorithm
- Implement per-endpoint configurable rate limits
- Add automatic IP blocking for excessive requests
- Implement global request limits and concurrent request throttling
- Add request size validation for all endpoints
- Create admin endpoints for rate limit management
- Add rate limit headers to responses
- Implement cleanup thread for old rate limit buckets
- Create detailed rate limiting documentation

Rate limits:
- Transcription: 10/min, 100/hour, max 10MB
- Translation: 20/min, 300/hour, max 100KB
- Streaming: 10/min, 150/hour, max 100KB
- TTS: 15/min, 200/hour, max 50KB
- Global: 1000/min, 10000/hour, 50 concurrent

Security features:
- Automatic temporary IP blocking (1 hour) for abuse
- Manual IP blocking via admin endpoint
- Request size validation to prevent large payload attacks
- Burst control to limit sudden traffic spikes

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-06-03 00:14:05 -06:00

3.7 KiB

Raw Blame History

Security Configuration Guide

This document outlines security best practices for deploying Talk2Me.

Environment Variables

NEVER commit sensitive information like API keys, passwords, or secrets to version control.

Required Security Configuration

TTS_API_KEY
- Required for TTS server authentication
- Set via environment variable: export TTS_API_KEY="your-api-key"
- Or use a .env file (see .env.example)
SECRET_KEY
- Required for Flask session security
- Generate a secure key: python -c "import secrets; print(secrets.token_hex(32))"
- Set via: export SECRET_KEY="your-generated-key"
ADMIN_TOKEN
- Required for admin endpoints
- Generate a secure token: python -c "import secrets; print(secrets.token_urlsafe(32))"
- Set via: export ADMIN_TOKEN="your-admin-token"

Using a .env File (Recommended)

Copy the example file:
```
cp .env.example .env
```
Edit .env with your actual values:
```
nano .env  # or your preferred editor
```

Load environment variables:

# Using python-dotenv (add to requirements.txt)
pip install python-dotenv

# Or source manually
source .env

Python-dotenv Integration

To automatically load .env files, add this to the top of app.py:

from dotenv import load_dotenv
load_dotenv()  # Load .env file if it exists

Production Deployment

For production deployments:

Use a secrets management service:
- AWS Secrets Manager
- HashiCorp Vault
- Azure Key Vault
- Google Secret Manager
Set environment variables securely:
- Use your platform's environment configuration
- Never expose secrets in logs or error messages
- Rotate keys regularly
Additional security measures:
- Use HTTPS only
- Enable CORS restrictions
- Implement rate limiting
- Monitor for suspicious activity

Docker Deployment

When using Docker:

# Use build arguments for non-sensitive config
ARG TTS_SERVER_URL=http://localhost:5050/v1/audio/speech

# Use runtime environment for secrets
ENV TTS_API_KEY=""

Run with:

docker run -e TTS_API_KEY="your-key" -e SECRET_KEY="your-secret" talk2me

Kubernetes Deployment

Use Kubernetes secrets:

apiVersion: v1
kind: Secret
metadata:
  name: talk2me-secrets
type: Opaque
stringData:
  tts-api-key: "your-api-key"
  flask-secret-key: "your-secret-key"
  admin-token: "your-admin-token"

Rate Limiting

Talk2Me implements comprehensive rate limiting to prevent abuse:

Per-Endpoint Limits:
- Transcription: 10/min, 100/hour
- Translation: 20/min, 300/hour
- TTS: 15/min, 200/hour
Global Limits:
- 1,000 requests/minute total
- 50 concurrent requests maximum
Automatic Protection:
- IP blocking for excessive requests
- Request size validation
- Burst control

See RATE_LIMITING.md for configuration details.

Security Checklist

All API keys removed from source code
Environment variables configured
.env file added to .gitignore
Secrets rotated after any potential exposure
HTTPS enabled in production
CORS properly configured
Rate limiting enabled and configured
Admin endpoints protected with authentication
Error messages don't expose sensitive info
Logs sanitized of sensitive data
Request size limits enforced
IP blocking configured for abuse prevention

Reporting Security Issues

If you discover a security vulnerability, please report it to:

Create a private security advisory on GitHub
Or email: security@yourdomain.com

Do not create public issues for security vulnerabilities.