Go to file
Adolfo Delorenzo 343bfbf1de Fix temporary file accumulation to prevent disk space exhaustion
Automatic Cleanup System:
- Background thread cleans files older than 5 minutes every minute
- Tracks all temporary files in a registry with creation timestamps
- Automatic cleanup on app shutdown with atexit handler
- Orphaned file detection and removal
- Thread-safe cleanup implementation

File Management:
- Unique filenames with timestamps prevent collisions
- Configurable upload folder via UPLOAD_FOLDER environment variable
- Automatic folder creation with proper permissions
- Fallback to system temp if primary folder fails
- File registration for all uploads and generated audio

Health Monitoring:
- /health/storage endpoint shows temp file statistics
- Tracks file count, total size, oldest file age
- Disk space monitoring and warnings
- Real-time cleanup status information
- Warning when files exceed thresholds

Administrative Tools:
- maintenance.sh script for manual operations
- Status checking, manual cleanup, real-time monitoring
- /admin/cleanup endpoint for emergency cleanup (requires auth token)
- Configurable retention period (default 5 minutes)

Security Improvements:
- Filename sanitization in get_audio endpoint
- Directory traversal prevention
- Cache headers to reduce repeated downloads
- Proper file existence checks

Performance:
- Efficient batch cleanup operations
- Minimal overhead with background thread
- Smart registry management
- Automatic garbage collection after operations

This prevents disk space exhaustion by ensuring temporary files are
automatically cleaned up after use, with multiple failsafes.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-02 23:27:59 -06:00
static Implement streaming translation for 60-80% perceived latency reduction 2025-06-02 23:10:58 -06:00
templates Implement streaming translation for 60-80% perceived latency reduction 2025-06-02 23:10:58 -06:00
venv quasi-final 2025-04-05 11:50:31 -06:00
.gitignore Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
app.py Fix temporary file accumulation to prevent disk space exhaustion 2025-06-02 23:27:59 -06:00
GPU_SUPPORT.md Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
health-monitor.py Add health check endpoints and automatic language detection 2025-06-02 22:37:38 -06:00
maintenance.sh Fix temporary file accumulation to prevent disk space exhaustion 2025-06-02 23:27:59 -06:00
package-lock.json Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
package.json Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
README_TYPESCRIPT.md Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
README.md quasi-final 2025-04-05 11:50:31 -06:00
requirements.txt Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
setup-script.sh quasi-final 2025-04-05 11:50:31 -06:00
tsconfig.json Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00
tts_test_output.mp3 quasi-final 2025-04-05 11:50:31 -06:00
tts-debug-script.py quasi-final 2025-04-05 11:50:31 -06:00
validators.py Add comprehensive input validation and sanitization 2025-06-02 22:58:17 -06:00
whisper_config.py Major improvements: TypeScript, animations, notifications, compression, GPU optimization 2025-06-02 21:18:16 -06:00

Voice Language Translator

A mobile-friendly web application that translates spoken language between multiple languages using:

  • Gemma 3 open-source LLM via Ollama for translation
  • OpenAI Whisper for speech-to-text
  • OpenAI Edge TTS for text-to-speech

Supported Languages

  • Arabic
  • Armenian
  • Azerbaijani
  • English
  • French
  • Georgian
  • Kazakh
  • Mandarin
  • Farsi
  • Portuguese
  • Russian
  • Spanish
  • Turkish
  • Uzbek

Setup Instructions

  1. Install the required Python packages:

    pip install -r requirements.txt
    
  2. Make sure you have Ollama installed and the Gemma 3 model loaded:

    ollama pull gemma3
    
  3. Ensure your OpenAI Edge TTS server is running on port 5050.

  4. Run the application:

    python app.py
    
  5. Open your browser and navigate to:

    http://localhost:8000
    

Usage

  1. Select your source language from the dropdown menu
  2. Press the microphone button and speak
  3. Press the button again to stop recording
  4. Wait for the transcription to complete
  5. Select your target language
  6. Press the "Translate" button
  7. Use the play buttons to hear the original or translated text

Technical Details

  • The app uses Flask for the web server
  • Audio is processed client-side using the MediaRecorder API
  • Whisper for speech recognition with language hints
  • Ollama provides access to the Gemma 3 model for translation
  • OpenAI Edge TTS delivers natural-sounding speech output

Mobile Support

The interface is fully responsive and designed to work well on mobile devices.