diff options
| author | Pinapelz <yukais@pinapelz.com> | 2026-05-09 00:27:38 -0700 |
|---|---|---|
| committer | Pinapelz <yukais@pinapelz.com> | 2026-05-09 00:27:57 -0700 |
| commit | ebc390f9e74378fdaba3e79e9da4a76c436464e2 (patch) | |
| tree | 0b2829e10d0c846902eb2bf53ad05dcdb8cff3ce /README.md | |
| parent | 41c4a2b287030f93b96db27dc6783e2b12aab99e (diff) | |
implement gpt-realtime-translate
Diffstat (limited to 'README.md')
| -rw-r--r-- | README.md | 12 |
1 files changed, 8 insertions, 4 deletions
@@ -16,6 +16,14 @@ A GUI is available for configuration `server.py` serves a backend for translating incoming audio data. It expects some other client to hit the `/events` endpoint to fetch the translated data. +## Translation backends + +You can now choose between two subtitle backends in the settings dialog: + +- **Whisper + optional Ollama cleanup** +- **OpenAI Realtime translation** using `gpt-realtime-translate` (requires OpenAI API Key, billed usage) + - This bypasses `faster-whisper` and `ollama`, in this use case auto-live-tl only serves to encode PCM data into the appropriate format for `gpt-realtime-translate` + # Clients: @@ -103,7 +111,3 @@ https://github.com/user-attachments/assets/db602a11-2d13-4e58-a5e8-1d4a71c1be0e Example 2: https://github.com/user-attachments/assets/a480809e-77f7-4b66-9686-aa2ffea8333d - - - - |
