I am working on some fixes for implementing VOSK with Jigasi and Jitsi. Since the new stable release of Jitsi (2.0.7830), there has been an issue with selecting the transcription language.
A potential fix involves editing the frontend so that it shows only “Enable subtitles” when only transcription (not translation) is enabled, and shows the whole language menu when translation is enabled. The side effect is that the user can no longer select a transcription language for VOSK (which does not support multiple languages).
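To make the proposed frontend rule concrete, here is a minimal sketch of the decision it describes. All names (`CaptionConfig`, `SubtitleUi`, `subtitleUiFor`) are hypothetical and not taken from the Jitsi Meet codebase; the real implementation would likely read these flags from the existing config and Redux state.

```typescript
// Hypothetical types for illustration only; not part of Jitsi Meet.
type SubtitleUi = "toggle-only" | "language-menu";

interface CaptionConfig {
  transcriptionEnabled: boolean;
  translationEnabled: boolean;
}

// Returns which subtitle control to render, or null when
// transcription is disabled and nothing should be shown.
function subtitleUiFor(config: CaptionConfig): SubtitleUi | null {
  if (!config.transcriptionEnabled) {
    return null;
  }
  // Translation enabled: show the full language menu.
  // Transcription only: show just the "Enable subtitles" toggle.
  return config.translationEnabled ? "language-menu" : "toggle-only";
}
```

With this rule, a transcription-only deployment never reaches the language menu, which is exactly where the language selection gets lost.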
It becomes a bigger problem if we want to support multiple transcription languages with VOSK. If we run multiple instances of VOSK, each with a different language, the user will no longer be able to choose the transcription language.
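As a rough illustration of the multi-instance idea, the sketch below maps a selected language to the VOSK instance that serves it. The function name, the host names, the port, and the fallback behaviour are all assumptions made up for this example; they do not describe how Jigasi is actually configured.

```typescript
// Hypothetical routing table: language code -> WebSocket URL of the
// VOSK instance serving that language. Hosts and port are placeholders.
const voskInstances = new Map<string, string>([
  ["en", "ws://vosk-en.example.internal:2700"],
  ["de", "ws://vosk-de.example.internal:2700"],
]);

// Picks the instance for a language tag such as "de" or "de-DE".
// Falls back to another language when no instance matches.
function voskUrlFor(language: string, fallbackLanguage = "en"): string {
  // Keep only the primary subtag: "de-DE" -> "de".
  const primary = language.toLowerCase().replace(/[-_].*$/, "");
  const url = voskInstances.get(primary) ?? voskInstances.get(fallbackLanguage);
  if (url === undefined) {
    throw new Error(`No VOSK instance configured for "${language}"`);
  }
  return url;
}
```

Something like this only helps if the frontend still lets the user pick a language, so the menu would need to stay visible whenever more than one transcription language is available.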
So my question is: how can we decide which solution works best for the user? My end goal is to have a self-hosted transcription solution (using VOSK or Whispering), which supports multiple languages.
Thank you very much in advance for any help/directions.