this is directed to the developers of transcription and translation. I hope you can find the time to answer me a few questions.
- I hosted Jitsi on a server, and I build the projects from the code from master branch for Jitsi-Meet and Jigasi. If I only want to modify the transcription and translation modules, do I need to build also other projects like Jicofo, JVB or lib-jitsi-meet from source?
- Was there ever released a stable version of Jigasi with the transcription and translation functionalities working? If so, where is the source code, because I could not find it?
- The Speech-to-Text in Google Search for example is quite accurate, but in the implementation on Jitsi, the accuracy is noticeably lower, why is that? I read in Nik Vaessen’s blog that this could be because of the data being sent in raw 48000 kHz rather than on 16000 kHz with flac encoding. But, in the Google Speech-to-Text API documentation it says that the frequency being higher than 16000 kHz does not affect the transcribing.