I find plenty of information on building high-capacity nginx/Jitsi Meet + Jicofo + JVB deployments, and on some of the challenges with Prosody's single-threading.
I am having trouble finding any information on how to design, plan, and implement high-capacity recording and closed captioning with Jibri and Jigasi.
There is plenty of information about running multiple JVBs to handle more demand. Is there information on a similar approach for Jibri and Jigasi?
For example, suppose we support 20,000 simultaneous participants, averaging 20 participants per room, with 1-2 video/audio streams per room. It is pretty clear how to plan for the resulting 1,000 rooms.
But what if 500 of those rooms want to record their audio/video sessions, and another 500 want closed captioning running? How does one prepare for that?
For example, we are looking at using Vosk for CC. We have no information yet on how many Vosk servers we will need and will have to test that, unless someone has numbers they can share. We expect at least 2 million minutes of closed captioning per month. That would cost over $500,000 USD per month if we used Google, so we are trying to set up our own Vosk CC instead.
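To make the CC volume concrete, here is a rough back-of-the-envelope sketch in Python. It assumes a 30-day month and captions spread evenly across it, which real traffic will not be, so the peak will sit far above this average. The per-minute rate is only what my two figures above imply, not a quoted price, and the function names are made up for illustration.

```python
# Assumption: a 30-day month, i.e. 43,200 minutes.
MINUTES_PER_MONTH = 30 * 24 * 60


def average_concurrent_streams(cc_minutes_per_month: float) -> float:
    """Average number of caption streams running at once,
    if the monthly minutes were spread evenly over the month."""
    return cc_minutes_per_month / MINUTES_PER_MONTH


def implied_rate_per_minute(monthly_cost_usd: float, cc_minutes_per_month: float) -> float:
    """Cost per captioned minute implied by a monthly bill."""
    return monthly_cost_usd / cc_minutes_per_month


avg = average_concurrent_streams(2_000_000)
rate = implied_rate_per_minute(500_000, 2_000_000)

print(f"average concurrent CC streams: {avg:.1f}")  # about 46.3
print(f"implied cost per CC minute: ${rate:.2f}")   # $0.25
```

So the monthly minimum averages out to under 50 concurrent streams, while the 500-room case is roughly ten times that. The Vosk tier would have to be sized for the peak, not the average.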
We want to prepare for this high volume of CC demand, but I am having trouble finding many conversations at this scale.
One resource suggested that each room recording its session (Jibri) should have a dedicated Jibri instance, especially because of the ALSA audio loopback challenges. Can this be a very low-end instance like a t3.small, or does it need to be a little stronger, something like 2 CPUs and 4 GB RAM (t4g.large)? Is this accurate? Overkill? Underpowered? Unknown? And how do these instances get provisioned on demand?
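For the on-demand part, the kind of sizing rule I have in mind looks like this minimal sketch. It assumes the one-Jibri-per-recording suggestion above is correct. The spare count and the cap are placeholder values I made up, not recommendations, and the function is hypothetical rather than part of any Jitsi tooling.

```python
def desired_jibri_instances(active_recordings: int, spare: int, max_instances: int) -> int:
    """Target pool size: one Jibri per active recording, plus a few
    idle spares so a new recording can start right away, capped at
    max_instances to bound cost."""
    if active_recordings < 0 or spare < 0 or max_instances < 0:
        raise ValueError("counts must be non-negative")
    return min(active_recordings + spare, max_instances)


# 500 rooms recording, 20 idle spares (assumed), hard cap of 600 (assumed).
print(desired_jibri_instances(500, 20, 600))  # 520
# If demand exceeds the cap, the pool stops growing at the cap.
print(desired_jibri_instances(590, 20, 600))  # 600
```

What I do not know is the best way to actually launch and retire the instances behind a rule like this.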
For closed captioning with Vosk, I have so far been unable to find any information on how many simultaneous real-time caption streams it can handle from Jitsi at any level of hardware. Are you aware of any? Is this another situation where we will need to spin up a small instance for every room that enables CC? Or is there a ratio it can handle, for example 4 rooms per server instance?
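If such a ratio exists, the server count falls out of simple arithmetic. The sketch below treats rooms per server as an unknown that has to be measured under load. The 4 is only my example figure from above, the 20% headroom is an arbitrary assumption, and the function name is hypothetical.

```python
def vosk_servers_needed(peak_cc_rooms: int, rooms_per_server: int, headroom_pct: int = 20) -> int:
    """Servers required to caption peak_cc_rooms at once, given a
    measured rooms_per_server ratio, with headroom_pct extra capacity.
    Uses integer ceiling division to avoid float rounding surprises."""
    if rooms_per_server <= 0:
        raise ValueError("rooms_per_server must be positive")
    needed = peak_cc_rooms * (100 + headroom_pct)
    per_server = rooms_per_server * 100
    return -(-needed // per_server)  # ceiling division


# 500 rooms with CC enabled at peak.
print(vosk_servers_needed(500, 4))  # 150 servers at 4 rooms each
print(vosk_servers_needed(500, 1))  # 600 servers at one room each
```

The gap between those two results is why I am hoping someone has real numbers for the ratio.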
Thank you for thinking about this and any suggestions or resources you could point me to for scaling these components.