Performance question


I am using lib-jitsi-meet to build my own client interface.
In my scenario, a teacher shows his video to the students, and the students do not send any video or audio. When a student raises his hand, the teacher can authorize that student's video/audio, which is then shown in place of the teacher's own video.
In my tests I can already do this; lib-jitsi-meet is very good.
My question is: since there will only ever be one video/audio stream at a time and approximately 40 students, will it be necessary to have a very powerful CPU, a lot of memory, and a lot of network bandwidth? What would be the ideal configuration?
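
To make the bandwidth part of the question concrete, here is a rough back-of-the-envelope sketch. It assumes the server forwards the single active stream unchanged to every student, so outbound traffic grows linearly with the number of receivers. The bitrates below are placeholder assumptions, not measurements from my setup, and `estimate_egress_mbps` is just an illustrative name.

```python
def estimate_egress_mbps(receivers: int, video_kbps: float, audio_kbps: float) -> float:
    """Estimate server outbound bandwidth in Mbit/s.

    Assumes one active sender whose stream is forwarded as-is
    to every receiver (no transcoding, no simulcast layers).
    """
    per_receiver_kbps = video_kbps + audio_kbps
    return receivers * per_receiver_kbps / 1000.0


if __name__ == "__main__":
    # Assumed figures only: 40 students, ~2500 kbit/s video, ~40 kbit/s audio.
    students = 40
    egress = estimate_egress_mbps(students, video_kbps=2500, audio_kbps=40)
    print(f"Estimated egress for {students} students: {egress:.1f} Mbit/s")
```

With these assumed bitrates the estimate comes out at roughly 100 Mbit/s outbound, while the inbound side is only the one stream. I do not know whether this model matches what actually happens on the server, which is part of what I am asking.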
Thank you all for your help.