Where to capture audio tracks?

we implated our own recording feature.
we wanted to add captioning (Transcription) for the recorded video.
how to achieve this any suggestion, please?

and also wanted to know where we can capture audio tracks for each user? on the client-side?