I have heard that speech recognition systems exist, and it seems I need one. I have an audio file containing speech (only one person is speaking most of the time), and I want to get a transcript of it.
Is something like that possible?