General-purpose speech recognition model for converting audio to text
whisper
$ whisper audio.mp3
$ whisper audio.wav --language en --output_format txt
$ whisper audio.m4a --model base
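
To transcribe several files in one go, the commands above can be wrapped in a small shell loop. This is only a sketch and not part of whisper itself: the function name `transcribe_all` and the `DRY_RUN` variable are made up for illustration, and it uses only the `--model` and `--output_format` options already shown above.

```shell
# Hypothetical helper (not shipped with whisper): run whisper on each file
# passed as an argument, using the "base" model and plain-text output.
# Set DRY_RUN=1 to print the commands instead of executing them.
transcribe_all() {
    for f in "$@"; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Only show what would be run; whisper is not invoked.
            echo "whisper $f --model base --output_format txt"
        else
            whisper "$f" --model base --output_format txt
        fi
    done
}
```

With the function defined in the current shell, `DRY_RUN=1 transcribe_all audio.mp3 audio.wav` prints one whisper command per file, which is a cheap way to check the arguments before starting a long transcription run.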