Setting up speech to text - STT settings

Thank you again for the response. I have pulled several models via Ollama, so I am familiar with that process. Where I am at a loss is setting up speech to text under the audio section: I do not see anywhere to select the models I pulled. It also starts to load kokoro.js and stays stuck on loading even after it reaches 100%. Am I doing something wrong? Can I load a model like mistral and use it in the audio settings for speech to text? Basically, I am trying to transcribe a bunch of MP3s of lectures. Thanks again for all your help.