Hello, is the audio encoder you are using Whisper? If so, which version is it, and can it be downloaded directly from Hugging Face and used as-is?