2024-08-07 09:11:21 · AIbase
Israeli Company Launches Open-Source Speech Recognition Model Whisper Medusa with 50% Speed Increase
Israeli AI company aiOla has released Whisper Medusa, an open-source speech recognition model. Its modified architecture incorporates multi-head attention mechanisms, allowing it to process speech 50% faster than OpenAI's Whisper model. Instead of predicting one token at a time, Whisper Medusa predicts ten tokens in parallel, significantly increasing recognition speed while maintaining performance. Its training method employs weak supervision, freezing the backbone system and utilizing...
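To make the "ten tokens instead of one" idea concrete, here is a minimal arithmetic sketch in Python. It is not aiOla's code: the function name `decoding_passes` and the 100-token transcript length are hypothetical, and the sketch assumes every pass emits a fixed number of tokens. It counts decoder passes only, so it says nothing about the cost of each pass or about predictions that get rejected, and it should not be read as a wall-clock speedup estimate.

```python
def decoding_passes(num_tokens: int, tokens_per_pass: int) -> int:
    """Count decoder passes needed to emit num_tokens tokens.

    Hypothetical helper for illustration: assumes each pass emits exactly
    tokens_per_pass tokens, except possibly a shorter final pass.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    if tokens_per_pass < 1:
        raise ValueError("tokens_per_pass must be at least 1")
    # Integer ceiling division: round up so a partial last pass is counted.
    return (num_tokens + tokens_per_pass - 1) // tokens_per_pass


# Assumed transcript length of 100 tokens, chosen only for illustration.
transcript_tokens = 100
one_at_a_time = decoding_passes(transcript_tokens, 1)    # one token per pass
ten_at_a_time = decoding_passes(transcript_tokens, 10)   # ten tokens per pass

print(f"one token per pass:  {one_at_a_time} passes")
print(f"ten tokens per pass: {ten_at_a_time} passes")
```

Under these assumptions the pass count drops by the number of tokens predicted per pass. Real-world gains depend on how expensive each pass is and how many of the parallel predictions are kept.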