Whisper large-v3-turbo is an advanced automatic speech recognition (ASR) and speech translation model proposed by OpenAI. It is trained on over 5 million hours of labeled data and can generalize to various datasets and domains in zero-shot settings. This model is a fine-tuned version of Whisper large-v3, reducing the number of decoding layers from 32 to 4 to enhance speed, though it may result in a slight decrease in quality.