WeST

Implement LLM-based speech transcription in just 300 lines of code.

Tags: Common Product, Programming, Speech Recognition, Natural Language Processing
WeST is an open-source speech transcription model built on a large language model (LLM) and implemented in 300 lines of code. It has three components: a large language model, a speech encoder, and a projector. Only the projector is trainable. WeST is inspired by SLAM-ASR and LLaMA 3.1, and aims to deliver efficient speech recognition with a deliberately small codebase.
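
The "only the projector is trainable" design can be illustrated with a toy sketch. This is not WeST's code: the class name ToySpeechLLM, its methods, and all dimensions are hypothetical, and each of the three components is replaced by a small linear map so the example runs with the Python standard library alone. It assumes the projector sits between the encoder and the LLM, and shows a training step that updates the projector while the other two parts stay frozen.

```python
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng):
    """Random matrix with entries in [-0.5, 0.5]."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class ToySpeechLLM:
    """Toy stand-in: frozen encoder -> trainable projector -> frozen LLM."""

    def __init__(self, n_audio=8, d_enc=4, d_llm=3, seed=0):
        rng = random.Random(seed)
        self.encoder = rand_matrix(d_enc, n_audio, rng)   # frozen
        self.projector = rand_matrix(d_llm, d_enc, rng)   # trainable
        self.llm = rand_matrix(1, d_llm, rng)             # frozen

    def forward(self, audio):
        h = matvec(self.encoder, audio)    # speech features
        z = matvec(self.projector, h)      # features mapped to the LLM input size
        y = matvec(self.llm, z)[0]         # scalar stand-in for the LLM output
        return h, z, y

    def train_step(self, audio, target, lr=0.01):
        """One gradient step on loss = 0.5 * (y - target)^2, projector only."""
        h, _, y = self.forward(audio)
        err = y - target
        # dLoss/dP[i][j] = err * llm[0][i] * h[j]; encoder and LLM are not touched.
        for i, row in enumerate(self.projector):
            for j in range(len(row)):
                row[j] -= lr * err * self.llm[0][i] * h[j]
        return 0.5 * err * err


def trainable_parameter_count(model):
    """Number of parameters that train_step updates (the projector's)."""
    return sum(len(row) for row in model.projector)
```

Calling train_step repeatedly on one (audio, target) pair lowers the loss while model.encoder and model.llm keep their initial values. In a real system the frozen parts would be a pretrained speech encoder and a pretrained LLM, and freezing would be handled by the training framework rather than by hand-written gradients.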

WeST Visit Over Time

Monthly Visits: 494,758,773
Bounce Rate: 37.69%
Pages per Visit: 5.7
Visit Duration: 00:06:29


WeST Alternatives