["OneLLM is a unified framework for multimodal alignment.", "It aligns multimodal inputs with LLM using a general encoder and a unified projection module.", "Supports understanding of various modalities such as images, audio, and video.", "Experiments show superiority over existing methods in multiple tasks.", "Exhibits strong zero-shot capabilities."]