WePOINTS is a series of multimodal models developed by the WeChat AI team, aimed at creating a unified framework that accommodates various modalities. The models draw on recent advances in multimodal modeling to integrate content understanding and generation. Beyond the models themselves, the WePOINTS project provides pre-training datasets, evaluation tools, and usage tutorials, making it a useful resource for work in multimodal artificial intelligence.