CAP4D creates 4D human avatars using Morphable Multi-View Diffusion Models. Given an arbitrary number of reference images, it generates images of the subject from different perspectives and with different expressions, then adapts them into a 4D avatar that can be controlled with a 3D morphable model (3DMM) and rendered in real time. Its main advantages are highly realistic image generation, adaptability to multiple viewpoints, and real-time rendering. CAP4D builds on recent advances in deep learning and image generation, particularly diffusion models and 3D facial modeling. These strengths give it broad application prospects in entertainment, game development, and virtual reality. The code is currently available for free, but specific commercial applications may require further licensing and pricing agreements.
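To make "controlled with a 3DMM" more concrete, here is a minimal sketch of the general idea behind a linear 3D morphable model: face geometry is a mean shape plus weighted identity and expression offsets, so changing a few coefficients per frame animates the face. This is not CAP4D's code or API. The function names, the tiny vertex count, and the random bases are all assumptions made for illustration; a real 3DMM uses bases learned from face scans.

```python
import numpy as np


def make_toy_3dmm(n_vertices=4, n_identity=3, n_expression=2, seed=0):
    """Build a toy linear 3DMM (hypothetical; random bases, not learned ones)."""
    rng = np.random.default_rng(seed)
    mean = rng.normal(size=(n_vertices, 3))  # mean face shape
    identity_basis = rng.normal(size=(n_identity, n_vertices, 3))
    expression_basis = rng.normal(size=(n_expression, n_vertices, 3))
    return mean, identity_basis, expression_basis


def deform(mean, identity_basis, expression_basis, identity, expression):
    """Return vertices = mean + sum_i id_i * B_id[i] + sum_j expr_j * B_expr[j]."""
    identity = np.asarray(identity, dtype=float)
    expression = np.asarray(expression, dtype=float)
    return (
        mean
        + np.tensordot(identity, identity_basis, axes=1)
        + np.tensordot(expression, expression_basis, axes=1)
    )


# Hold the identity fixed and sweep one expression coefficient over time.
mean, id_basis, expr_basis = make_toy_3dmm()
identity = [0.5, -0.2, 0.1]
frames = [
    deform(mean, id_basis, expr_basis, identity, [t, 0.0])
    for t in (0.0, 0.5, 1.0)
]
```

Each entry in `frames` is an array of 3D vertex positions for one time step. In an avatar system of the kind described above, coefficients like these would drive the rendered avatar rather than a bare mesh.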