CustomNet, jointly developed by Tsinghua University and the University of Tokyo, is an object customization technique that places a specified object from a reference image into a newly generated picture while preserving the object's style and texture details. It draws on 3D novel view synthesis, which allows the object's spatial position and viewpoint to be adjusted explicitly and yields diverse outputs. CustomNet also offers flexible background control: users can set the background through a text description or a specific image so that it composes harmoniously with the object. In addition, CustomNet can handle complex real-world scene data and generate high-quality personalized outputs. The technique is a promising step for product image fusion with Stable Diffusion (SD) and has significant implications for the field of object customization.