image-textualization is an automated framework for generating rich and detailed image descriptions. This framework utilizes deep learning technology, enabling it to automatically extract information from images and generate accurate, comprehensive textual descriptions. This technology holds significant application value in areas such as image recognition, content generation, and assisting individuals with visual impairments.