SLD is a self-correcting LLM-controlled diffusion model framework that achieves precise text-to-image alignment by integrating a detector-enhanced generator. The SLD framework supports both image generation and fine-grained editing, and is compatible with any image generator, such as DALL-E 3, without requiring additional training or data.