Translated data: Tsinghua University and Microsoft have proposed the SoT method, aimed at addressing the issue of slow generation speeds in large language models. Through a unique two-stage process, SoT achieves significant speed improvements across multiple domains while maintaining answer quality. This method treats language models as black boxes and introduces data-level efficiency optimizations, providing a new perspective for content generation and bringing new exploration directions to the field of artificial intelligence.