This paper proposes a new metric for evaluating image generation models. It highlights the issues with the Frechet Inception Distance (FID) metric and introduces a new metric called CMMD. Extensive experiments demonstrate that the FID metric may be unreliable for evaluating text-to-image models, while the CMMD metric can more reliably assess image quality.