The datasets FunQA, released by institutions such as Beijing University of Posts and Telecommunications and Nanyang Technological University in Singapore, utilize 4,000 comedic videos and 310,000 pieces of commentary text to enhance AI's capabilities in accurate video comprehension, counterfactual reasoning, sense of humor, and free-form text generation. FunQA consists of three subsets, covering tasks such as timestamp localization, video description, and counterintuitive reasoning, aiming to assess the model's understanding of counterintuitive videos. However, the performance of models on the FunQA dataset is generally suboptimal, facing challenges such as accurate information comprehension, logical reasoning, and application of additional knowledge. To promote research, the FunQA Challenge algorithm competition has been launched, with prizes totaling up to $1 million.