OpenAI Launches Multimodal Model GPT-Vision to Compete with Google's Gemini
站长之家
41
OpenAI is gearing up to launch a multimodal model named GPT-Vision, competing with Google's Gemini. GPT-Vision will enable GPT-4 to have broader image application capabilities, generating text related to image content. Additionally, OpenAI is also developing a multimodal AI model named Gobi, which could potentially be the next iteration, GPT-5. OpenAI plans to announce new features of GPT-4 at their developer conference on November 6th. The competition between OpenAI and Google will drive advancements in AI technology, ultimately benefiting consumers.
© Copyright AIbase Base 2024, Click to View Source - https://www.aibase.com/news/1491