Mini-Gemini
A multi-modal AI model with both image understanding and generation capabilities.
CommonProductProductivityAI ModelImage Processing
Developed by Professor Jia Jiayin's team at the Chinese University of Hong Kong, Mini-Gemini is a multi-modal model with precise image understanding capabilities and high-quality training data. Combining image reasoning and generation, it offers versions of different scales, with performance comparable to GPT-4 and DALLE3. Mini-Gemini utilizes Gemini's visual dual-branch information mining method and SDXL technology. It encodes images through convolutional networks and leverages the Attention mechanism to extract information, simultaneously connecting the two models by incorporating LLM for text generation.
Mini-Gemini Visit Over Time
Monthly Visits
494758773
Bounce Rate
37.69%
Page per Visit
5.7
Visit Duration
00:06:29