Google has launched Gemma 2, the latest version of its open-source lightweight language model, available in 9-billion (9B) and 27-billion (27B) parameter sizes. Compared with its predecessor, the original Gemma model, the new version promises enhanced performance and faster inference.


Product Access: https://top.aibase.com/tool/google-gemma-2

Gemma 2 is derived from the research behind Google's Gemini models and aims to be more accessible to researchers and developers, with significant improvements in speed and efficiency. Unlike the multilingual, multimodal Gemini models, Gemma 2 focuses solely on language processing.

Gemma 2 not only outperforms Gemma 1 but also competes effectively with models twice its size. It is designed to run efficiently across various hardware setups, including laptops, desktops, IoT devices, and mobile platforms. Gemma 2 is specifically optimized for single GPUs and TPUs, improving on the efficiency of its predecessor, especially on resource-constrained devices. For example, the 27B model can run inference on a single NVIDIA H100 Tensor Core GPU or a TPU host, making it an economical and efficient choice for developers who need high performance without a substantial hardware investment.
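The single-GPU claim can be sanity-checked with simple arithmetic. The sketch below is a back-of-the-envelope estimate only, assuming weights stored at half precision and the H100's standard 80 GB of memory; real deployments also need room for activations, the KV cache, and framework overhead.

```python
# Back-of-the-envelope estimate of GPU memory needed to hold model weights.
# Illustrative only: ignores activations, KV cache, and framework overhead.

def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9

# Gemma 2 sizes from the announcement: 9B and 27B parameters.
for name, params in [("Gemma 2 9B", 9e9), ("Gemma 2 27B", 27e9)]:
    for precision, nbytes in [("fp16/bf16", 2), ("int8", 1)]:
        print(f"{name} @ {precision}: ~{weight_memory_gb(params, nbytes):.0f} GB")

# At bf16, the 27B weights come to roughly 54 GB, which fits within a
# single 80 GB H100 -- consistent with the single-GPU claim above.
```

At half precision the 27B weights occupy about 54 GB, leaving headroom on an 80 GB H100, which is why the model is practical to serve without multi-GPU sharding.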

In addition, Gemma 2 offers enhanced tuning features for developers across various platforms and tools. Whether using cloud-based solutions like Google Cloud or community platforms like Axolotl, Gemma 2 provides a wide range of fine-tuning options. Integrations with Hugging Face, NVIDIA TensorRT-LLM, and Google's JAX and Keras let researchers and developers achieve optimal performance and efficient deployment across hardware configurations.

When comparing Gemma 2 with Llama 3 70B, both models stand out in the open-source language model category. Google researchers claim that despite being much smaller, the 27B version of Gemma 2 performs comparably to Llama 3 70B. Additionally, the 9B version of Gemma 2 consistently outperforms Llama 3 8B on various benchmarks, including language understanding, coding, and mathematical problem solving.

One significant advantage of Gemma 2 over Meta's Llama 3 is its handling of Indian languages. Gemma 2 benefits from a tokenizer with a large 256k-token vocabulary, well suited to capturing the nuances of these languages. Although Llama 3 supports multiple languages, its smaller vocabulary and more limited training data make tokenizing Indic scripts less efficient. This gives Gemma 2 an edge in tasks involving Indian languages, making it a better choice for developers and researchers working in these fields.
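A toy illustration of why vocabulary size matters for Indic scripts: when a tokenizer has no dedicated entries for a script, it typically falls back to raw UTF-8 bytes, so each character costs several tokens. The token counts below are illustrative assumptions, not measurements of either model's actual tokenizer.

```python
# Why a small vocabulary inflates sequence length for Devanagari text:
# without dedicated subword pieces, tokenizers often fall back to one
# token per UTF-8 byte, and Devanagari characters are 3 bytes each.

text = "नमस्ते"  # "namaste" in Devanagari: 6 Unicode code points

utf8_bytes = text.encode("utf-8")
byte_fallback_tokens = len(utf8_bytes)  # 1 token per byte under fallback

print(f"code points:          {len(text)}")          # 6
print(f"byte-fallback tokens: {byte_fallback_tokens}")  # 18 (3 bytes/char)

# A hypothetical vocabulary with dedicated Devanagari pieces could encode
# the same word in one or two tokens, shortening sequences substantially.
```

Shorter token sequences mean lower latency and cost per request, and more of the context window left for actual content.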

Practical use cases for Gemma 2 include multilingual assistants, educational tools, coding assistance, and RAG systems. Despite showing significant progress, Gemma 2 still faces challenges in training data quality, multilingual capability, and accuracy.
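To make the RAG use case concrete, here is a minimal sketch of the retrieval step, using a toy bag-of-words cosine scorer in place of a real embedding model. The document snippets and prompt format are illustrative assumptions; in practice the retrieved passage would be prepended to the prompt sent to a Gemma 2 endpoint.

```python
# Minimal retrieval-augmented-generation (RAG) sketch: score documents
# against a query, then build a context-grounded prompt for the model.
# Toy bag-of-words scorer stands in for a real embedding model.

import math
from collections import Counter

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

docs = [
    "Gemma 2 ships in 9B and 27B parameter sizes.",
    "The 27B model runs inference on a single H100 GPU.",
    "Llama 3 is Meta's open language model family.",
]

def retrieve(query: str, k: int = 1) -> list:
    q = Counter(query.lower().split())
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(d.lower().split())),
                    reverse=True)
    return ranked[:k]

question = "What sizes does Gemma 2 come in?"
context = retrieve(question)[0]
prompt = f"Context: {context}\nQuestion: {question}"
print(prompt)
```

The generation step would then pass `prompt` to the model, so its answer is grounded in the retrieved text rather than in parametric memory alone.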

Highlights:

🌟 Gemma 2 is Google's latest open-source language model, offering faster and more efficient language processing.

🌟 The model is based on a decoder-only Transformer architecture, pre-trained with knowledge distillation and further refined through instruction tuning.

🌟 Gemma 2 has an advantage in handling Indian languages, making it well suited for practical applications such as multilingual assistants, educational tools, coding assistance, and RAG systems.
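The knowledge-distillation pre-training noted in the highlights can be sketched as follows: the student model is trained to match the teacher's softened output distribution over next tokens. This is a pure-Python toy with hypothetical logits, not Google's training code; real training operates over the full vocabulary across billions of token positions.

```python
# Knowledge-distillation objective sketch: minimize the KL divergence
# between the teacher's and student's temperature-softened distributions.

import math

def softmax(logits, temperature=1.0):
    exps = [math.exp(x / temperature) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) over softened next-token distributions."""
    p = softmax(teacher_logits, temperature)  # teacher = target
    q = softmax(student_logits, temperature)  # student = prediction
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

teacher = [2.0, 1.0, 0.1]  # hypothetical next-token logits
student = [1.5, 1.2, 0.3]

print(f"distillation loss: {distillation_loss(student, teacher):.4f}")
# The loss is zero exactly when the student matches the teacher.
```

Training against the teacher's full distribution gives the smaller student a richer signal per token than the one-hot next-token labels alone, which is how a 9B model can approach the quality of a much larger teacher.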