Rakuten Group has announced the launch of its first Japanese large language model (LLM) and small language model (SLM), named Rakuten AI 2.0 and Rakuten AI 2.0 mini.
The release of these two models aims to advance artificial intelligence (AI) development in Japan. Rakuten AI 2.0 is built on a mixture of experts (MoE) architecture as an 8x7B model: eight expert models with 7 billion parameters each. For each input token, a router selects the two most relevant experts to process it. The experts and the router were jointly trained on a large volume of high-quality Japanese-English bilingual data.
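To make the routing idea concrete, here is a minimal sketch of a top-2 MoE layer in PyTorch. This is purely illustrative and not Rakuten's actual implementation; the layer sizes, expert structure, and class name are assumptions chosen for readability.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Top2MoELayer(nn.Module):
    """Illustrative top-2 mixture-of-experts layer (not Rakuten's code)."""
    def __init__(self, d_model=64, d_ff=256, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        # Router: scores each token against every expert.
        self.router = nn.Linear(d_model, n_experts)
        # Eight independent feed-forward "experts".
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (batch, seq, d_model)
        scores = self.router(x)                          # (batch, seq, n_experts)
        top_w, top_idx = scores.topk(self.top_k, dim=-1) # keep the two best experts
        top_w = F.softmax(top_w, dim=-1)                 # normalize their weights
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = top_idx[..., k] == e              # tokens routed to expert e
                if mask.any():
                    out[mask] += top_w[..., k][mask].unsqueeze(-1) * expert(x[mask])
        return out

layer = Top2MoELayer()
tokens = torch.randn(1, 4, 64)  # one sequence of four token embeddings
print(layer(tokens).shape)      # torch.Size([1, 4, 64])
```

Because only two of the eight experts run per token, a layer like this activates roughly a quarter of its parameters on each forward pass, which is the efficiency argument behind the 8x7B design.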
Rakuten AI 2.0 mini is a new dense model with 1.5 billion parameters, designed for cost-effective deployment on edge devices and suited to specific application scenarios. It was likewise trained on mixed Japanese-English data. Both models have undergone instruction fine-tuning and preference optimization, and each is released as a base model and an instruction-tuned model to support businesses and professionals in developing AI applications.
All models are released under the Apache 2.0 license, and users can access them in the official Rakuten Group Hugging Face repository. Commercial uses include text generation, content summarization, question answering, text understanding, and dialogue system construction. The models can also serve as a foundation for further fine-tuning and downstream applications.
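Since the checkpoints are on Hugging Face, getting started looks like any standard transformers workflow. The sketch below is a minimal example; the repository ID is an assumption for illustration, so verify the exact model names on the official Rakuten Group Hugging Face page before use.

```python
# Minimal sketch of loading a released checkpoint with Hugging Face transformers.
# The model_id below is assumed for illustration; check Rakuten's Hugging Face
# page for the exact repository names.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Rakuten/RakutenAI-2.0-mini-instruct"  # assumed ID, verify before use

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "楽天グループについて簡単に説明してください。"  # "Briefly describe Rakuten Group."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```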
Ting Cai, Chief AI and Data Officer of Rakuten Group, stated: “I am incredibly proud of how our team has combined data, engineering, and science to launch Rakuten AI 2.0. Our new AI models offer powerful and cost-effective solutions that help businesses make smart decisions, accelerate value realization, and unlock new possibilities. By open-sourcing these models, we hope to accelerate AI development in Japan and encourage all Japanese companies to build, experiment, and grow, fostering a collaborative and win-win community.”
Official Blog: https://global.rakuten.com/corp/news/press/2025/0212_02.html
Key Points:
🌟 Rakuten Group launches its first Japanese large language model (LLM) and small language model (SLM), named Rakuten AI 2.0 and Rakuten AI 2.0 mini.
📊 Rakuten AI 2.0 uses a mixture of experts architecture with eight expert models of 7 billion parameters each, routing each token to the two most relevant experts to efficiently process Japanese-English bilingual data.
🛠️ All models are available in the official Rakuten Group Hugging Face repository under the Apache 2.0 license, support a range of text generation tasks, and can serve as a foundation for further development.