On July 19, 2024, the RWKV Open Source Foundation announced the global open-source release of the RWKV-6-World 14B model, currently the strongest dense, pure-RNN large language model. In the latest benchmark tests the model performs exceptionally well: its English capabilities are comparable to Llama2 13B, and it leads significantly in multilingual performance, supporting more than 100 languages as well as code.

The model's benchmark comparison covered four open-source large language models with roughly 14B parameters. English performance was evaluated on 12 independent benchmarks, while multilingual capability was assessed with the xLAMBADA, xStoryCloze, xWinograd, and xCopa benchmarks. RWKV-6-World 14B excelled across these tests; in the Uncheatable Eval ranking in particular, it surpassed both Llama2 13B and Qwen1.5 14B in overall score.
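For readers who want to reproduce this kind of comparison themselves, a common route is EleutherAI's lm-evaluation-harness. The sketch below is a hedged illustration only: the Hugging Face model id and the task selection are assumptions, not the Foundation's exact evaluation setup.

```python
# Hedged sketch: running a few of the cited benchmarks with EleutherAI's
# lm-evaluation-harness (pip install lm_eval). Model id and tasks are
# illustrative assumptions, not the official evaluation configuration.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    # Assumed HF repo id for an RWKV-6 14B checkpoint; substitute your own.
    model_args="pretrained=RWKV/v6-Finch-14B-HF,trust_remote_code=True",
    tasks=["lambada_openai", "xstorycloze_en", "xwinograd_en", "xcopa_it"],
)
for task, metrics in results["results"].items():
    print(task, metrics)
```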


The performance improvement of RWKV-6-World 14B is attributed to the architectural enhancements made from RWKV-4 to RWKV-6. The model was trained without incorporating any benchmark datasets and without benchmark-specific optimizations, so its actual capabilities are stronger than the score rankings suggest. In the Uncheatable Eval evaluation, RWKV-6-World 14B was assessed on real-time data such as the latest arXiv papers, news articles, AO3 fiction, and GitHub code, demonstrating its true modeling and generalization capabilities.
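The idea behind Uncheatable Eval is that text published after a model's training cutoff cannot have leaked into its training data, so the model's average loss on such text reflects genuine modeling ability. A minimal sketch of that measurement using the official rwkv pip package is shown below; the checkpoint name is an assumption, and the scoring loop is a simplified illustration rather than the actual Uncheatable Eval code.

```python
# Hedged sketch: an Uncheatable-Eval-style score, i.e. the average negative
# log-likelihood of the model on freshly published text. Simplified
# illustration; the real evaluation has its own data pipeline.
import torch.nn.functional as F
from rwkv.model import RWKV
from rwkv.utils import PIPELINE

# Assumed local checkpoint name (the rwkv package appends ".pth" itself).
model = RWKV(model="RWKV-x060-World-14B-v2.1-20240719-ctx4096",
             strategy="cuda fp16")
pipeline = PIPELINE(model, "rwkv_vocab_v20230424")

def avg_nll(text: str) -> float:
    """Average per-token negative log-likelihood of `text` under the model."""
    tokens = pipeline.encode(text)
    state, total = None, 0.0
    for cur, nxt in zip(tokens[:-1], tokens[1:]):
        logits, state = model.forward([cur], state)  # logits for next token
        total += -F.log_softmax(logits.float(), dim=-1)[nxt].item()
    return total / (len(tokens) - 1)

print(avg_nll("Paste a brand-new arXiv abstract or news paragraph here."))
```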

Currently, the RWKV-6-World 14B model can be downloaded for local deployment from platforms such as Hugging Face, ModelScope, and WiseModel. Because Ai00 only supports models in the safetensors (.st) format, pre-converted .st models can also be downloaded from the Ai00 HF repository. The GPU memory required for local deployment and inference of RWKV-6-World 14B ranges from roughly 10 GB to 28 GB, depending on the quantization method.
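As a concrete illustration of that memory trade-off, the official rwkv pip package selects precision through its strategy string. The sketch below is hedged: the checkpoint name is an assumption, and the VRAM comments are rough figures implied by the range above rather than measured values.

```python
# Hedged sketch: choosing a quantization level with the rwkv pip package.
from rwkv.model import RWKV

# Assumed local checkpoint name (the rwkv package appends ".pth" itself).
MODEL = "RWKV-x060-World-14B-v2.1-20240719-ctx4096"

# Full fp16 weights sit near the top of the stated VRAM range (~28 GB).
model = RWKV(model=MODEL, strategy="cuda fp16")

# int8-quantized weights roughly halve that footprint:
# model = RWKV(model=MODEL, strategy="cuda fp16i8")
```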

The preview of RWKV-6-World 14B's capabilities covers a variety of applications: natural language processing (sentiment analysis, machine reading comprehension), prose and poetry writing, reading and modifying code, suggesting financial thesis topics, extracting key points from news, expanding a single sentence into a longer text, and writing a Python snake game, among others.
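Any of these tasks can be tried locally with a plain prompt. The snippet below is a minimal sketch using the official rwkv pip package's pipeline; the checkpoint name is an assumption, and the sampling parameters are just reasonable defaults.

```python
# Hedged sketch: prompting the model for one of the previewed tasks
# (sentiment analysis) via the official rwkv pip package.
from rwkv.model import RWKV
from rwkv.utils import PIPELINE, PIPELINE_ARGS

# Assumed local checkpoint name (the rwkv package appends ".pth" itself).
model = RWKV(model="RWKV-x060-World-14B-v2.1-20240719-ctx4096",
             strategy="cuda fp16")
pipeline = PIPELINE(model, "rwkv_vocab_v20230424")

# World models respond well to a simple "User: ... / Assistant:" template.
prompt = ("User: Classify the sentiment of this review as positive or "
          "negative: \"The battery dies within two hours.\"\n\nAssistant:")

args = PIPELINE_ARGS(temperature=1.0, top_p=0.3)
print(pipeline.generate(prompt, token_count=64, args=args))
```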

It is important to note that all open-source RWKV models are base models: they have a certain level of instruction-following and dialogue capability but are not optimized for any specific task. If you want an RWKV model to perform well on a particular task, it is recommended to fine-tune it on a dataset for that task.
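The fine-tuning pipeline itself is beyond the scope of this announcement, but as a hedged sketch, task data for a World model is commonly packed into JSONL whose text field already uses the User/Assistant dialogue template the base model was exposed to. This layout follows common RWKV community practice; verify the exact format against the fine-tuning tool you use.

```python
# Hedged sketch: packing task-specific examples into JSONL for fine-tuning.
# The {"text": ...} layout and User/Assistant template follow common RWKV
# community practice; confirm the format your fine-tuning tooling expects.
import json

samples = [
    {"instruction": "Extract the key points from this news article.",
     "input": "<article text>",
     "output": "<expected summary>"},
]

with open("finetune_data.jsonl", "w", encoding="utf-8") as f:
    for s in samples:
        text = (f"User: {s['instruction']}\n{s['input']}\n\n"
                f"Assistant: {s['output']}")
        f.write(json.dumps({"text": text}, ensure_ascii=False) + "\n")
```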

Project Links: