IBM Enhances watsonx.ai: DeepSeek-R1 Distilled Version of Llama Model Launched

AIbase基地

Published inAI News · 2 min read · Feb 11, 2025

209

IBM recently announced that its AI development platform watsonx.ai now supports the distilled versions of the Llama3.18B and Llama3.370B models, known as DeepSeek-R1. DeepSeek optimizes various Llama and Qwen variants using data generated by the R1 model through knowledge distillation technology, further enhancing model performance.

On the watsonx.ai platform, users can utilize the DeepSeek distilled models in two ways. First, IBM offers the Llama distilled version in the "On-Demand Deployment" catalog, allowing users to deploy dedicated instances to ensure secure inference. Secondly, users can also import other variants of DeepSeek-R1, such as the Qwen distilled model, through the "Custom Base Model" upload feature to meet diverse application needs.

DeepSeek

DeepSeek-R1 possesses powerful inference capabilities, making it suitable for a wide range of fields and providing efficient and flexible AI solutions for enterprises and developers. This update further enriches the model ecosystem of watsonx.ai, enabling users to develop and deploy AI applications more conveniently.

watsonx.ai DeepSeek-R1 Llama Qwen

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Microsoft's New Open-Source Model MAI-DS-R1: Improved Sensitive Topic Response and Reduced Safety Risks

Apr 18, 2025

Kimina-Prover: An Open-Source Mathematical Theorem Proving Model

The Kimi team recently released a technical report and open-sourced the preview version of Kimina-Prover, including 1.5B and 7B parameter distilled models, the Kimina-Autoformalizer-7B model for data generation, and a revised miniF2F benchmark dataset. Kimina-Prover, jointly developed by the Numina and Kimi teams, is a mathematical theorem proving model that excels in the field of formal theorem proving.

Apr 17, 2025

180

Meta's Llama-4-Maverick Plummets in Rankings, Raising Concerns of Benchmark Manipulation

Meta's open-source large language model, Llama-4-Maverick, has experienced a dramatic drop in LMArena rankings, plummeting from second place to 32nd. This significant shift has sparked widespread skepticism among developers, who suspect Meta may have manipulated the benchmark by submitting a specially optimized version. The issue stems from Meta's April 6th release of its latest large language model, Llama 4, encompassing three versions: Scout, Maverick, and Behemoth.

Apr 14, 2025

610

Stanford Report Confirms: Alibaba's Qwen Ranks Third Globally in Large Model Contribution, Reshaping Global Competition with Computing Power!

Stanford University's AI Index Report 2025 offers a fresh perspective on the global AI landscape. The report highlights Alibaba's significant contribution, ranking third globally among major large language models, establishing it as a leading Chinese tech company. In 2024, China contributed 15 models globally, with Alibaba contributing 6, trailing only Google and OpenAI with 7 models each. This achievement reflects Alibaba's ongoing commitment to technological innovation.

Apr 12, 2025

990

AI Code Model Open-Source Boom: Cogito v1 Preview Unveiled, 70B Parameter Model Outperforms Llama 4

Recently, the AI code generation field has seen a surge in open-source releases, with several heavyweight models making their debut. Among them, the Cogito v1 Preview series from Deep Cogito stands out. According to AIbase, this new family of open-source models includes various sizes: 3B, 8B, 14B, 32B, and 70B parameters. Not only does it outperform competitors in its class, but its 70B version even surpasses Meta's recently released Llama 4 109B MoE model, sparking considerable industry discussion.

Apr 10, 2025

820

Llama 4 Arrives on Vertex AI: Deploy Meta's New Model with One Click

Google Cloud Platform recently announced that Meta's latest generation of open-source large language models, Llama 4, is now available in its Vertex AI Model Garden. The news has generated significant excitement in the global tech community. The Scout and Maverick models from the Llama 4 series are now integrated into Vertex AI and available to developers via fully managed Model-as-a-Service (MaaS) API endpoints in preview.

Apr 10, 2025

260

From Text to Complex Characters: The OmniSVG, the Most Powerful SVG Generation Model, Has Arrived!

On April 9th, 2025, a powerful SVG (Scalable Vector Graphics) generation model named OmniSVG was officially unveiled, marking a new stage in vector graphic generation technology. Jointly developed by StepFun and Fudan University, this model is hailed as the most advanced SVG generation model currently available. Its outstanding multi-modal generation capabilities and efficient performance have attracted widespread attention. OmniSVG's technological breakthrough is based on a pre-trained Vision-Language Model (VLM)...

Apr 10, 2025

270

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B, Outperforming Llama 4 Behemoth

Apr 9, 2025

520

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B: Redefining AI Performance Standards

NVIDIA, a global leader in chip and AI technology, recently launched a groundbreaking new open-source large language model, Llama 3.1 Nemotron Ultra 253B, generating significant excitement within the AI community. Built upon Meta's Llama-3.1-405B, this model boasts innovative optimizations that surpass competitors like Llama 4 Behemoth and Maverick in performance, while demonstrating superior resource efficiency and exceptional multi-tasking capabilities.

Apr 9, 2025

310

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B: A New Benchmark in Performance

On April 8th, 2025, NVIDIA launched Llama 3.1 Nemotron Ultra 253B, an open-source model optimized from Llama-3.1-405B. With 25.3 billion parameters, it surpasses Meta's Llama 4 Behemoth and Maverick, becoming a focal point in the AI field. This model demonstrates superior performance in benchmarks such as GPQA-Diamond, AIME 2024/25, and LiveCodeBench, achieving inference throughput comparable to DeepSeek.

Apr 9, 2025

430

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

IBM Enhances watsonx.ai: DeepSeek-R1 Distilled Version of Llama Model Launched

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Microsoft's New Open-Source Model MAI-DS-R1: Improved Sensitive Topic Response and Reduced Safety Risks

Kimina-Prover: An Open-Source Mathematical Theorem Proving Model

Meta's Llama-4-Maverick Plummets in Rankings, Raising Concerns of Benchmark Manipulation

Stanford Report Confirms: Alibaba's Qwen Ranks Third Globally in Large Model Contribution, Reshaping Global Competition with Computing Power!

AI Code Model Open-Source Boom: Cogito v1 Preview Unveiled, 70B Parameter Model Outperforms Llama 4

Llama 4 Arrives on Vertex AI: Deploy Meta's New Model with One Click

From Text to Complex Characters: The OmniSVG, the Most Powerful SVG Generation Model, Has Arrived!

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B, Outperforming Llama 4 Behemoth

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B: Redefining AI Performance Standards

NVIDIA Unveils Llama 3.1 Nemotron Ultra 253B: A New Benchmark in Performance