The LLM-Shearing pruning method, developed by Danqi Chen's team at Princeton University, combines structured pruning with dynamic batch loading to shrink large language models into smaller, more efficient versions, substantially reducing the computational resources required. The pruned models perform well across a wide range of downstream tasks, demonstrating strong versatility and offering a new route to building capable medium-scale language models.
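The dynamic batch loading idea can be illustrated with a minimal sketch: the sampling proportion of each pretraining data domain is periodically re-weighted so that domains whose current loss is furthest above a reference loss are sampled more often. This is not the authors' implementation; the function name, update rule details, domain names, and all numbers below are illustrative assumptions.

```python
import math

def update_proportions(proportions, losses, ref_losses, lr=1.0):
    """Illustrative exponentiated-gradient-style update (assumed form, not
    the paper's exact rule): domains whose loss exceeds their reference
    loss get a larger sampling share in subsequent batches."""
    excess = {d: max(losses[d] - ref_losses[d], 0.0) for d in proportions}
    scores = {d: proportions[d] * math.exp(lr * excess[d]) for d in proportions}
    total = sum(scores.values())
    return {d: s / total for d, s in scores.items()}

# Made-up example: "code" is furthest above its reference loss,
# so its sampling share grows at the expense of the others.
props = {"web": 0.5, "code": 0.25, "books": 0.25}
losses = {"web": 2.1, "code": 3.0, "books": 2.0}
refs = {"web": 2.0, "code": 2.5, "books": 2.0}
new_props = update_proportions(props, losses, refs)
```

The appeal of this kind of rule is that it needs only per-domain loss statistics that are already computed during training, so re-balancing the data mixture adds essentially no overhead.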