ExLlamaV2 is an inference library designed to run large language models efficiently on consumer-grade GPUs. It supports EXL2, a new tunable quantization format, and delivers roughly 1.5 to 2x faster inference than the original ExLlama. The project aims to be an easy-to-use LLM inference solution: it is compatible with HuggingFace model layouts and ships with interactive examples for trying models out of the box. Overall, ExLlamaV2 offers a practical way to run large language models on home GPU hardware.
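
To make the "easy-to-use" claim concrete, the snippet below sketches a minimal generation loop modeled on the example scripts in the ExLlamaV2 repository. The class and parameter names (`ExLlamaV2Config`, `ExLlamaV2BaseGenerator`, `generate_simple`, the model path, and the sampler values shown) follow the project's examples at the time of writing and may differ between versions, so treat this as an illustrative sketch rather than a pinned API reference.

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

# Point the config at a directory containing a quantized (e.g. EXL2) model;
# the path here is a placeholder.
config = ExLlamaV2Config()
config.model_dir = "/path/to/exl2-model"
config.prepare()

# Load the model weights and allocate the key/value cache on the GPU.
model = ExLlamaV2(config)
model.load()
tokenizer = ExLlamaV2Tokenizer(config)
cache = ExLlamaV2Cache(model)

# The base generator wraps model, cache, and tokenizer for simple one-shot generation.
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

# Sampler settings are plain attributes; these values are illustrative defaults.
settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.85
settings.top_p = 0.8

output = generator.generate_simple("Once upon a time,", settings, num_tokens=150)
print(output)
```

The library also includes streaming generators for chat-style, token-by-token output; the base generator above is simply the smallest entry point for batch-style completion.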