2025-03-07 16:19:22.AIbase.
No Training Needed! Q-Filters Enable Efficient Compression of KV Cache and Improved Inference Performance
2025-03-06 10:52:45.AIbase.
IBM Launches Compact AI Model Granite 3.2, Emphasizing Efficient Inference and Practicality
2025-03-06 10:04:01.AIbase.
No Need for High-End Hardware! Alibaba Open-Sources the New Inference Model Tongyi Qianwen QwQ-32B; Consumer-Grade GPUs Achieve S-Tier Performance!
2025-03-06 09:17:43.AIbase.
Alibaba Open-Sources New Inference Large Model QwQ-32B, Rivaling DeepSeek-R1 with Lower VRAM Requirements
2025-03-03 21:42:17.AIbase.
Qwen2.5-Max Inference Model Launched by Tongyi Lingma
2025-03-03 09:45:19.AIbase.
DeepSeek Open Source Week Day Six: Extreme Inference Optimization System for Enhanced GPU Computing Efficiency
2025-03-02 10:26:31.AIbase.
DeepSeek Unveiled: The Astonishing 545% Profit Margin Behind Its AI Inference System
2025-02-28 11:08:32.AIbase.
ByteDance Launches AIBrix: A New Open-Source Inference System Designed for Large Language Models
2025-02-28 09:03:39.AIbase.
Free to Use! ByteDance's AI Programming Software Trae Integrates Claude 3.7; Developers Rejoice!
2025-02-26 09:33:45.AIbase.
DeepSeek Open Source Week Day 3: Announcing DeepGEMM, an FP8 GEMM Library for AI Training and Inference
2025-02-25 08:26:15.AIbase.
AI Programming Tool Cursor Integrates Claude 3.7 Sonnet Reasoning Model
2025-02-25 08:16:56.AIbase.
Anthropic Unveils Claude 3.7 Sonnet: A Hybrid Reasoning Model Surpassing DeepSeek
2025-02-18 20:33:46.AIbase.
DeepSeek Launches NSA Technology: Accelerating Long-Context Training and Inference
2025-02-12 14:04:43.AIbase.
ByteDance's UltraMem Architecture Reduces Large Model Inference Costs by 83%
2025-02-10 14:16:32.AIbase.
DeepSeek Full Series Launches on iFlytek Open Platform with Limited-Time Free Inference API
2025-02-08 09:38:42.AIbase.
In Response to DeepSeek Challenge, OpenAI Makes o3-mini's Inference Process Public
2025-02-06 10:57:40.AIbase.
The DeepSeek-R1 Model Faces Severe Hallucination Issues, Challenging Its Inference Ability and Accuracy
2025-01-24 10:48:09.AIbase.
Sakana AI's Transformer² Model Breaks LLM Limitations, Achieving Dynamic Inference
2025-01-24 10:04:42.AIbase.
Pipeshift Launches Modular Inference Engine, Reducing AI Inference GPU Utilization by 75%
2025-01-22 14:28:54.AIbase.