Recently, Zhipu AI announced the open-source release of its edge-side large language and multimodal model series, GLM-Edge. The release marks an important step in the company's effort to bring large-model capabilities to real-world use cases on edge devices. The GLM-Edge series consists of four models of different sizes: GLM-Edge-1.5B-Chat, GLM-Edge-4B-Chat, GLM-Edge-V-2B, and GLM-Edge-V-5B, which are optimized for mobile platforms such as smartphones and in-vehicle systems, as well as desktop platforms such as PCs.


Building on the technological foundation of the GLM-4 series, Zhipu's research team has adjusted the model architecture and size to strike the best balance among model quality, real-time inference performance, and ease of deployment. Through close collaboration with hardware partners and targeted inference optimizations, the GLM-Edge series has demonstrated exceptional inference speed on several edge platforms. Notably, on the Qualcomm Snapdragon 8 Elite platform, leveraging NPU compute and a mixed quantization scheme, the 1.5B chat model and the 2B multimodal model can achieve decoding speeds of over 60 tokens per second. With speculative sampling, the decoding speed can exceed 100 tokens per second.
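The speculative-sampling speed-up mentioned above comes from letting a small draft model propose several tokens cheaply, then having the large model verify the whole proposal in one pass. The sketch below is a toy, greedy illustration of that accept/verify loop, not Zhipu's actual implementation: `target_next` and `draft_next` are hypothetical stand-in functions over an integer vocabulary, standing in for the full GLM-Edge model and a much smaller draft model.

```python
# Toy illustration of greedy speculative decoding. The two "models" below are
# deterministic stand-ins, NOT real networks: target_next plays the role of
# the expensive edge model, draft_next the cheap draft model.

def target_next(ctx):
    # Expensive model: next token is a deterministic function of the context.
    return (sum(ctx) * 31 + len(ctx)) % 100

def draft_next(ctx):
    # Cheap model: agrees with the target most of the time.
    t = target_next(ctx)
    return t if len(ctx) % 4 != 0 else (t + 1) % 100

def greedy_decode(prompt, n_tokens):
    # Reference: plain greedy decoding with the target model only.
    out = list(prompt)
    for _ in range(n_tokens):
        out.append(target_next(out))
    return out[len(prompt):]

def speculative_decode(prompt, n_tokens, k=4):
    """The draft proposes k tokens; the target verifies them, keeps the
    longest agreeing prefix, and supplies one token of its own."""
    out = list(prompt)
    target_calls = 0
    while len(out) - len(prompt) < n_tokens:
        # 1) Draft proposes k tokens autoregressively (cheap).
        proposal, ctx = [], list(out)
        for _ in range(k):
            tok = draft_next(ctx)
            proposal.append(tok)
            ctx.append(tok)
        # 2) Target verifies the proposal. In a real implementation all k+1
        #    positions are scored in ONE forward pass -- that batching is
        #    where the speed-up comes from.
        target_calls += 1
        for tok in proposal:
            expected = target_next(out)
            if tok == expected:
                out.append(tok)          # accept the draft token
            else:
                out.append(expected)     # replace first mismatch, stop
                break
        else:
            out.append(target_next(out))  # bonus token: all k accepted
        if len(out) - len(prompt) > n_tokens:
            out = out[: len(prompt) + n_tokens]
    return out[len(prompt):], target_calls
```

Because mismatched draft tokens are replaced by the target's own choice, the output is identical to pure greedy decoding with the target model; only the number of expensive target passes shrinks.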

The open-source release of the GLM-Edge series not only showcases the company's technological prowess in artificial intelligence but also provides developers and researchers with powerful tools and resources to advance the development of edge-side AI applications.

GLM-Edge Collection:

https://modelscope.cn/collections/GLM-Edge-ff0306563d2844