ByteDance officially launches its latest Doubao large model 1.5 Pro (Doubao-1.5-pro), which demonstrates outstanding comprehensive capabilities in various fields, successfully surpassing the well-known GPT-4o and Claude3.5Sonnet in the industry. The release of this model marks an important step forward for ByteDance in the field of artificial intelligence. Doubao 1.5 Pro adopts a novel sparse MoE (Mixture of Experts) architecture, utilizing a smaller set of activation parameters for pre-training. This design's innovation...
In today's digital world, the use of short text has become central to online communication. However, these texts often lack common vocabulary or context, posing numerous challenges for Artificial Intelligence (AI) during analysis. In response, Justin Miller, an English Literature graduate student and data scientist from the University of Sydney, proposed a novel approach that utilizes Large Language Models (LLMs) to gain deeper understanding and analysis of short texts. Miller's research focuses on how to analyze a vast array of short texts, such as social media profiles,
Google Research recently announced the innovative 'Titans' series model architecture, achieving a groundbreaking 2 million token context length through bionic design, with plans to open-source related technologies in the future. The core innovation of this architecture is the introduction of a Deep Neural Long-Term Memory Module, inspired by the human memory system. Titans cleverly combines the rapid response capability of short-term memory with the persistence characteristics of long-term memory, while utilizing an attention mechanism to handle immediate context, forming an efficient information processing system.
On January 20, 2025, Doubao App officially released its latest 'end-to-end' voice large model, with significant updates to its real-time voice call functionality. This development marks another leap for Doubao in the field of voice interaction, surpassing the previous ASR (Automatic Speech Recognition), LLM (Large Language Model), and TTS (Text-to-Speech) cascading solutions by integrating voice recognition, understanding, and generation into a single model. After testing by 'Intelligent Emergence,' the standout feature of the new Doubao version is its human-like capabilities.