2024-11-15 08:36:19 · AIbase · 13.2k
iFLYTEK Spark Multimodal Interaction Model Launched, Achieving 'Voice, Vision, and Digital Human Interaction' Integration
iFLYTEK recently announced the official launch of the iFLYTEK Spark Multimodal Interaction Model. The release marks the company's move from voice-only interaction to real-time audio-visual multimodal interaction. The new model integrates voice, vision, and digital human interaction, and users can combine all three seamlessly with a single button.
2024-07-05 13:41:48 · AIbase · 10.1k
SenseTime Unveils "Ri Ri Xin 5o" (Daily New 5o): Real-Time Streaming Multimodal Interaction to Rival GPT-4o
At the 2024 World Artificial Intelligence Conference, SenseTime released "Ri Ri Xin 5o", which it describes as China's first "What You See Is What You Get" model. The model offers an interactive experience comparable to GPT-4o, with real-time streaming multimodal interaction. By integrating cross-modal information such as sound, text, images, and video, it can understand and respond in real time. For example, it can recognize the name tags worn by staff, determine the venue location,