Title: AI Daily: SenseTime's Vimi Video Generation Model Opens for Beta Testing; Tencent's AI Video Features Go Live; UltraPixel Generator Directly Produces 6K Images Summary: In the latest AI news, SenseTime has launched the beta testing phase for its Vimi video generation model, showcasing advancements in AI-driven content creation. Tencent's AI-powered video platform, Tencent Zhiying, has introduced new AI video functions, enhancing the capabilities of digital media production. Additionally, UltraPixel, a cutting-edge image generator, now offers the ability to directly produce high-resolution 6K images, pushing the boundaries of digital imaging technology.

Welcome to the AI Daily section! Here, you'll find your daily guide to exploring the world of artificial intelligence. Each day, we bring you the hottest topics in the AI field, focusing on developers to help you understand technology trends and discover innovative AI product applications.

Discover fresh AI products by clicking here: https://top.aibase.com/

1. SenseTime Launches Vimi Video Generation Large Model; Vimi Camera Opens Beta for C-End Applications

SenseTime introduced the Vimi video generation large model at the 2024 World Artificial Intelligence Conference (WAIC), offering precise facial and body control for users, supporting multiple driving methods, and generating highly consistent video content with outstanding stability. Vimi Camera, as the first C-end application, meets the entertainment and creative needs of a wide female audience, supporting diverse generation styles and personalized creation.

AiBase Highlights:

👩‍💻 The Vimi model leverages SenseTime's advanced large model technology to generate character videos consistent with target movements, featuring years of accumulated facial tracking technology and precise control capabilities.

🎥 Vimi can produce single-shot character videos over a minute long without degradation in visual quality over time, supporting environmental scene adjustments and realistic visual effect simulations.

📸 Vimi Camera allows users to upload high-definition character images to generate digital avatars and portrait videos, offering diverse generation styles and fun character expression packs.

2. Limited-Time Free Offer! Tencent Zhiying Mini Program Launches 'AI Video' Feature

The Zhiying Mini Program has introduced a new 'AI Video' feature that allows users to convert ordinary videos into stylized ones, particularly anime style, enhancing the appeal of videos. This feature is currently available for free to help users improve video aesthetics and趣味性.

AiBase Highlights:

🎥 One-click operation: Easy to get started, even beginners can create professional-level stylized videos.

🎨 Multiple style templates: Offers diverse templates to enhance video aesthetics and storytelling.

🚀 Boosts video dissemination: Stylized videos are easy to share, attracting more viewers.

3. UltraPixel: A Tool for Generating Ultra-High Resolution Images

UltraPixel is a cutting-edge technology capable of generating ultra-high resolution images, bringing good news to designers and creators. Through Stable cascade training and fine-tuning, it supports direct generation of images from 1K to 6K resolution. Its techniques include implicit neural representation and scale-aware normalization layers, maintaining high detail and realism. It also processes efficiently within minimal space, with parameter utilization as high as 97%, enhancing training and inference efficiency.

QQ截图20240709110659.jpg

AiBase Highlights:

🔍 UltraPixel supports direct generation of images from 1K to 6K resolution, with details as fine as pores, clear and sharp.

🚀 Based on Stable cascade training and fine-tuning, it is set to be open-sourced, allowing more people to experience the charm of this technology.

💡 Guides high-resolution image generation through rich semantic information in low-resolution images, reducing complexity while maintaining high detail and realism.

Details link: https://top.aibase.com/tool/ultrapixel

4. Groq Launches Lightning-Fast LLM Engine, Attracting 280,000 Developers in Just Four Months

Groq recently launched a lightning-fast LLM engine, garnering widespread attention. This engine processes 1256.54 tokens per second, far exceeding GPU speeds, showcasing the rapid and flexible nature of LLM chatbots. Groq offers free LLM workload services, with over 280,000 developers already using them. CEO Ross predicts that by next year, half of the global inference computing will run on Groq's chips.

AiBase Highlights:

🚀 Groq's LLM engine processes 1256.54 tokens per second, significantly faster than GPUs.

🤖 The engine demonstrates the speed and flexibility of LLM chatbots, attracting both developers and non-developers.

💻 Groq offers free LLM workload services, with over 280,000 developers using them, and predicts that half of the global inference computing will run on its chips.

5. Autonomous Vehicle Team Launches Cinematic AI Visual Effects Odyssey

The autonomous vehicle team has ventured into Hollywood, launching revolutionary cinematic AI visual effects Odyssey, which disrupts the way movies, TV shows, and video games are made. Odyssey can generate Hollywood-level story shots, breaking through video AI barriers to achieve complete control over the core layers of visual storytelling. Inspired by Pixar, the goal is to use AI to produce film and TV works, solving the problem of AI controllability.

AiBase Highlights:

🎬 Odyssey achieves complete control over the core layers of visual storytelling, generating high-quality scene elements and aspects.

🌟 Introduces a more powerful generative model, training four models to achieve fine configuration of scene details.

🚗 The team is closely related to autonomous driving, with founders having rich experience in the autonomous driving field.

Details link: https://top.aibase.com/tool/odyssey

6. Report: OpenAI's Internal Forum Hacked, Confidential Data Stolen

Recently, the internal forum of the renowned artificial intelligence company OpenAI was hacked, raising security concerns, and employees are worried that security vulnerabilities could be exploited. The company has updated encrypted chat records to enhance data security and established a security and security committee to strengthen security measures. Global cooperation in addressing the challenges brought by AI has become尤为 important.

AiBase Highlights:

💡 OpenAI's internal forum was hacked, raising questions about the company's security, and employees are worried that security vulnerabilities could be exploited.

💡 Discovered a security vulnerability in the ChatGPT macOS application, the company has updated encrypted chat records to enhance data security.

💡 OpenAI successfully thwarted multiple secret influence operations from Russia and Israel, establishing a security and security committee to strengthen security measures.

7. Meta AI Develops Compact Language Model MobileLLM for Mobile Devices

Meta AI research team has introduced MobileLLM, a new method for designing efficient language models for smartphones and other resource-constrained devices. This research challenges assumptions about the scale of effective AI models, achieving a performance improvement of 2.7% to 4.3%. The development of MobileLLM aligns with the demand for more efficient AI models, which is not yet available to the public but has open-sourced pre-training code.

AiBase Highlights:

🔑 MobileLLM is an efficient language model designed for resource-constrained devices, challenging the necessity of large models.

🚀 Innovations in MobileLLM include prioritizing model depth, utilizing embedded sharing and grouped query attention, and adopting direct block weight sharing technology.

💡 MobileLLM performs excellently on benchmark tasks, with a 350 million parameter version performing comparably to a 7 billion parameter model on certain tasks.

8. Poe Social Platform Launches Previews Feature

The Poe social platform has introduced an innovative feature called Previews, bringing an unprecedented interactive experience and marking a new era in AI social interaction. The Previews feature is intuitive and easy to use, allowing users to view AI-generated web applications in real-time within the chat interface and interact instantly, enhancing the quality of interaction between users and AI.

AiBase Highlights:

🚀 AI social interaction enters a new era, with the Previews feature allowing users to intuitively operate AI-generated web applications.

💡 The Previews feature is easy to use and intuitive, allowing users to naturally interact with AI in real-time.

💻 Suitable for large language models, providing ordinary users with the opportunity to access advanced AI programming applications, increasing the attractiveness of the Poe platform.

9. Xinsir Open-Sources Controlnet++ Model Supporting Over Ten Types of Conditional Control Including Openpose and Canny

Xinsir's latest open-source Controlnet++ model offers multiple control conditions, capable of generating high-quality images, especially suitable for designers who require fine editing. Based on the ControlNet architecture, the model adds modules that support over ten different control types, providing examples of image generation under various control conditions. Although it is currently not available on Web UI and Comfyui, its versatility and high-quality output make it a significant breakthrough in the text-to-image generation field.

AiBase Highlights:

🔧 Controlnet++ supports inputs like Openpose and Canny, avoiding frequent model switching.

🧩 The model is designed with multiple controls, using the same network parameters to achieve image generation under different conditions.

🚀 Controlnet++ performs excellently in SDXL experiments, providing examples of image generation under various control conditions.

Details link: https://top.aibase.com/tool/controlnet-

10. Alipay's Medical Large Model Outperforms GPT-4 in Chinese and English Exams

Alipay's medical large model outperforms GPT-4 in Chinese and English exams and has been implemented in hospitals in Jiangsu, Zhejiang, and Shanghai. The model possesses multi-modal capabilities, with an accuracy rate of over 90%, and can provide intelligent question answering, medical record structuring, and retrieval services. Alipay has partnered with multiple institutions to launch an AI medical co-construction plan, committed to improving medical efficiency and data security.

AiBase Highlights:

🏥 Alipay's medical large model outperforms GPT-4 in Chinese and English exams and has been implemented in first-tier hospitals.

💡 The model possesses multi-modal capabilities, with an accuracy rate of over 90%, providing intelligent question answering, medical record structuring, and retrieval services.

🔒 Alipay has taken multiple measures to ensure the reliability of technology and data privacy security, promoting the large-scale application of artificial intelligence.

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

站长之家

This article is from AIbase Daily

AI News Recommendations

SenseTime's Vimi Camera Renamed to Performance Package APP Now Officially Launched in Various App Stores