Welcome to the 【AI Daily】 column, your daily guide to the world of artificial intelligence. Every day we cover the hottest topics in AI with a focus on developers, helping you follow technology trends and innovative AI product applications.

Fresh AI products, click to learn more: https://top.aibase.com/

1. OpenAI introduces screen sharing and video chat features; ChatGPT gets a "Santa Claus mode"

OpenAI recently added video chat and screen sharing to its advanced voice mode, allowing users to interact with ChatGPT in real time through the mobile app. The feature is currently available to ChatGPT Team, Plus, and Pro users, with plans to expand to enterprise and educational users in January next year. Although users in the EU and some other countries cannot access it yet, the launch marks a significant advance in the interactivity and practicality of ChatGPT.

【AiBase Summary:】

🎥 The new video chat feature allows ChatGPT to respond in real time to what users see.

🖥️ Screen sharing is live, enabling users to request assistance from ChatGPT on their phones.

🎅 "Santa Claus mode" is available, allowing users to talk with ChatGPT in a voice that mimics Santa Claus.

2. Great news! Anthropic's fastest model Claude 3.5 Haiku is now fully available

Anthropic has released its latest Claude 3.5 Haiku model, which is now open to all users. The model has gained widespread attention for its efficiency and strong benchmark performance, which make it particularly suitable for real-time tasks and large dataset processing. It has some limitations, such as the lack of web browsing and image generation support, but its versatility in chatbot use and its integration with Claude Artifacts enhance the user experience.

【AiBase Summary:】

🌟 Claude 3.5 Haiku is now fully available, supporting image and file analysis features.

💰 The free version has message limits; users can opt for the $20 Claude Pro subscription for expanded access.

📈 The model performs well in various benchmarks and is suitable for real-time tasks and large dataset processing.

3. Shanghai AI Lab proposes "fingerprint recognition" method REEF to combat "shell" behavior

In the AI era, protecting the intellectual property of large language models (LLMs) is crucial. The REEF method proposed by the Shanghai AI Lab uses feature representations as a model fingerprint, identifying "shell" models (models repackaged from an existing LLM) without affecting model performance. REEF's robustness and theoretical guarantees keep it effective against various kinds of fine-tuning and modification, providing a new means of combating unauthorized use.

【AiBase Summary:】

🔍 REEF is a feature representation-based model fingerprint recognition method that does not rely on specific layer representations and is highly robust.

💡 The method identifies potential "shell" models by computing the centered kernel alignment (CKA) similarity between the feature representations that two models produce on the same samples.

📈 Experimental results show that REEF outperforms existing methods in identifying "shell" models, providing new tools for protecting LLM intellectual property.

Details link: https://arxiv.org/pdf/2410.14273
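
The CKA comparison described above can be sketched in a few lines. This is a minimal illustration, not the REEF implementation: it assumes the linear-kernel form of CKA (the summary does not say which kernel is used), and the function names and toy feature matrices are hypothetical. Each matrix holds one row per sample and one column per feature dimension.

```python
import math

def center_columns(X):
    """Subtract each column's mean so every feature has zero mean."""
    n, d = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(d)]
    return [[row[j] - means[j] for j in range(d)] for row in X]

def cross_frobenius_sq(X, Y):
    """Squared Frobenius norm of X^T Y, for matrices with the same row count."""
    n = len(X)
    total = 0.0
    for i in range(len(X[0])):
        for j in range(len(Y[0])):
            s = sum(X[k][i] * Y[k][j] for k in range(n))
            total += s * s
    return total

def linear_cka(X, Y):
    """Linear CKA between two feature matrices computed on the same samples.

    Assumes neither matrix is constant across samples (otherwise the
    denominator is zero).
    """
    Xc, Yc = center_columns(X), center_columns(Y)
    num = cross_frobenius_sq(Xc, Yc)
    den = math.sqrt(cross_frobenius_sq(Xc, Xc) * cross_frobenius_sq(Yc, Yc))
    return num / den

# Toy features for four samples (illustrative values only).
X = [[1.0, 2.0], [3.0, 1.0], [0.0, 4.0], [2.0, 2.0]]
# Y is X with its columns swapped and scaled by 2: the same representation
# in disguise, so the similarity should be very close to 1.
Y = [[2.0 * b, 2.0 * a] for a, b in X]
# Z is an unrelated one-dimensional feature.
Z = [[1.0], [0.0], [0.0], [1.0]]

print(linear_cka(X, Y))
print(linear_cka(X, Z))
```

Linear CKA is unchanged by rescaling or permuting feature dimensions, which is why a score near 1 hints that two models share the same underlying representation even when their raw activations differ.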

4. Runway Act-One alternative! HelloMeme makes meme video creation easier!

HelloMeme is a tool designed to simplify the process of creating meme videos. By optimizing attention mechanisms, the model captures facial expressions and motion details more accurately. HelloMeme's three main components work together to make videos livelier and clearer while maintaining compatibility with the SD1.5 model.

【AiBase Summary:】

🎥 HelloMeme improves and simplifies meme video production by optimizing attention mechanisms.

🤖 It consists of HMReferenceNet, HMControlNet, and HMDenoisingNet, which work together to generate high-quality videos.

💡 HelloMeme is compatible with the SD1.5 model, retaining original model functions while adding new capabilities to improve video quality.

Details link: https://songkey.github.io/hellomeme/

5. Meta launches new watermark tool Video Seal to combat AI-generated deepfake videos!

Meta's new Video Seal tool adds nearly imperceptible watermarks to AI-generated videos to tackle the challenges posed by deepfake technology. The tool is open-source and is designed to integrate with existing software to better protect video originality.

【AiBase Summary:】

🔍 Meta Video Seal adds watermarks to AI-generated videos that are designed to withstand editing and compression.

📊 This tool is open-source, designed to integrate with existing software, and aims to advance watermark technology in the industry.

🏆 Meta will also launch a public leaderboard to compare different watermark methods, promoting industry collaboration and communication.

6. OpenAI CFO reveals: New generation AI model development will cost billions, costs are skyrocketing!

OpenAI's CFO revealed in New York that the cost of building more advanced AI models will continue to rise significantly and is expected to reach billions of dollars. The trend reflects the dual pressures of technological advancement and market demand, prompting the company to increase its investment in AI technology.

【AiBase Summary:】

💰 OpenAI expects the development costs for the new generation of AI models to continue to soar, reaching billions of dollars.

📈 The company is increasing its investment in advanced AI systems, and service prices may rise in the future.

🎥 The newly launched AI video generator Sora has received positive feedback, providing more possibilities for content creators.

7. Google and Samsung team up to "flip the table"! New mixed reality headsets and AI glasses revealed, targeting Apple's Vision Pro

Google and Samsung have jointly unveiled a new generation of mixed reality headsets and smart AI glasses, showcasing their ambitions in the mixed reality field. The two devices bring significant hardware upgrades and deeply integrate Google's latest Gemini AI model, which can understand user intent and retain long-term memory to provide personalized services. They support several natural interaction methods, making the user experience smoother.

【AiBase Summary:】

🛠️ The new devices support VR and AR functionality and apply AI technology deeply to provide personalized services.

🗣️ Interaction through gestures, voice, and eye movements enhances the user experience.

📱 The devices run the Android XR operating system, which adapts existing applications seamlessly and lowers the barrier for developers.

Details link: https://android-developers.googleblog.com/2024/12/introducing-android-xr-sdk-developer-preview.html

8. Google's flagship TPU Trillium is now open for use! Performance skyrockets, AI model training efficiency reaches new heights

Google's latest Trillium TPU is now available to Google Cloud customers, bringing significantly improved performance and efficiency to AI model training. Through an optimized hardware and software architecture, Trillium delivers substantial gains in both training and inference performance, supporting the development and deployment of AI solutions.

【AiBase Summary:】

⚡ The training performance of Trillium TPU has increased fourfold, inference throughput has increased threefold, and energy efficiency has improved by 67%.

💡 Trillium TPU supports large-scale AI training, effectively distributing workloads and significantly accelerating training speed.

💰 Training performance per dollar improves 2.5-fold and inference performance per dollar improves 1.4-fold, providing excellent cost-effectiveness.

Details link: https://cloud.google.com/blog/products/compute/trillium-tpu-is-ga

9. Twelve Labs is developing AI capable of analyzing and searching videos

In the digital media era, video content is growing at a remarkable rate, and traditional search and analysis methods cannot keep up with demand. Twelve Labs is applying AI to video understanding, deeply analyzing the actions, objects, and sounds in videos to provide more precise search.

【AiBase Summary:】

🔍 Twelve Labs' AI model can deeply understand video content, going beyond traditional keyword searches.

🤖 The company focuses on video understanding, providing customized video analysis tools suitable for various scenarios.

🌍 While innovating technology, Twelve Labs emphasizes ethics, ensuring the fairness and inclusivity of AI models.

10. xAI vs OpenAI salary comparison: The talent war between Musk and Altman

With the rapid development of the AI industry, the talent competition between xAI and OpenAI is intensifying. Musk has accused OpenAI of attracting talent with high salaries, putting competitors in a difficult position. Analysis shows that OpenAI's salaries are significantly above the industry standard, while xAI's compensation is also competitive.

【AiBase Summary:】

💰 There is a significant salary gap between xAI and OpenAI, with OpenAI's salaries exceeding the industry standard by 87%.

👥 The competition between Musk and Altman is escalating, with xAI having recruited several former OpenAI employees.

⚖️ Musk has accused OpenAI of anti-competitive behavior, with both sides engaging in a battle of wits in the talent war.

11. Former OpenAI algorithm lead starts new company focusing on intelligent companion robots

Jiang Xu, a former senior algorithm lead at OpenAI, has announced the establishment of a new company, "Bright Source Innovation," focused on developing embodied intelligent companion robots. A key contributor to GPT-4, Jiang took part in several critical projects during his time at OpenAI and founded the company after leaving in 2023.

【AiBase Summary:】

🤝 Bright Source Innovation focuses on developing embodied intelligent companion robots aimed at enhancing users' quality of life.

🌍 The company has offices in Shenzhen and Singapore and is actively recruiting talent to drive project progress.

🧠 Bright Source Innovation's robots will be able to perceive, learn from, and interact with their environment, making them suitable for multiple fields.