AI Daily: New Models to Launch? Sam Altman Sparks Debate with Strawberry Photo; Meitu Releases Pro Version of Meitu Cloud Repair; ComfyUI Now Supports Tencent's Hunyuan DiT and Flux Models

Welcome to the AI Daily section! This is your daily guide to exploring the world of artificial intelligence. Each day, we bring you the hottest topics in the AI field, focusing on developers to help you understand technological trends and discover innovative AI product applications.

Explore fresh AI products by clicking here: https://top.aibase.com/

1. Sam Altman's Strawberry Photo Sparks Speculation About New OpenAI Model "Strawberry"

Sam Altman's social media post about a summer garden has sparked speculation about a new model named "Strawberry." The internet is abuzz with anticipation for the potential breakthrough of the Strawberry project.

AiBase Highlights:

🍓 Altman's strawberry-related photo has sparked speculation and discussion.

🗣️ A new model called "Anonymous Chatbot" reportedly shows stronger reasoning than existing models and may be related to the "Strawberry" project.

🚀 The "Strawberry" project aims to equip AI with autonomous internet search and deep research capabilities, considered a potential breakthrough.

2. Baidu Netdisk Launches AI Photo Editing Solution for the Photography Industry

Baidu Netdisk introduced a solution for the photography industry in August 2024 that integrates storage backup, AI photo editing, and efficient delivery to help studios improve efficiency, reduce costs, and grow their business. It is a one-stop service that addresses studios' management, efficiency, and cost issues.

AiBase Highlights:

⚙️ One-stop service: storage backup, internal collaboration, AI photo editing, and one-click delivery, enhancing studio management efficiency.

💡 Advantages for chain studios: improved internal collaboration efficiency, categorized storage of client photos, and streamlined photo workflow.

🔬 Baidu Cloud Engine technology: 9 human analysis capabilities, 86 beautification features, 1000+ visual technology patents, offering personalized AI photo editing services.

Details: https://www.wjx.cn/vm/hMDEeN7.aspx

3. Meitu Launches Meitu Cloud Pro Version with AI Batch Color Adjustment and AI Batch Retouching Features

Meitu Cloud Pro Version, a Meitu product, introduces AI batch color adjustment and AI batch retouching, providing a comprehensive retouching solution for the commercial photography industry. Its AI workflow automates the process from conversion to delivery, significantly improving efficiency.

AiBase Highlights:

✨ AI batch color adjustment and AI batch retouching features enhance retouching efficiency.

💡 Smart retouching API service supports immediate upload, retouch, and use.

🚀 Meitu Cloud helps studios transform their business model efficiently and cut costs.

4. 360 AI Enterprise Browser Upgrade Supports AI Search, Office Assistant, and AI App Store

360 Enterprise Security Browser offers businesses a secure office solution that combines smart office and security functions and supports flexible deployment to meet diverse needs. It provides an AI office assistant, 360AI search, and document and video analysis applications, creating an efficient office environment with comprehensive security protection.

AiBase Highlights:

⚙️ Smart office: integrates 360AI search, AI office assistant, and AI app store, enhancing work efficiency.

🔒 Comprehensive security protection: provides multi-layer protection measures, including browser native security, web data security, and user behavior security.

🚀 Aggregated applications: offers high-quality development guarantees, unified access, and cross-platform compatibility, strengthening security and simplifying configuration processes.

Details: https://top.aibase.com/tool/360-qiyeanquanliulanqi

5. Tencent Hunyuan Large Model: Ranked First in "Image-to-Text" Multimodal Understanding Among Domestic Large Models

Tencent Hunyuan Large Model ranked first among domestic large models in the August SuperCLUE-V evaluation, demonstrating outstanding performance in multimodal understanding. The evaluation tests image recognition accuracy and real-world understanding in depth, and Hunyuan showed an overall lead.

AiBase Highlights:

🏆 Tencent Hunyuan Large Model ranked first among domestic large models, showcasing comprehensive advantages.

🔍 The evaluation results show that Tencent Hunyuan Large Model excels in multimodal understanding foundations and application capabilities.

💡 Tencent Hunyuan Large Model has expanded to a trillion-level parameter scale and uses an MoE structure, giving it leading multimodal understanding capabilities among domestic models.

6. Comfy Org Makes Significant Progress: ComfyUI Now Supports Tencent Hunyuan DiT and Flux Models

Comfy Org has recently made significant progress, introducing new model support and technical upgrades and strengthening the core execution engine. The updates reflect a commitment to technological innovation and user experience, and they make ComfyUI more reliable and powerful.

AiBase Highlights:

🚀 New model support: the Flux integration comes with example workflows and model download links, significantly enhancing AI image generation capabilities.

🔥 Hunyuan DiT model support broadens ComfyUI's multilingual capabilities; the model performs especially well at understanding Chinese prompts.

💡 Frontend technology upgrades will bring a stronger and more maintainable codebase, supporting rapid development of new frontend features.

Details: https://blog.comfy.org/august-2024-flux-support-new-frontend-for-loops-and-more/

7. Reddit User Tests: GPT-4o Beats Gemini 1.5 Pro in Chess

In a recent experiment, Reddit user @zefman set up a platform for different language models to play chess in real-time, with GPT-4o emerging as the strongest contender. The experiment showcased the thinking processes of different models, providing an engaging interactive experience.

AiBase Highlights:

🌟 GPT-4o performed exceptionally well in chess matches, emerging as the strongest language model tested.

♟️ The experiment allowed different models to play chess in real-time, showcasing their thinking processes.

🔄 Weaker models sometimes chose incorrect moves, but the experiment let them choose again, which kept the game moving.
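
The reselect mechanism described above can be illustrated with a minimal sketch. This is not @zefman's code: the function name `pick_legal_move`, the `ask_model` callback, the move strings, and the retry limit are all hypothetical, and the list of legal moves is assumed to come from a chess engine or library of your choice.

```python
from typing import Callable, Iterable, List, Optional


def pick_legal_move(
    ask_model: Callable[[str, List[str]], str],
    position: str,
    legal_moves: Iterable[str],
    max_attempts: int = 3,
) -> Optional[str]:
    """Ask a model for a move and re-ask when the reply is not legal.

    ask_model is a hypothetical callback: it receives the position and the
    moves already rejected, and returns the model's next suggestion.
    """
    legal = set(legal_moves)
    rejected: List[str] = []
    for _ in range(max_attempts):
        move = ask_model(position, rejected).strip()
        if move in legal:
            return move
        # Remember the bad move so the next prompt can tell the model to avoid it.
        rejected.append(move)
    return None  # caller decides what to do, e.g. forfeit or play a fallback move


def flaky_model(position: str, rejected: List[str]) -> str:
    # Stand-in for a real model call: first an illegal move, then a legal one.
    return "e2e5" if not rejected else "e2e4"


print(pick_legal_move(flaky_model, "startpos", ["e2e4", "d2d4"]))  # prints e2e4
```

Passing the rejected moves back to the model is one possible design; an implementation could equally just repeat the same prompt.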

8. New Method for Panoramic Image Generation, PanoFree: Generating Multi-View Images Without Fine-Tuning

PanoFree is a multi-view image generation technique that requires no fine-tuning. It addresses consistency and artifact issues through iterative warping and inpainting, improves time efficiency and memory usage, and yields more diverse results.

AiBase Highlights:

🌟 Multi-view image generation method without fine-tuning

🚀 Solves consistency and artifact issues through iterative warping and inpainting

💡 Significantly improves time efficiency and memory usage, with more diverse results

Details: https://top.aibase.com/tool/panofree

9. ExAvatar: Cloning Human Figures from Short Videos and Converting Them into 3D Digital Avatars

ExAvatar, developed jointly by DGIST and Meta's Codec Avatars Lab, can capture movements and expressions from videos and convert them into lifelike 3D digital avatars. This technology addresses challenges from previous techniques, enhancing the naturalness and rendering effects of animations.

AiBase Highlights:

🌟 Full-body 3D animation: supports comprehensive body, hand, and facial animation, generating various poses and expressions.

💡 Hybrid representation: combines 3D Gaussians and surface meshes, ensuring geometric and appearance consistency, reducing artifacts.

🚀 High-quality rendering: utilizes advanced algorithms and techniques, achieving high-quality dynamic performance and rendering effects.

Details: https://top.aibase.com/tool/exavatar

10. Mistral AI Launches New Development Tools Allowing Users to Optimize and Build Intelligent Agents

Mistral AI's latest development tools give users and developers more powerful and flexible ways to optimize and apply AI models, and have attracted widespread attention. Users can fine-tune models through La Plateforme and build intelligent agents on the Agents platform, and the new version of the SDK supports Python and TypeScript, providing more options and flexibility.

AiBase Highlights:

✨ Users can fine-tune models through La Plateforme, better utilizing data for optimization.

🔧 The Agents platform helps users adjust models in detail, building intelligent agents.

🚀 The new version of the SDK supports Python and TypeScript, making integration and usage more convenient.

11. Napkin: Easily Convert Text into Visual Graphics Using AI

In an era of information explosion, Napkin is a visualization platform using AI technology that can convert text into various visual graphics, helping users express ideas and creativity more easily. Despite its innovative potential, it faces some challenges and areas for improvement.

AiBase Highlights:

🧠 Visualization platform using AI technology to help users convert text into various visual graphics.

🚀 Offers customization features, allowing users to adjust icons, colors, fonts, and export in various file formats or as URL links.

⚙️ Needs further optimization of AI's ability to handle ambiguous content, enhancing visual design and personalization.

Details: https://top.aibase.com/tool/napkin-ai

12. OpenAI ChatGPT App Revenue Hits New High in July: Net Revenue of $28 Million

OpenAI's ChatGPT mobile app set a new monthly revenue record in July, with net revenue of $28 million, driven primarily by the introduction of GPT-4o (the "omni" model). GPT-4o added the ability to handle text, voice, and video, with faster response times and more natural AI interactions.

AiBase Highlights:

💰 ChatGPT app net revenue reached $28 million in July, a 40% increase from May.

📱 Apple App Store contributed 83% of the revenue, a 20% increase from June.

🚀 GPT-4o (the "omni" model) brings new capabilities to handle text, voice, and video to ChatGPT, offering faster response times and more natural user interactions.
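
As a rough sanity check on the figures above, the short calculation below derives what they imply. It assumes that the 40% increase is measured against May's net figure and that the 83% App Store share applies to the full $28 million; the variable names are illustrative, and the derived numbers are back-of-the-envelope estimates, not reported data.

```python
# Figures taken from the highlights above (in millions of USD).
july_net_musd = 28.0
growth_since_may = 0.40   # "a 40% increase from May"
app_store_share = 0.83    # "Apple App Store contributed 83%"

# Implied May figure: July = May * (1 + growth), so May = July / (1 + growth).
implied_may_musd = july_net_musd / (1 + growth_since_may)

# Implied split of the July figure between the App Store and everything else.
app_store_musd = july_net_musd * app_store_share
other_musd = july_net_musd - app_store_musd

print(f"Implied May figure: about ${implied_may_musd:.1f}M")
print(f"App Store share of July: about ${app_store_musd:.2f}M")
print(f"Remainder from other sources: about ${other_musd:.2f}M")
```

Under these assumptions, May comes out at about $20 million, and the App Store accounts for about $23.2 million of July's total.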