AI Daily: 1 Second Image Creation! NVIDIA Open Sources Text-to-Image Model Sana; OpenAI Releases Economic Blueprint; Adobe's New AI Tool Edits 10,000 Images with One Click

Welcome to the 【AI Daily】 column, your daily guide to the world of artificial intelligence. Each day we round up the hot topics in AI with a focus on developers, helping you track technology trends and innovative AI product applications.

Discover new AI products: https://top.aibase.com/

1. Shocking Debut! NVIDIA Open Sources Image Generation Model Sana, Generates Images in 1 Second, Supports Chinese, English, and Emojis

NVIDIA has recently open-sourced the image generation model Sana. With only about 600 million parameters (0.6B) in its smallest version, it significantly lowers the barrier to entry while still generating images at resolutions up to 4096×4096. The model can produce a high-quality 1024×1024 image in less than 1 second on a 16GB laptop GPU. Sana combines a deep compression autoencoder with a linear diffusion transformer to improve generation speed and quality, and it accepts prompts in Chinese, English, and emojis.

【AiBase Highlights:】
🌟 Efficient Generation: Sana quickly generates high-quality images at resolutions up to 4096×4096 and can run on ordinary laptop GPUs.
⚙️ Innovative Design: A deep compression autoencoder and a linear diffusion transformer significantly improve generation speed and quality.
🚀 Outstanding Performance: Sana performs well across multiple tests, with throughput significantly higher than other advanced models, supporting rapid content creation.
Details link: https://nv-sana.mit.edu/
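
To see why a more aggressive autoencoder and linear attention can matter for speed, here is a rough back-of-the-envelope sketch. The downsampling factors (8 and 32), the patch sizes, and the helper names are illustrative assumptions rather than figures from this article, and the cost model ignores constant factors.

```python
def latent_tokens(image_side: int, downsample: int, patch: int = 1) -> int:
    """Tokens a diffusion transformer processes for a square image.

    downsample: spatial compression factor of the autoencoder (assumed).
    patch: patch size used when turning the latent grid into tokens (assumed).
    """
    side = image_side // (downsample * patch)
    return side * side


def attention_cost(tokens: int, linear: bool) -> int:
    """Relative attention cost: O(N) if linear, O(N^2) otherwise."""
    return tokens if linear else tokens * tokens


# Hypothetical baseline: 8x autoencoder, patch size 2, quadratic attention.
baseline_tokens = latent_tokens(4096, downsample=8, patch=2)
# Hypothetical compressed setup: 32x autoencoder, patch size 1, linear attention.
compressed_tokens = latent_tokens(4096, downsample=32, patch=1)

print("baseline tokens:  ", baseline_tokens)
print("compressed tokens:", compressed_tokens)
print("relative attention cost ratio:",
      attention_cost(baseline_tokens, linear=False)
      // attention_cost(compressed_tokens, linear=True))
```

Under these assumptions the token count drops fourfold at 4096×4096, and the attention term shrinks far more because it no longer grows quadratically with the number of tokens.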

2. OpenAI Releases AI Economic Blueprint, Calls for Strengthened Regulation and Development in the US

The “Economic Blueprint” recently released by OpenAI lays out policies the company hopes to pursue with the US government and its allies to solidify US technological leadership in AI. The blueprint emphasizes the importance of attracting funding, talent, and resources, and it expresses concern about the current regulatory framework. OpenAI urges the government to increase investment, establish best practices to prevent the misuse of AI models, and give developers flexibility regarding intellectual property.

【AiBase Highlights:】
💰 The US needs to attract billions of dollars in funding to enhance AI competitiveness.
⚖️ OpenAI expresses concerns about conflicts in state legislation and current regulations.
🌱 Recommends increased government investment in new energy and data transmission.

3. Mistral Launches Next-Generation Programming Model Codestral 25.01, Doubling Programming Speed

Mistral recently released Codestral 25.01, the latest version of its open-source code generation model, which generates code about twice as fast as the previous version. The new version performs strongly in Python coding tests, scoring 86.6% on the HumanEval benchmark. The model targets low-latency, high-frequency tasks such as code correction and test generation, which makes it especially relevant for enterprises working with large codebases and large amounts of data.

【AiBase Highlights:】
🌟 Mistral launches Codestral 25.01, doubling code generation speed compared to the previous version.
💻 The model excels in Python coding tests, achieving an 86.6% score on the HumanEval test.
📈 Codestral 25.01 quickly rose to the top in Copilot Arena, garnering widespread attention from developers.
Details link: https://mistral.ai/news/codestral-2501/

4. Tsinghua, Fudan, and Stanford Jointly Open Source “Eko” Framework to Automate Computer Operations with Agents

Recently, research teams from Tsinghua University, Fudan University, and Stanford University jointly released the “Eko” agent development framework, which helps developers quickly build production-ready “virtual employees” using simple code and natural language. Eko can take over a user's computer and browser and complete tedious tasks on the user's behalf, improving work efficiency and reducing manual workload.

【AiBase Highlights:】
🌟 The Eko framework can take over users' computers and browsers, replacing humans in tedious tasks.
🔧 Simplifies the development process through the combination of natural language and programming languages.
🛡️ Allows real-time monitoring and intervention by humans, ensuring the safety and accuracy of automated work.
Details link: https://eko.fellou.ai/

5. Adobe Launches AI-Driven Bulk Create, Allows One-Click Batch Editing of 10,000 Images

Adobe has recently launched a new AI tool, Bulk Create, designed to help creative teams in businesses edit images efficiently. The tool offers batch editing through a web platform, so users do not need to download an application or hold a Photoshop license. Users can change backgrounds and resize images in bulk, and the tool supports brand customization to meet different business needs. It is still in the testing phase but is expected to launch fully soon.

【AiBase Highlights:】
🎨 Bulk Create lets users batch edit images via a web platform, with no desktop application to download and no Photoshop license required.
📏 The tool supports background changes and image resizing, providing preset dimensions for social media, helping users quickly adapt to different platforms.
🚀 Adobe plans to introduce video support features in the future, further enhancing the versatility of Bulk Create.

6. New AI Model LlamaV-o1 Outperforms Claude 3.5 Sonnet in Reasoning Tests

The LlamaV-o1 model, launched by the Mohamed bin Zayed University of Artificial Intelligence in the UAE, sets a new benchmark in multimodal AI, performing especially well on complex reasoning tasks that combine text and images. Because the model shows its reasoning step by step, its outputs are easier to verify, which increases user trust and its value in industries such as healthcare and finance.

【AiBase Highlights:】
🌟 LlamaV-o1 is a newly released AI model proficient in solving complex text and image reasoning tasks.
📊 The model performs excellently in the VRC-Bench benchmark test, providing a transparent step-by-step reasoning process.
🏥 LlamaV-o1 has significant application value in industries like healthcare and finance, enhancing trust and compliance.
Details link: https://mbzuai-oryx.github.io/LlamaV-o1/

7. Research Reveals: Just 0.001% of False Data Can Cause AI Models to Fail

Recent research has revealed how vulnerable large language models (LLMs) are to contaminated training data, especially in the healthcare sector. The study shows that even when just 0.001% of the training data is false information, the model can make significant errors that affect patient safety. The research highlights the risks of using AI tools in medical applications, and the authors urge developers not to use such models for critical medical tasks until their safety can be ensured.

【AiBase Highlights:】
🌐 Research shows that just 0.001% of false information can cause large language models (LLMs) to fail.
🩺 In the medical field, the spread of false information can severely impact patient safety.
💡 Researchers urge that LLMs should not be used for important medical tasks such as diagnosis or treatment until safety is ensured.
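
To put the 0.001% figure in perspective, the short sketch below converts it into absolute counts. The corpus sizes and the function name are hypothetical and chosen only for illustration; they are not taken from the study.

```python
from fractions import Fraction

# 0.001% written exactly: 0.001 / 100 = 1 in 100,000.
POISON_RATE = Fraction(1, 100_000)


def poisoned_items(corpus_size: int, rate: Fraction = POISON_RATE) -> int:
    """Number of false items in a corpus at the given contamination rate."""
    return int(corpus_size * rate)


# Hypothetical corpus sizes (e.g. counted in tokens or documents).
for size in (100_000, 1_000_000_000):
    print(f"{size:,} items -> {poisoned_items(size):,} false items")
```

In other words, one false item per 100,000 is already at the level the study describes, so contamination at this scale would be hard to spot by manual review.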

8. Microsoft Paint App Adds Free AI Eraser Feature, Easily Remove Any Element!

Microsoft has upgraded its classic Paint application with an AI-based eraser feature that simplifies image editing. Users simply circle the elements they want to delete, and the AI automatically recognizes and erases them, replacing what used to be a multi-step manual process. After two months of testing, the feature has now been rolled out to all users, who can get it as a free update from the Microsoft Store.

【AiBase Highlights:】
🌟 AI Eraser Feature: Users can easily delete elements in images by simply circling them.
⏳ Time to Use: Erasing elements may take 40 to 80 seconds, but no dedicated hardware support is required.
🔍 Quality: The effectiveness of the deletion depends on the complexity of the background around the element, and sometimes it may not be ideal.

9. StepFun and Cha Baidao Announce In-Depth Partnership

The in-depth partnership between StepFun (Shanghai Jieyue Xingchen Intelligent Technology Co., Ltd.) and Cha Baidao marks an important step toward the intelligent, digital transformation of the tea beverage industry. With StepFun's large model technology, Cha Baidao's operational efficiency has improved significantly, especially in store self-inspection checks, where it saves a considerable amount of time. The collaboration optimizes store operations and also gives consumers a safer, smarter, and more enjoyable milk tea experience.

【AiBase Highlights:】
🚀 StepFun collaborates with Cha Baidao to explore new models of intelligent inspection and AIGC marketing.
📈 The Step-1V multimodal understanding model has been integrated into thousands of Cha Baidao stores, enhancing operational efficiency.
☕ Intelligent inspections help ensure the safe delivery of tea beverages, improving the consumer experience.

10. GenAI Creative Community Hitems Founded by Douyin's Co-Founder Ren Lifeng Secures Tens of Millions in Funding

Hitems, a creative goods community founded by Douyin's co-founder Ren Lifeng, recently completed tens of millions in Pre-A round funding, reaching a valuation of $150 million. The platform utilizes generative artificial intelligence technology to help designers and creators turn ideas into actual products, forming an active creative exchange community. After the funding, Hitems plans to increase technological investment, promote the application of GenAI and 3D models, and further expand market space.

【AiBase Highlights:】
💰 Hitems successfully completes tens of millions in Pre-A round funding, reaching a valuation of $150 million.
🛍️ Users can quickly turn ideas into products through the platform, creating an active creative exchange community.
🚀 The company plans to increase technological investment, promote the application of GenAI and 3D models, and expand market space.

Chinaz (站长之家)

This article is from AIbase Daily