AI Daily: New Y Video Model 2.0 Released; LivePortrait Supports Animation Control with Images; OpenAI Launches GPT-4o Model Fine-Tuning Feature; Free Watermark-Free! AI Video Hotshot Can Generate Up to 10 Seconds

Welcome to the AI Daily section! This is your daily guide to exploring the world of artificial intelligence, where we present the hottest topics in the AI field every day, focusing on developers, helping you understand technological trends, and discover innovative AI product applications.

Fresh AI Products Click to Explore: https://top.aibase.com/

1. New OneVideo Large Model 2.0 Released: Supports 1080P60fps Output, Up to 4K

The latest release of OneVideo Large Model 2.0 by OneTech has made significant breakthroughs in AI video creation, achieving full-process automated creation, lowering the barriers and costs of video production. The technology integrates advanced AI algorithms and deep learning techniques, offering a convenient one-click trigger function. It also features self-developed script models, emotional voice synthesis technology, and automatic background music generation capabilities.

AiBase Highlights:
⚙️ OneVideo Large Model 2.0 achieves full-process automated creation, reducing the barriers and costs of video production.
💡 The technology integrates advanced AI algorithms and deep learning techniques, offering a convenient one-click trigger function.
🎬 OneVideo Large Model 2.0 features self-developed script models, emotional voice synthesis technology, and automatic background music generation capabilities.
Details Link: https://aigc.yizhentv.com/product/aiVideo

2. OpenAI Launches GPT-4o Model Fine-tuning Feature, Offering 1 Million Free Tokens Daily!

OpenAI has introduced a new multimodal large model, GPT-4o, allowing third-party developers to fine-tune it to meet different application needs. Developers can easily select the model version on the fine-tuning dashboard and receive 1 million free tokens daily for fine-tuning. OpenAI also emphasizes data security and privacy protection to ensure that fine-tuned models do not misuse corporate data.

AiBase Highlights:
🌟 Fine-tuning feature launched: Developers can adjust the behavior of the GPT-4o model based on their needs.
💰 Free Tokens Giveaway: 1 million free tokens daily for model fine-tuning, attracting many developers.
🔒 Data Security Assurance: OpenAI prioritizes data privacy and security, ensuring that fine-tuned models do not use input-output data for retraining.
Details Link: https://platform.openai.com/finetune

3. Another AI Video Tool Makes a Strong Debut! Hotshot Can Generate Up to 10 Seconds of Video, No Watermark

Hotshot is a brand-new text-to-video AI generator that can produce up to 10 seconds of 720p video, showing great potential. Users can experience the early preview version of the model for free, but they are limited to generating two watermark-free videos per day. The founding team completed the model training in just four months, using 600 million video clips and thousands of GPUs. It is expected that AI-generated full YouTube videos will become popular in the future, giving creators more control.

AiBase Highlights:
🌟 Hotshot's new text-to-video AI generator is now in public "early preview," allowing users to experience it for free.
🚀 The model was trained in just four months using 600 million video clips and thousands of GPUs, showing great potential.
🎥 Founder Sastry predicts that within a year, AI-generated full YouTube videos will become widespread, giving creators more control.
Details Link: https://top.aibase.com/tool/hotshot

4. LivePortrait Update: Supports Image-Driven Portrait Animation and Fine-Grained Area Control

LivePortrait's Gradio tool has received a series of exciting updates, allowing users to now use their own images to drive portrait animations and finely select animation areas. The new features enhance the convenience and creative freedom of animation production while protecting privacy information. LivePortrait's core advantage lies in its amazing expression transfer technology, which can create lifelike dynamic effects.

AiBase Highlights:
🚀 Users can use their own images to drive portrait animations and finely select animation areas.
🎭 New relative motion function protects privacy but may affect expression intensity.
💡 LivePortrait can accurately copy expressions to another person, providing unprecedented creative freedom.
Details Link: https://top.aibase.com/tool/liveportrait

5. AI Instant Image Editing Tool TurboEdit - Change Hair Color, Age, and Outfit with a Sentence!

TurboEdit is a text-based instant image editing tool that allows users to quickly edit images through simple text descriptions. The editing speed is extremely fast, supporting instant feedback and interactive editing, enabling users to see the editing effects in real-time. Whether you are a professional designer or an ordinary user, you can easily realize creative ideas through TurboEdit.

AiBase Highlights:
✨ Edit images quickly with a sentence description, achieving instant hair color change, aging, and outfit change.
💡 TurboEdit can modify only the specified parts while maintaining the overall image, allowing users to adjust any area of the image at will.
🚀 TurboEdit supports simultaneous modification of multiple attributes of the image, including color, clothing, style, etc., allowing creativity to extend infinitely.
Details Link: https://betterze.github.io/TurboEdit/

6. AI Dance King Viggle: One Click to Make Musk and Trump Dance, Monthly Visits Surpass 6.8 Million

Musk has once again demonstrated his status as the king of traffic on the internet, with the video released through the Viggle AI tool going viral, quickly surpassing 130 million views. The template-based AI video generation tool of Viggle is simple yet powerful, allowing ordinary users to produce professional-level videos, with monthly visits exceeding 6.8 million, marking a milestone in the application of AI technology in daily life.

AiBase Highlights:
🌟 Viggle AI allows users to easily generate smooth and natural dance videos by simply uploading photos and selecting action templates.
🚀 The multi-character control feature, Multi, allows users to control two characters simultaneously, inspiring creative ideas and quickly spreading secondary creation videos.
💡 The template-based operation of Viggle AI lowers the creation threshold, allowing ordinary users to also create professional-level videos, similar to the successful paths of Jianying and CapCut.
Product Entry: https://top.aibase.com/tool/viggle

7. Born for Complex Visual Reasoning! Microsoft Releases Phi-3.5-vision

Microsoft has recently released Phi-3.5-vision, a lightweight, multimodal open-source AI model designed specifically for processing text and visual inputs. Phi-3.5-vision performs exceptionally well in resource-constrained environments, supporting a 128K context length, and is suitable for commercial and research fields. The model features extensive image understanding, OCR, chart, and table parsing capabilities, showing significant performance improvements in benchmark tests.

AiBase Highlights:
🔍 Phi-3.5-vision is a lightweight, multimodal AI model suitable for processing text and visual inputs.
💡 The model supports a 128K context length, performing excellently in environments with limited memory or computational resources.
🚀 Phi-3.5-vision features image understanding, OCR, chart, and table parsing capabilities, showing significant performance improvements.
Details Link: https://huggingface.co/microsoft/Phi-3.5-vision-instruct

8. ByteDance's Automatic Speech Recognition Model Seed-ASR, Can Understand All Accents and Dialects!

Seed-ASR is ByteDance's speech recognition engine, trained with a large amount of data, featuring excellent recognition capabilities and contextual awareness, accurately identifying multiple languages, dialects, and accents, bringing new possibilities for cross-language communication. It performs well in various scenarios, enhancing user experience, especially in smart assistants and voice search fields.

AiBase Highlights:
🔍 Seed-ASR has been trained with over 20 million hours of speech data and 900,000 hours of paired data, accurately recognizing 13 Chinese dialects and 7 foreign languages, including English with various accents.
🔑 Seed-ASR has excellent contextual awareness capabilities, combining historical dialogue records and meeting minutes to enhance recognition accuracy, especially in specific scenarios.
🎯 Seed-ASR can recognize various professional terms, including medical, technological, automotive, and music fields, significantly improving the efficiency and accuracy of smart assistants and voice searches.
Details Link: https://bytedancespeech.github.io/seedasr_tech_report/

9. Llama3 Compressed Version! Nvidia Launches Small Language Model Llama-3.1-Minitron4B with Only 400 Million Parameters

In the era where tech companies are striving to achieve artificial intelligence, Nvidia has launched Llama-3.1-Minitron4B, which uses pruning and distillation techniques, is highly efficient, and has excellent training and deployment efficiency.

AiBase Highlights:
🌟 Llama-3.1-Minitron4B is Nvidia's small language model, with efficient training and deployment.
📈 The amount of tokens used is reduced by 40 times, with significant performance improvement.
🔓 The width pruning version has been released on Hugging Face, facilitating commercial use and development.
Details Link: https://developer.nvidia.com/blog/how-to-prune-and-distill-llama-3-1-8b-to-an-nvidia-llama-3-1-minitron-4b-model/

10. OpenAI and Condé Nast Reach a Multi-Year Content Partnership

Recently, OpenAI and Condé Nast have reached a multi-year partnership agreement to explore the display of Condé Nast's brand content in OpenAI's AI products. This partnership marks the close cooperation between digital content and artificial intelligence fields, bringing users richer search experiences and high-quality reporting.

AiBase Highlights:
🌟 OpenAI and Condé Nast have reached a multi-year partnership, with content embedded in AI products.
📰 OpenAI has access to a large number of publisher text archives for training large language models.
⚖️ Some media companies choose to sue OpenAI to protect their rights.

11. Crackdown on AI-Generated Fake Reviews! The U.S. Government Takes Strong Action to Ban False AI-Generated Reviews

Recently, the U.S. Federal Trade Commission (FTC) has taken significant measures to completely ban false AI-generated reviews and recommendations. This new regulation aims to combat dishonest behaviors in online reviews, protect consumer rights, and maintain a fair competitive market environment. FTC Chair Lina Khan stated that false reviews waste time and money, pollute the market, and divert attention from honest competitors. President Biden supports this move, emphasizing that consumers should trust customer reviews.

AiBase Highlights:
🔍 The FTC has decided to completely ban false AI-generated reviews to protect consumer rights and maintain a fair competitive market environment.
📰 Many well-known media outlets have published product reviews by fictitious authors, further exacerbating the falsity of reviews and causing consumer concerns.
💼 The new regulation allows the FTC to pursue civil liability against违规companies, strengthening the regulation of the e-commerce environment and maintaining market order.

12. Qualcomm Snapdragon 7s Gen3 Released

AI News

AI Daily

AI Timeline

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

AI Daily: New Y Video Model 2.0 Released; LivePortrait Supports Animation Control with Images; OpenAI Launches GPT-4o Model Fine-Tuning Feature; Free Watermark-Free! AI Video Hotshot Can Generate Up to 10 Seconds

站长之家

This article is from AIbase Daily

AI News Recommendations

AI Daily: Alibaba's Qwen3 Model Imminent; GitHub Opensources MCP Server; Runway Releases Gen-4 Turbo

AI Daily: China's AI Investment to Exceed $100 Billion by 2028; OpenRouter Releases Free Model Quasar Alpha; Midjourney V7 Launches Major Update

AI Daily: Dream 3.0 Internal Testing Generates 2K Commercial Posters; ChatGPT Updates Image Generation Capabilities; Ele.me Introduces AI-Powered Smart Managers

AI Daily: Alibaba's Qwen Tops Global Open-Source Model Ranking; MiniMax Launches Speech-02; ChatGPT Paid Users Surge to 20 Million

AI Daily: Runway Launches New Video Model Gen-4; Unitree G1 Sells Over One Million in 5-Minute Livestream; OpenAI to Open-Source New Model

AI Daily: Zhipu Releases Agent Product AutoGLM-Thinking; Google Gemini 2.5 Pro Opens for Free Use; ChatGPT's Native Image Generation Rolls Out to Free Users

AI Daily: Alibaba's New Visual Reasoning Model QVQ-Max; KeLing AI Adds New AI Sound Effects; GPT-4o Performance Soars After Upgrade; Midjourney V7 to Launch Next Week

AI Daily: Taobao Launches AI Fight Against Fake Images; OpenAI Announces Support for MCP Protocol; Alibaba Open-Sources Multimodal Model Qwen2.5-Omni

AI Daily: OpenAI's New Image Generation Model Can Create Images from a Single Sentence; Keling AI Revenue Exceeds 100 Million; Google Launches Gemini 2.5, Its Most Powerful Reasoning Large Language Model

AI Daily: DeepSeek-V3-0324 Quietly Released; Alibaba Cloud Launches Large-Scale AI Campus Recruitment; WeChat Shops Ban AI Business Courses