Welcome to the AI Daily column! This is your daily guide to exploring the world of artificial intelligence, where we bring you the hottest content in the AI field every day, focusing on developers to help you understand technical trends and innovative AI product applications.

Discover the latest AI products here: https://top.aibase.com/

1. Fish Speech: A Low-Memory Open-Source Text-to-Speech Model Comparable to GPT-SoVITS

Fish Speech, developed by fishaudio, is a new text-to-speech tool that supports Chinese, English, and Japanese, with speech processing approaching human-level quality. It uses the Flash-Attn algorithm for large-scale data processing, providing an efficient, accurate, and stable TTS experience.

AiBase Summary:

😊 Perfectly supports Chinese, English, and Japanese, with speech processing approaching human-level quality.

😊 Supports voice cloning; just provide a reference voice to quickly complete the cloning process.

😊 Extremely low memory requirement, only 4GB, supporting various voice generation models.

Online experience: https://top.aibase.com/tool/fish-audiowenbenzhuanyuyin

Details: https://www.aibase.com/news/9979

2. Meta 3D Gen Released: Rapidly Generate 3D Assets from Text in Under a Minute

Researchers have recently released a new technology called Meta3D Gen (3DGen) that can quickly generate 3D assets from text in less than a minute, providing users with high-quality textures and material experiences. The technology integrates Meta3D AssetGen and Meta3D TextureGen, offering an efficient way to create 3D models, which is three to ten times faster than existing solutions.

image.png

AiBase Summary:

🚀 Meta 3D Gen system can create high-quality 3D assets in less than a minute.

💡 Meta3D Gen integrates Meta3D AssetGen and Meta3DTextureGen, two core technologies.

✨ AssetGen supports the generation of physically based rendering materials with realistic relighting effects.

Details: https://ai.meta.com/research/publications/meta-3d-gen/

3. Microsoft Open-Sources GraphRAG to Build Knowledge Graphs to Enhance Large Models' Question Answering, Reasoning, and More

Microsoft's latest open-source GraphRAG system uses entity knowledge graphs to enhance large models' search, question answering, summarization, reasoning, and other capabilities, especially for large datasets. By constructing a global entity knowledge graph, GraphRAG can capture complex connections and interactions in text, improving retrieval accuracy and comprehensiveness. Additionally, GraphRAG has low token requirements, saving development costs. It has performed excellently in comprehensive tests and is one of the best RAG methods currently available.

AiBase Summary:

💡 GraphRAG enhances large models' search, question answering, summarization, reasoning, and other capabilities by constructing an entity knowledge graph, especially for large datasets.

💡 The core of GraphRAG includes two steps: constructing an entity knowledge graph and generating community summaries. By extracting relevant information from community summaries, it generates more comprehensive and accurate answers.

💡 GraphRAG has very low token requirements, helping developers save costs. It has performed excellently in comprehensive tests and is one of the best RAG methods currently available.

Details: https://top.aibase.com/tool/graphrag

4. Microsoft Launches Designer Tool: Create Personalized Greeting Cards with Just a Sentence

Microsoft's latest launch of the Microsoft Designer "Greeting Cards" feature brings an unprecedented personalized greeting card creation experience, showcasing the practical application of AI technology in daily life.

image.png

AiBase Summary:

🎨 Text-to-design: Users input a simple description, and AI transforms it into a unique greeting card design.

🖼️ AI-generated images: Greeting card designs are inspired by user descriptions and generated by AI into detailed images.

✏️ Editable content: The inside of the greeting card provides editable text to meet users' personalization needs.

Details: https://designer.microsoft.com/

5. Tencent Launches AI Translation Company TRANSAGENTS

TRANSAGENTS, developed by Tencent AI Lab, is a multi-agent virtual translation publishing company specifically for literary translation. By simulating the virtual role collaboration mode of a real translation company, it achieves smooth and efficient translation of literary works. Using TRANSAGENTS for literary translation is 80 times cheaper than professional human translators and outperforms human translation in domain-specific knowledge requirements. The platform demonstrates the potential of AI technology in the field of literary translation, providing new possibilities for literary creation and dissemination.

image.png

AiBase Summary:

🔑 TRANSAGENTS is a multi-agent virtual translation publishing company, specifically designed for ultra-long literary content translation, simulating the role collaboration mode of a real translation company.

💰 Using TRANSAGENTS for literary translation is 80 times cheaper than professional human translators, reducing translation costs and promoting the dissemination of excellent literary works.

🌟 TRANSAGENTS outperforms human translation in domain-specific knowledge requirements and is favored by human evaluators and advanced language models.

Details: https://top.aibase.com/tool/transagents

6. Suno Launches iOS Client for Voice-Generated Music

Suno's iOS app turns your phone into a virtual music studio, leading a revolution in music production and potentially changing the way we express creativity in the digital age. Facing legal challenges, Suno insists that the technology is designed to generate entirely new works. Suno's iOS app represents a significant step forward for AI-generated music, leading the future trend of the music industry.

AiBase Summary:

🎵 Music studio on your phone: Users input text prompts or hum melodies to generate complete songs, meeting the needs of various music styles.

⚖️ Legal challenges and firm stance: Facing lawsuits from record companies, Suno insists that AI generates entirely new works, and the outcome of the legal battle may affect the development of the AI music industry.

🔮 Future outlook for AI music: The boundaries between AI and human music creation are blurring, raising profound questions about creativity and the future of the music industry.

7. Apple Executive Joins OpenAI Board as Observer

I believe this article reports that Apple executive Phil Schiller has joined the OpenAI board as an observer. This will allow Apple to gain a deeper understanding of OpenAI's internal operations and potentially integrate ChatGPT into iOS and macOS to enhance Siri's intelligence. Microsoft has also joined the OpenAI board, making the collaboration more complex.

AiBase Summary:

🍏 Apple executive Phil Schiller joins the OpenAI board in an observer role, which helps deepen the understanding of OpenAI.

🤖 Schiller's presence on the board will facilitate the integration of ChatGPT into iOS and macOS, enhancing Siri's intelligence.

🔗 Microsoft also joins the OpenAI board as a non-voting observer, making the OpenAI board more complex.

8. AI-Generated Panda Eating Instant Noodles Video on Douyin Receives Over 420,000 Likes, Netizens Call It Incredibly Realistic

Recently, AI-generated video technology on Douyin has reached new heights, with videos of pandas and cats using chopsticks to eat instant noodles that are unbelievably realistic. Although there are flaws, future AI videos will be even more lifelike.

QQ Screenshot 20240703114243.jpg

AiBase Summary:

🐼 The realism of the video is astonishing, sparking heated discussions among netizens.

😺 AI technology is widely applied in the video production field, bringing new experiences to creators and audiences.

💻 The competition among domestic and international video big models is intense, with AI-generated video clips ranking 26th on Douyin's challenge list.

Details: https://www.aibase.com/news/9993

9. A Million Netizens Watch as a User Connects GPT-4V to His Home Camera

A foreign user connected GPT-4Vision to his home camera, attracting millions of netizens to watch. This behavior demonstrates the potential of AI technology in daily life but also sparks discussions about privacy and security. With the development of technology, we look forward to more innovative and secure applications.

image.png

AiBase Summary:

👀 GPT-4Vision connected to a home camera attracts millions of netizens to watch.

🔒 Sparks discussions about privacy and security, reminding people to pay attention to personal information protection.

💡 Demonstrates the potential of AI technology in daily life, inspiring people to think about technology applications.

Details: https://www.aibase.com/news/9995

10. Angry! Scottish Artist "Destroys" His Work to Protest the Negative Impact of AI on Art

Scottish artist Michael Forbes has expressed his protest against the negative impact of artificial intelligence (AI) on the art field by smearing his own artworks. Forbes has "edited" four paintings, including works featuring John Lennon and American singer Taylor Swift. He hopes his actions will raise awareness of the infringement issues caused by AI in the art field. Artists can no longer compete with computer-generated images, leading many to abandon their careers as artists.

image.png

AiBase Summary:

⭐ Scottish artist Michael Forbes protests the negative impact of AI on the art field by smearing his own artworks.

⭐ Forbes has "edited" four paintings, including works featuring John Lennon and Taylor Swift, hoping to raise awareness of AI's infringement in the art field.

⭐ Artists can no longer compete with computer-generated images, leading many to abandon their careers as artists.