AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

ByteDance and USTC Join Forces to Break Resolution Limits, Launch Multi-Modal Document Large Model

量子位

Published inAI News · 1 min read · Dec 4, 2023

ByteDance has collaborated with the University of Science and Technology of China for the first time to launch the high-resolution multi-modal document large model, DocPedia. The model has been uploaded to arXiv, solving the problem of previous models being unable to parse high-resolution document images. With a resolution of 2560×2560, DocPedia exhibits significant superiority in areas such as image-text understanding and visual question answering. The model enhances performance by addressing resolution issues through the frequency domain, creating a new technical breakthrough.

ByteDance Multi-Modal Document Large Model DocPedia

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

ByteDance Unveils DreamActor-M1: Replicating RunwayML Act Functionality and Pushing Animation Generation Boundaries

ByteDance recently announced its latest AI project, DreamActor-M1, a cutting-edge advancement in video generation technology. This model seamlessly replaces a person from a still image into a video scene using a reference video, generating dynamic imagery with fine-grained expressions, natural movements, and high-definition quality. This launch marks another breakthrough for ByteDance in generative AI and is seen as a challenge to existing animation generation tools (like RunwayML).

Apr 3, 2025

1.3k

ByteDance Unveils DreamActor-M1 Project, Challenging Runway Act-One's AI Character Animation Technology

ByteDance recently launched its new AI project, DreamActor-M1. This project aims to replicate the functionality of Runway Act-One, utilizing advanced generative AI technology to transform character performances in videos into virtual animations with improved accuracy and expressiveness. This news has quickly garnered widespread attention from the industry and netizens, seen as another significant step forward for ByteDance in the AI video generation field. Technological Breakthrough: Ambition to Surpass Runway Act-One. According to publicly available information, Drea...

Apr 3, 2025

1.4k

ByteDance Releases MegaTTS3 on Hugging Face: A Breakthrough in Lightweight Speech Synthesis

Beijing—ByteDance recently released its latest text-to-speech (TTS) model, MegaTTS3, on the Hugging Face open-source AI community. This release has quickly garnered attention from AI researchers and developers worldwide due to its breakthroughs in lightweight design and multilingual support. Based on community feedback and official information, MegaTTS3 is hailed as a significant advancement in speech synthesis. MegaTTS3's core highlights are...

Apr 3, 2025

230

Former ByteDance AI Expert Joins Chixun Intelligent, Boosting Embodied AI Development

News is spreading in the AI field that Jie Junyuan, a former AI expert from ByteDance, has officially joined Chixun Intelligent, an embodied AI startup, as the head of its embodied intelligence department. This change not only injects strong momentum into Chixun Intelligent's technical team but also paves the way for the future development of embodied AI. Jie Junyuan is a highly respected figure in the field of artificial intelligence. He graduated from the University of Science and Technology of China and received his doctorate from the University of Washington. He has published papers at several top academic conferences, and these papers...

Mar 21, 2025

110

ByteDance's InfiniteYou (InfU): AI Image Generation Framework Preserving Facial Features Across Diverse Scenes

ByteDance has quietly launched an image generation tool called InfiniteYou (InfU). Simply put, it's a text-to-image generation model capable of producing high-quality images incorporating your personal identity features based on your text input. Unlike simple face-swap apps, it excels at precisely preserving your identity while flexibly changing scenes and content. Imagine easily generating images of yourself walking on the moon in a spacesuit, or dressed in ancient Chinese garb...

Mar 21, 2025

960

Kai-Fu Lee Predicts: China's Large Language Model Market May Narrow to Three Giants: DeepSeek, Alibaba, and ByteDance

Mar 21, 2025

200

Bytedance's Doubao Large Model Team Holds All-Hands Meeting, Exploring New Heights in AI

Amidst the booming development of the artificial intelligence field, Bytedance's Doubao large model team (Seed) recently held an all-hands meeting, marking a significant decision regarding the team's future direction. The meeting was co-hosted by Zhu Wenjia and the newly appointed head of AI fundamental research, Wu Yonghui. This was the first time the two leaders appeared together, attracting widespread attention from the industry. At the meeting, Zhu Wenjia and Wu Yonghui clearly stated that the Seed team's primary goal is to "explore the upper limit of intelligence," which will serve as the core guidance for the team's future work. They pointed out that exploring...

Mar 19, 2025

450

Beyond One-Shot Wonders: ByteDance's LCT Technology Enables AI Filmmaking

ByteDance's innovative LCT technology allows AI to direct and shoot cinematic masterpieces, breaking the limitations of traditional one-shot video production.

Mar 18, 2025

210

AI Daily: Man Sentenced to 10 Months for AI-Generated Pornographic Novels; 360's Zhi Nao Team Replicates DeepSeek Reinforcement Learning Results; ByteDance's SeedFoley AI Sound Effects Generation Model Launches

Welcome to the AI Daily column! Your daily guide to exploring the world of artificial intelligence. We present you with the hottest AI news, focusing on developers and helping you understand technology trends and innovative AI product applications. Discover new AI products here: https://top.aibase.com/1. A man was sentenced to ten months in prison for using AI to write and profit from pornographic novels, illegally earning over 20,000 yuan. Dayaze People's Court in Hubei Province recently ruled on a case involving the use of artificial intelligence to write and profit from pornographic novels.

Mar 14, 2025

Say Goodbye to Silent Video Awkwardness! ByteDance's AI Sound Effect Generation Model, SeedFoley, Launches, Instantly Creating Cinematic Sound Effects

Still struggling with sound effects for your short videos? Still searching for the perfect BGM that always falls short? Now, ByteDance is releasing a blockbuster AI technology that breaks the final silent spell of video creation! Their newly launched SeedFoley sound effect generation model is like injecting life into your videos. With just one click, it intelligently matches professional-level sound effects to your videos, instantly transforming your work from a silent film into a vibrant cinematic experience. The effect is stunning! Even more exciting, this AI sound effect technology has quickly launched on ByteDance's platform.

Mar 13, 2025

290