At today's 2024 Volcano Engine AI Innovation Tour, ByteDance introduced the Doubao·Music Model and the Doubao·Simultaneous Interpretation Model alongside its video generation model. It also announced significant upgrades to the Doubao General Model Pro, the Text-to-Image Model, the Voice Synthesis Model, and other specialized models.

The launch of the Doubao·Music Model signals Volcano Engine's deep commitment to music creation. Backed by powerful algorithms, the model gives creators the freedom to produce high-quality music. For lyric generation, it quickly produces emotionally precise, evocative lyrics from just a few words of input. For melody creation, it offers more than ten music styles and emotional expression options to meet the diverse needs of creators.

In addition, the model draws on Doubao's advanced voice synthesis technology to produce highly lifelike singing, giving users an immersive listening experience. It also lowers the barrier to music creation by supporting several creation methods, including image-to-music, inspiration-to-music, and lyric-to-music, so that more people can easily take part.

The Doubao·Simultaneous Interpretation Model, meanwhile, is aimed at transforming cross-language communication. It translates in real time with ultra-low latency, so users see the translation as they speak, which significantly improves communication efficiency. In terms of quality, its translations are smooth, natural, and highly accurate, approaching or even surpassing the level of human simultaneous interpreters in scenarios such as office, legal, and educational settings. Notably, the model also supports voice cloning, so the translated speech keeps the speaker's own voice, making cross-language communication more vivid, realistic, and seamless.

Try it here: https://www.volcengine.com/product/doubao