
Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI Linxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar is a multimodal diffusion transformer-based model that can generate dynamic, emotion-controllable, multi-character dialogue videos. It supports head-and-shoulder, half-body, and full-body views across multiple styles and species, including dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music, the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the QQ Music app, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image sings along in real time on the player.

On WeSing, a popular karaoke app, users can upload their own images to generate personalized MVs of themselves singing.

HunyuanVideo-Avatar shows top-tier industry performance in subject consistency and audio-video synchronization. In video dynamics and natural body movement, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor: LI Linxu
