Gabung/ Daftar

Alibaba’s New AI Can Transform Still Photos to Realistic Talking and Singing Videos

2024/02/29 16:59

Mengikuti

Revolutionising AI Animation

Following the launch of China’s first AI cartoon series, Alibaba's Institute for Intelligent Computing has introduced a groundbreaking artificial intelligence system dubbed "EMO," short for Emote Portrait Alive. This innovative system has the capability to animate static portrait photos, bringing them to life in talking and singing videos with astonishing realism.

EMO: A Leap in AI Animation Technology

EMO utilises a direct audio-to-video synthesis approach, sidestepping the need for intermediary 3D models or facial landmarks. This pioneering technique allows for the creation of fluid and expressive facial movements and head poses that closely mimic the nuances of the provided audio track.

(Source: Emote Portrait Alive)

Direct Audio-to-Video Synthesis

Unlike previous methods that relied on 3D face models or blend shapes, EMO directly converts audio waveforms into video frames. By doing so, it captures subtle motions and individual facial characteristics associated with natural speech, setting a new standard in audio-driven talking head video generation.

Character: Audrey Kathleen Hepburn-Ruston, Vocal Source: Interview Clip (Source: Emote Portrait Alive)

Cutting-edge Training Techniques

The system's foundation lies in a diffusion model, a powerful AI technique known for generating lifelike synthetic imagery. Trained on a vast dataset of over 250 hours of curated talking head videos sourced from various media, EMO has been meticulously honed to deliver unparalleled quality and expressiveness.

Exceptional Performance Metrics

Experimental results outlined in the research paper showcase EMO's superiority over existing methodologies. It outperforms competitors in crucial metrics such as video quality, identity preservation, and expressiveness. A user study further confirms the naturalness and emotiveness of videos generated by EMO.

Expanding Capabilities: Singing Videos

Beyond conversational videos, EMO demonstrates proficiency in animating singing portraits. With the ability to synchronise mouth shapes and facial expressions to vocals, it creates singing videos of remarkable realism and expressiveness, surpassing current industry standards.

Character: AI Lady from SORA, Vocal Source: Dua Lipa - Don't Start Now (Source: Emote Portrait Alive)

Its capabilities also encompass rapping, further expanding its creative potential.

Character: China Celebrity Cai Xu Kun, Vocal Source: Eminem - Rap God (Source: Emote Portrait Alive)

Implications and Ethical Considerations

EMO's ability to animate static portraits is undeniably impressive, offering new avenues for personalised content creation. However, the potential for misuse, including generating deepfakes for pornography as seen in the recent Taylor Swift's case, spreading misinformation such as Singapore Prime Minister Lee Hsien Loong promoting crypto, or even influencing elections as witnessed in US's 2024 Presidential Election, is a crucial consideration. As with any powerful technology, responsible development and safeguards are essential to mitigate the potential harms and ensure EMO remains a force for good.

A Glimpse into the Future

Alibaba's EMO represents a significant leap forward in AI animation technology. Its ability to breathe life into static images, producing lifelike talking and singing videos, holds immense promise for various applications. However, as with any transformative technology, careful consideration of ethical implications is paramount to ensure responsible innovation.

Alibaba

Dapatkan pemahaman yang lebih luas tentang industri kripto melalui laporan informatif, dan terlibat dalam diskusi mendalam dengan penulis dan pembaca yang berpikiran sama. Anda dipersilakan untuk bergabung dengan kami di komunitas Coinlive kami yang sedang berkembang:https://t.me/CoinliveSG

Tambahkan komentar

Gabunguntuk meninggalkan komentar Anda yang luar biasa…

0 Komentar

paling awal

Muat lebih banyak komentar

Berita lainnya tentang emo ai alibaba

Nov 13
Alibaba launches AI search engine to help Western small businesses source supplies
Bullish
Kasar
Sep 04
Conflux Network Teams Up with Alibaba Cloud
Bullish
Kasar
Jul 01
CertiK migrates blockchain applications to Alibaba Cloud
Bullish
Kasar
Jun 08
New Qwen2 AI Model from Alibaba to Challenge Meta, OpenAI
Bullish
Kasar
Apr 03
Alibaba Welcomes Its First AI Employee, Tongyi Lingma, Coding Genius
Bullish
Kasar
Des 24
Alibaba Cloud Unveils Cutting-Edge AI Text-to-Video Generator
Bullish1
Kasar
Des 16
Chinese Tech Giant Alibaba Unveils New AI Video Tool
Bullish
Kasar
Okt 28
Futureverse Bermitra dengan Alibaba Cloud untuk Meningkatkan Platform Musik AI JEN
Bullish
Kasar
Agt 05
Mantan eksekutif Alibaba: Tiket NFT memiliki lebih banyak keunggulan daripada sistem tradisional
Bullish
Kasar
Nov 29
Alibaba Cloud Hong Kong Summit diadakan hari ini dan Kursus Terbuka Alibaba Cloud Web3 pertama diluncurkan
Bullish
Kasar

Lagi

Berita lainnya tentang emo ai alibaba

Lagi

Alibaba’s New AI Can Transform Still Photos to Realistic Talking and Singing Videos

Revolutionising AI Animation

EMO: A Leap in AI Animation Technology

Direct Audio-to-Video Synthesis

Cutting-edge Training Techniques

Exceptional Performance Metrics

Expanding Capabilities: Singing Videos

Implications and Ethical Considerations

A Glimpse into the Future

Berita lainnya tentang emo ai alibaba

Berita lainnya tentang emo ai alibaba

Tencent, Alibaba, Baidu and Huawei Heads Collaborate to Set China’s AI Standards

Jack Ma Talks About The Future Of Alibaba In The World Of AI During His Rare Appearance

Alibaba downsizing its metaverse division, amid shifting focus and resource towards AI

Alibaba Launches Tongyi Wanxiang AI Video Generator Promising Unmatched Quality, Free and Packed with Features

Fokus AI Alibaba: Bertransisi dari Komputasi Kuantum

PERANG AI? Alibaba dan Tencent Menginvestasikan 2,5 Miliar Yuan untuk Startup AI

Model AI Sumber Terbuka Alibaba Bertujuan untuk Bersaing dengan Llama Meta

Crypto penasaran Joseph Tsai untuk mengambil alih sebagai kursi di Alibaba

Kegagalan Sistem Cloud Alibaba Mengganggu Penarikan Cryptocurrency

Alibaba Cloud meluncurkan solusi NFT… lalu dengan cepat mengisinya dengan memori