Microsoft makes Mona Lisa rap with AI help


Microsoft makes the Mona Lisa rap with AI technology. Photo courtesy: X page video grab

The iconic Mona Lisa is no longer only smiling; she also likes to sing, thanks to the new artificial intelligence technology unveiled by Microsoft.

Last week, Microsoft researchers detailed a new AI model they have developed that can take a still image of a face and an audio clip of someone speaking and automatically create a realistic-looking video of that person speaking, CNN reported.

The video can leave people stunned, as it comes complete with lip-syncing and natural face and head movements.
In one demo video, researchers showed how they animated the Mona Lisa to recite a comedic rap by actor Anne Hathaway, the American news channel reported.

Microsoft just dropped VASA-1.

This AI can make single image sing and talk from audio reference expressively. Similar to EMO from Alibaba

10 wild examples:

1. Mona Lisa rapping Paparazzi pic.twitter.com/LSGF3mMVnD

— Min Choi (@minchoi) April 18, 2024

Speaking about outputs from the AI model, named VASA-1, Microsoft said: “We introduce VASA, a framework for generating lifelike talking faces of virtual characters with appealing visual affective skills (VAS), given a single static image and a speech audio clip. Our premiere model, VASA-1, is capable of not only producing lip movements that are exquisitely synchronised with the audio, but also capturing a large spectrum of facial nuances and natural head motions that contribute to the perception of authenticity and liveliness.”

“The core innovations include a holistic facial dynamics and head movement generation model that works in a face latent space, and the development of such an expressive and disentangled face latent space using videos. Through extensive experiments including evaluation on a set of new metrics, we show that our method significantly outperforms previous methods along various dimensions comprehensively. Our method not only delivers high video quality with realistic facial and head dynamics but also supports the online generation of 512×512 videos at up to 40 FPS with negligible starting latency. It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviours,” the website said.
