New Research Is Making Deepfake Speech Even More Real & Terrifying

Published on July 16, 2019
by Ambika Choudhury

Deepfake is a controversial topic from the very beginning. In one of our previous articles, we discussed how Deepfake is being carried out around the globe. It has already started to change a lot of things around including the television and film sector.

Recently, a group of researchers from the Max Planck Institute for Informatics, Stanford University, Princeton University, and Adobe Research created an exceptional algorithm which can make flawless edits on talking-head-videos by changing the speech content.

How The Model Works

This novel model specifically focuses on the face and upper body of a speaker and is based on text-edits and works as transcript-based editing of the talking-head video. When the transcription is edited, the algorithm selects segments from various parts of the video with a similar motion which can be joined to create the newly edited video.

The working of the model is mentioned below:

Phoneme Alignment: The researchers firstly align the transcript of the speech to a talking-head video at the level of Phonemes (Phonemes are perceptually distinct units that distinguish one word from another in a specific language). This method helps in searching snippets in the video which can be later combined to create new content.
3D Face Tracking and Reconstruction: A 3D parametric face model is registered with each frame of the input talking-head video which will later help to selectively blend different aspects of the face.
Viseme Search: Given an edit operation, the model performs a viseme search (Visemes are the groups of aurally distinct phonemes that appear visually similar to one another) in order to find the best match between the subsequences of the phonemes in the video.
Parameter Retiming & Blending: The parametric face model is used in order to mix different properties of a face such as a pose, expressions, etc. from different input frames and then blend them together in parameter space.
Neural Face Rendering: A neural face rendering approach is implied in order to synthesize photo-realistic talking-head video which matches the modified parameter sequence and thus creating a photo-realistic talking-head video frame.

Applications Of The Model

The researchers mainly focused to use this model for video editing and translation in the production of movies, TV shows, commercials, YouTube video logs, and online lectures as a better editing tool.

Currently, the model supports three kinds of edit operations as mentioned below

Add New Words: In this type, one or more consecutive words can be added at a particular point of a video.
Rearrange existing words: In this type, the edit works by moving one or more consecutive words that exist in the video.
Delete existing words: In this type, the edit works by removing one or more consecutive words from the video.

The Other Perspective

As the advancement of technology has huge and immense advantages, however, there are some people who will never stop while utilising it for bad means. The researchers raised important and valid concerns about the probability for misusing the test-based editing approach such as utilising this technology to falsify personal allegations and scandal famous individuals.

One of the researchers from Stanford University stated that every advanced technology will undoubtedly attract people with negative thoughts. For these reasons, the researchers propose some guidelines such as developing forensics, biometrics and other verification methods to diagnose the manipulated videos by the viewers.

Access all our open Survey & Awards Nomination forms in one place >>

Ambika Choudhury

A Technical Journalist who loves writing about Machine Learning and Artificial Intelligence. A lover of music, writing and learning something out of the box.

New Research Is Making Deepfake Speech Even More Real & Terrifying

How The Model Works

Applications Of The Model

The Other Perspective

Ambika Choudhury

Download our Mobile App

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

3 Ways to Join our Community

Telegram group

Discord Server

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox

Recent Stories

KissanAI Releases Dhenu Llama 3, an Indic LLM for Farmers

Enhancing AI Integration through Optimal Data Management in the Global Convenience Food and Beverage Sector

Is it Humane to Bash Humane Ai Pin?

Meta Llama 3 Now Available on Databricks For Enterprise

How Databricks is Enabling Agriculture’s Data Revolution with UPL

How Good is Llama 3 for Indic Languages?

OpenAI Hires Pragya Misra As Its First Employee in India

Meta Forces Developers Cite ‘Llama 3’ in their AI Development

India is Making its Own AI Servers

World's Biggest Media & Analyst firm specializing in AI

Advertise with us

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

Branded Content

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

Corporate Upskilling

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

Hackathons

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Talent Assessment

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

Research & Advisory

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Conferences & Events

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

AIM Launches the 3rd Edition of Data Engineering Summit. May 30-31, Bengaluru