Voice-driven animation method and device based on artificial intelligence

A voice-driven animation technology, applied in the field of data processing, that addresses the deviation of expression parameters when the speaker changes, the resulting mismatch between mouth shape and voice, and the degraded interactive experience, thereby improving the interactive experience.

Pending Publication Date: 2019-11-26

TENCENT TECH (SHENZHEN) CO LTD


Problems solved by technology

[0004] However, because the extracted acoustic features contain speaker-related information, a mapping model built on them can accurately determine the corresponding expression parameters only for the speech of a specific speaker. If the speaker changes, the expression parameters determined by the mapping model will deviate greatly, the mouth shape of the animated image driven by them will be inconsistent with the voice, and the interactive experience will be reduced.


Embodiment Construction

[0036] Embodiments of the present application are described below in conjunction with the accompanying drawings.

[0037] The Speech2Face system adopted in the related art is shown in Figure 1. For a segment of a speaker's voice, the system extracts acoustic features from the voice, for example MFCC. The mapping model then determines expression parameters from these acoustic features. Given a preset animated image whose expression (such as mouth shape) can be adjusted through expression parameters, the determined expression parameters are used to adjust the animated image, producing the animation corresponding to that segment of voice.
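The related-art flow described above (acoustic features, then a mapping model, then expression parameters) can be sketched roughly as follows. This is a minimal illustration and not the Speech2Face implementation: the feature vectors are assumed to be precomputed per-frame acoustic features, and the linear mapping, its weights, and the parameter names `mouth_open` and `mouth_wide` are all hypothetical stand-ins for a trained mapping model.

```python
from typing import Dict, List, Sequence

# Hypothetical mapping model: one weight row and one bias per expression
# parameter. A real system would use a trained model; these numbers are
# invented purely for illustration.
PARAM_NAMES = ["mouth_open", "mouth_wide"]
WEIGHTS = [
    [0.5, -0.2, 0.1],   # weights producing "mouth_open"
    [-0.1, 0.4, 0.3],   # weights producing "mouth_wide"
]
BIASES = [0.1, 0.0]


def clamp01(value: float) -> float:
    """Keep an expression parameter inside the range [0, 1]."""
    return max(0.0, min(1.0, value))


def map_frame(features: Sequence[float]) -> Dict[str, float]:
    """Map one frame's acoustic feature vector to expression parameters."""
    params = {}
    for name, row, bias in zip(PARAM_NAMES, WEIGHTS, BIASES):
        raw = sum(w * f for w, f in zip(row, features)) + bias
        params[name] = clamp01(raw)
    return params


def drive_animation(feature_frames: List[Sequence[float]]) -> List[Dict[str, float]]:
    """Produce one set of expression parameters per voice frame."""
    return [map_frame(frame) for frame in feature_frames]


# Two assumed 3-dimensional acoustic feature frames.
animation = drive_animation([[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]])
```

Because the input here is the acoustic feature vector itself, anything speaker-specific in those features flows directly into the expression parameters.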

[0038] However, because the acoustic features extracted in the related art are speaker-dependent, when the speaker changes, the expression parameters determined by the mapping model deviate greatly, and the mouth shape of the animated image driven by them becomes inconsistent with the voice, reducing ...


Abstract

An embodiment of the invention discloses a voice-driven animation method based on artificial intelligence. When a to-be-processed voice comprising a plurality of voice frames is obtained, the linguistic information corresponding to each voice frame can be determined. Each piece of linguistic information identifies the distribution probability of the phonemes to which the corresponding voice frame belongs, that is, it reflects the probability distribution over phonemes of the content in that voice frame. The information carried by the linguistic information is independent of the actual speaker of the to-be-processed voice, so the influence of different speakers' pronunciation habits on the subsequent determination of expression parameters can be counteracted. Based on the expression parameters determined from the linguistic information, an animated image can be accurately driven to make an expression, such as a mouth shape, corresponding to the to-be-processed voice. The method can therefore effectively support to-be-processed voice from any speaker and improves the interactive experience.
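As a rough sketch of the idea in the abstract, the snippet below turns per-frame scores into a probability distribution over phonemes and then derives expression parameters from that distribution. Everything specific here is an assumption for illustration: the phoneme inventory, the per-phoneme target values, and the input scores (which stand in for the output of some phoneme recognizer). The probability-weighted blend is a simple substitute for whatever mapping the patent actually uses to go from linguistic information to expression parameters.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical phoneme inventory and per-phoneme targets for two
# expression parameters: (mouth_open, mouth_round).
PHONEMES = ["sil", "a", "m", "u"]
PHONEME_TARGETS = {
    "sil": (0.0, 0.0),
    "a": (1.0, 0.0),
    "m": (0.0, 0.0),
    "u": (0.4, 1.0),
}


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw per-phoneme scores into a probability distribution."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def expression_from_posterior(posterior: Sequence[float]) -> Tuple[float, float]:
    """Blend per-phoneme targets, weighted by the phoneme probabilities."""
    mouth_open = sum(p * PHONEME_TARGETS[ph][0] for p, ph in zip(posterior, PHONEMES))
    mouth_round = sum(p * PHONEME_TARGETS[ph][1] for p, ph in zip(posterior, PHONEMES))
    return mouth_open, mouth_round


def drive_from_scores(frame_scores: List[Sequence[float]]) -> List[Tuple[float, float]]:
    """One (mouth_open, mouth_round) pair per voice frame."""
    return [expression_from_posterior(softmax(scores)) for scores in frame_scores]


# A frame with no evidence for any phoneme, and a frame that strongly favors "a".
uniform_frame = expression_from_posterior(softmax([0.0, 0.0, 0.0, 0.0]))
certain_a = expression_from_posterior([0.0, 1.0, 0.0, 0.0])
```

In this sketch the expression parameters depend only on which phonemes are likely in a frame, not on the raw acoustic features, which is the property the abstract relies on to support any speaker.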

Description

Technical field

[0001] The present application relates to the field of data processing, and in particular to a voice-driven animation method and device based on artificial intelligence.

Background

[0002] At present, generating virtual facial animation from voice is becoming a research hotspot in industrial applications. For example, for a segment of voice from any speaker, an animated image can be driven to make the mouth shape corresponding to that voice. In this scenario, the presence of the animated image can greatly enhance the sense of reality, improve expressiveness, and give users a more immersive experience.

[0003] One way to implement this technology is the Speech2Face system. Generally speaking, for a speaker's voice, after the system extracts acoustic features from the voice, such as Mel Frequency Cepstral Coefficients (MFCC), the mapping model can be used to determine the corresponding one that can be adjusted based on the aco...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L15/02, G10L15/22, G10L15/25

CPC: G10L15/02, G10L15/22, G10L15/25, G10L2015/025, G10L2015/227, G10L21/10, G10L25/63, G10L25/30, G06N3/044, G06T13/205, G06T13/40, G10L15/04, G10L15/187, G10L15/30, G06N3/045

Inventors: 康世胤, 陀得意, 李广之, 傅天晓, 黄晖榕, 苏丹

Owner: TENCENT TECH (SHENZHEN) CO LTD
