Voice-driven animation method and device based on artificial intelligence

A voice and animation technology, applied in the field of data processing, can solve problems such as reducing interactive experience, deviation of expression parameters, inconsistent voice, etc., and achieve the effect of improving interactive experience

Pending Publication Date: 2019-11-26
TENCENT TECH (SHENZHEN) CO LTD
View PDF6 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, since the extracted acoustic features contain information related to the speaker, the mapping model established based on this can accurately determine the corresponding expression parameters for the speech of a specific speaker. If the speaker changes, the expression determined by the mapping model There will be a large deviation in the parameters, and the mouth shape of the animated image driven by this will be inconsistent with the voice, which will reduce the interactive experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice-driven animation method and device based on artificial intelligence
  • Voice-driven animation method and device based on artificial intelligence
  • Voice-driven animation method and device based on artificial intelligence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Embodiments of the present application are described below in conjunction with the accompanying drawings.

[0037] For the Speech2Face system adopted in the related art, see figure 1 shown. For the voice of a speaker, the system can extract the acoustic features of the voice to obtain MFCC. Then, the expression parameters are determined based on the acoustic features through the mapping model. For a set animation image whose expression (such as mouth shape) can be adjusted by adjusting expression parameters, the animation image corresponding to the segment of voice is generated by using the determined expression parameters to adjust the animation image.

[0038] However, since the acoustic features extracted in related technologies are related to the speaker, when the speaker is changed, the expression parameters determined by the mapping model will deviate greatly, and the mouth shape of the animated image driven by this will be inconsistent with the voice, reducing ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention discloses a voice-driven animation method based on artificial intelligence. When a to-be-processed voice comprising a plurality of voice frames is obtained, linguistic information corresponding to the voice frames in the to-be-processed voice can be determined, wherein each piece of the linguistic information is used for identifying the probability of distribution ofphonemes to which the corresponding voice frame belongs, namely, reflecting which probability distribution of the phonemes contents in the voice frame belong to; information carried by the linguisticinformation is irrelevant to an actual speaker of the to-be-processed voice, so that the influence of pronunciation habits of different speakers on determination of subsequent expression parameters can be counteracted; and according to the expression parameters determined by the linguistic information, an animation image can be accurately driven to make an expression corresponding to the to-be-processed voice, such as a mouth shape, so that the to-be-processed voice corresponding to any speaker can be effectively supported and the interactive experience is improved.

Description

technical field [0001] The present application relates to the field of data processing, in particular to a voice-driven animation method and device based on artificial intelligence. Background technique [0002] At present, the technology of voice-to-virtual facial animation generation is becoming a research hotspot in the field of industrial applications. For example, for a voice of any speaker, an animated image can be driven to make the mouth shape corresponding to the voice. In this scenario, the presence of animated images can greatly enhance the sense of reality, improve expressiveness, and bring users a more immersive experience. [0003] One way is to implement the above technology through the Speech2Face system. Generally speaking, for a speaker's voice, after the system extracts the acoustic features in the voice such as Mel Frequency Cepstral Coefficient (MFCC), the mapping model can be used to determine the corresponding one that can be adjusted based on the aco...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/22G10L15/25
CPCG10L15/02G10L15/22G10L15/25G10L2015/025G10L2015/227G10L21/10G10L25/63G10L25/30G06N3/044G06T13/205G06T13/40G10L15/04G10L15/187G10L15/30G06N3/045
Inventor 康世胤陀得意李广之傅天晓黄晖榕苏丹
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products