Method, system and device for converting voice into lip shape and storage medium

A technology for converting voice into lip shapes, applied in speech analysis, character and pattern recognition, instruments, etc. It addresses problems such as a degraded visual experience, a huge amount of calculation, and long processing time, and achieves fast processing, smooth visual effects, and elimination of the influence of preprocessing.

Active Publication Date: 2020-06-09
RES INST OF TSINGHUA PEARL RIVER DELTA +1

AI Technical Summary

Problems solved by technology

Existing technology generally suffers from a huge amount of calculation and long processing time, so converting speech into lip shapes incurs a large delay. If the speech to be processed is itself obtained by text conversion, and the output lip shape must be passed on to subsequent steps such as deformation or mapping, the delay of the speech-to-lip-shape conversion adds to the delays of the other processes. The result is an easily noticeable and unacceptable total delay that seriously affects the visual experience.



Examples


Embodiment Construction

[0047] In this embodiment, a trained long short-term memory (LSTM) network is mainly used to convert speech into lip shapes.
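As a rough illustration of the idea in [0047], the sketch below runs a sequence of per-frame speech features through a single-layer LSTM and maps each hidden state to a vector of lip key point coordinates. It is not the patent's network: the class name `TinyLSTM`, the 13-dimensional input features, the hidden size, and the 20 key points are all hypothetical choices, and the weights are random rather than trained.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTM:
    """Single-layer LSTM plus a linear output layer (untrained, illustrative only)."""

    GATES = ("input", "forget", "output", "candidate")

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        # One weight matrix per gate, applied to the concatenation [x_t, h_{t-1}].
        self.w = {g: rand_matrix(n_hidden, n_in + n_hidden, rng) for g in self.GATES}
        self.b = {g: [0.0] * n_hidden for g in self.GATES}
        self.w_out = rand_matrix(n_out, n_hidden, rng)

    def forward(self, frames):
        """Return one output vector (flattened key point coordinates) per input frame."""
        h = [0.0] * self.n_hidden  # hidden state
        c = [0.0] * self.n_hidden  # cell state
        outputs = []
        for x in frames:
            z = list(x) + h
            pre = {
                g: [a + b for a, b in zip(matvec(self.w[g], z), self.b[g])]
                for g in self.GATES
            }
            i_gate = [sigmoid(v) for v in pre["input"]]
            f_gate = [sigmoid(v) for v in pre["forget"]]
            o_gate = [sigmoid(v) for v in pre["output"]]
            cand = [math.tanh(v) for v in pre["candidate"]]
            # Standard LSTM update: keep part of the old cell state, add new candidate.
            c = [f * c_old + i * g for f, c_old, i, g in zip(f_gate, c, i_gate, cand)]
            h = [o * math.tanh(c_new) for o, c_new in zip(o_gate, c)]
            outputs.append(matvec(self.w_out, h))
        return outputs


# Hypothetical sizes: 13 features per speech frame, 20 key points with (x, y) each.
model = TinyLSTM(n_in=13, n_hidden=8, n_out=2 * 20)
rng = random.Random(1)
frames = [[rng.uniform(-1.0, 1.0) for _ in range(13)] for _ in range(5)]
keypoints = model.forward(frames)
```

Because the hidden and cell states carry over from frame to frame, each output depends on the preceding speech context, and a single forward pass costs only a few small matrix-vector products per frame.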

[0048] The training process of the long short-term memory network, shown in Figure 1, includes the following steps:

[0049] P1. Construct a training set. This step consists of sub-steps P101-P104; executing them yields the speech samples and lip shape key point samples that form the training set.

[0050] P101. Shoot a video of the speaker speaking. This step is mainly performed with a video camera that has a recording function, or with similar equipment. In this embodiment, the ratio between the time the speaker spends speaking and the time spent not speaking in the final video is controlled by controlling the speech content, directing the speaker's speech rhythm, and editing the captured video afterwards. In this embodiment, try...


Abstract

The invention discloses a method, a system and a device for converting voice into lip shapes, and a storage medium. The method processes voice with a long short-term memory network, which achieves a relatively high processing speed: the lip shape key points are output with relatively little time consumption, and the delay introduced by the whole process is relatively low. Through training, the long short-term memory network learns to analyze received voice according to the regularities of human language, so that a suitable lip shape image can be output more accurately. A reverse processing step is applied to the lip shape key points output by the long short-term memory network: the key points are processed with the opposite logic of the preprocessing applied to the training set. This eliminates the influence of the preprocessing on the trained network, gives the final lip shape key points a proper distribution, and facilitates subsequent visual processing, for example with generative adversarial networks. The method is widely applicable in the technical field of voice data.
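The reverse processing idea in the abstract can be sketched as a pair of mutually inverse transforms. The excerpt does not say which preprocessing the training set undergoes, so the example below assumes simple per-dimension standardization as a stand-in; the function names `fit_stats`, `preprocess`, and `reverse_process` and the sample coordinates are hypothetical.

```python
import math


def fit_stats(samples):
    """Per-dimension mean and standard deviation over training key point vectors."""
    n = len(samples)
    dims = len(samples[0])
    mean = [sum(s[d] for s in samples) / n for d in range(dims)]
    std = []
    for d in range(dims):
        var = sum((s[d] - mean[d]) ** 2 for s in samples) / n
        std.append(math.sqrt(var) or 1.0)  # fall back to 1.0 for constant dimensions
    return mean, std


def preprocess(vec, mean, std):
    """Assumed training-set preprocessing: standardize each coordinate."""
    return [(v - m) / s for v, m, s in zip(vec, mean, std)]


def reverse_process(vec, mean, std):
    """Opposite logic of preprocess: map network-space values back to pixel space."""
    return [v * s + m for v, m, s in zip(vec, mean, std)]


# Hypothetical training key points, flattened as [x, y] in pixel coordinates.
train = [[100.0, 200.0], [110.0, 220.0], [120.0, 240.0]]
mean, std = fit_stats(train)

normalized = preprocess(train[0], mean, std)          # what the network is trained on
restored = reverse_process(normalized, mean, std)     # what downstream steps receive
```

In this sketch the statistics are fitted once on the training set and reused at inference time, so that key points predicted in the normalized space are returned to the coordinate distribution that later visual processing expects.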

Description

Technical field

[0001] The invention relates to the technical field of voice data, in particular to a method, system, device and storage medium for converting voice into lip shapes.

Background technique

[0002] In fields such as virtual anchors, there is wide demand for converting speech into lip shapes. By combining text-to-speech technology with speech-to-lip-shape conversion and then displaying the lip shapes on a computer-generated character avatar, a dry press release can be turned into realistic lip movements that provide a good visual experience. However, existing technology generally suffers from a huge amount of calculation and long processing time, so converting speech into lip shapes incurs a large delay. If the speech to be processed by the prior art is obtained from text conversion, and the output lip shape needs to be applied to subsequent steps such as deformation or mapping, then the delay in the process of converting speech...


Application Information

IPC(8): G10L21/10, G10L25/30, G10L25/24, G10L19/02, G10L19/26, G06K9/00, G06K9/62, G06T13/20, G06T13/40
CPC: G10L21/10, G10L25/30, G10L25/24, G10L19/0212, G10L19/26, G06T13/205, G06T13/40, G10L2021/105, G06V40/20, G06F18/2135
Inventor 黄桂芳, 李权, 叶俊杰, 王伦基, 任勇, 韩蓝青
Owner RES INST OF TSINGHUA PEARL RIVER DELTA