Voice-based mouth shape animation synthesis device and method and readable storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An animation synthesis and lip-synthesis technology, applied in the computer field, can solve problems such as the inability to match voice data to lip-synthesis animations

Active Publication Date: 2018-11-06

PING AN TECH (SHENZHEN) CO LTD

View PDF5 Cites 27 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present invention provides a voice-based lip animation synthesis device, method and readable storage medium, the main purpose of which is to solve the technology in the prior art that cannot display realistic lip animations that match the synthesized voice data question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0046] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0047] The invention provides a voice-based lip animation synthesis device. refer to figure 1 As shown, it is a schematic diagram of a preferred embodiment of the voice-based lip animation synthesis device of the present invention.

[0048]In this embodiment, the voice-based lip animation synthesis device may be a PC (Personal Computer, personal computer), or may be a terminal device such as a smart phone, a tablet computer, or a portable computer. The voice-based lip animation synthesis device at least includes a memory 11 , a processor 12 , a communication bus 13 , and a network interface 14 .

[0049] Wherein, the memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (eg, SD or DX memory, etc.), magnetic...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice-based mouth shape animation synthesis device. The device comprises a memory and a processor, wherein a mouth shape animation synthesis program capable of running on theprocessor is stored on the memory, and the program is executed by the processor through the steps that target text data is acquired, and phonemic characteristics in the target text data are acquiredaccording to a pronunciation dictionary; the phonemic characteristics are input into a pre-trained deep neural network model to output acoustic characteristics, and the acoustic characteristics are input into a voice synthesizer to output voice data; according to the voice data, a pre-trained tensor model and speaker identification information, mouth shape data is acquired; and a mouth shape animation corresponding to the voice data is generated according to the mouth shape data. The invention furthermore provides a voice-based mouth shape animation synthesis method and a computer readable storage medium. Through the voice-based mouth shape animation synthesis device and method and the computer readable storage medium, the technical problem that a mouth shape animation which is matched with synthesized voice data and has a sense of reality cannot be displayed in the prior art is solved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a voice-based lip animation synthesis device, method and readable storage medium. Background technique [0002] Speech synthesis, also known as text-to-speech technology, is a technology that can convert text information into speech and read it aloud. It involves multiple disciplines such as acoustics, linguistics, digital signal processing, and computer science. It is a cutting-edge technology in the field of Chinese information processing. The main problem to be solved is how to convert text information into audible sound information. [0003] In some application scenarios, such as computer-aided pronunciation training application scenarios, it is necessary to dynamically display the mouth shape changes of the speaker when playing voice data to help users perform pronunciation training. In the prior art, synthetic When there is no real speaker's mouth shape data correspondin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F17/27G06N3/08G06T13/20G10L13/02G10L15/02G10L15/16G10L25/24G10L25/30

CPCG06F40/289G06N3/084G06T13/205G10L13/02G10L15/02G10L15/16G10L25/24G10L25/30G10L2015/025G06N3/08G10L13/033G10L13/04G10L17/00

Inventor 梁浩王健宗肖京

Owner PING AN TECH (SHENZHEN) CO LTD

Voice-based mouth shape animation synthesis device and method and readable storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology