Voice-based mouth shape animation synthesis device and method and readable storage medium

An animation synthesis and lip-synthesis technology, applied in the computer field, that addresses problems such as the inability to match lip animations to voice data.

Active Publication Date: 2018-11-06
PING AN TECH (SHENZHEN) CO LTD
Cites: 5; Cited by: 27

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a voice-based lip animation synthesis device, method and readable storage medium. Its main purpose is to solve the technical problem that the prior art cannot display realistic lip animations that match the synthesized voice data.




Embodiment Construction

[0046] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0047] The invention provides a voice-based lip animation synthesis device. Figure 1 is a schematic diagram of a preferred embodiment of the voice-based lip animation synthesis device of the present invention.

[0048] In this embodiment, the voice-based lip animation synthesis device may be a PC (personal computer), or a terminal device such as a smartphone, a tablet computer, or a portable computer. The device includes at least a memory 11, a processor 12, a communication bus 13, and a network interface 14.

[0049] The memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (e.g., SD or DX memory, etc.), magnetic...


Abstract

The invention discloses a voice-based mouth shape animation synthesis device. The device comprises a memory and a processor. The memory stores a mouth shape animation synthesis program that can run on the processor, and when the program is executed by the processor it performs the following steps: target text data is acquired, and phonemic characteristics in the target text data are acquired according to a pronunciation dictionary; the phonemic characteristics are input into a pre-trained deep neural network model to output acoustic characteristics, and the acoustic characteristics are input into a voice synthesizer to output voice data; mouth shape data is acquired according to the voice data, a pre-trained tensor model and speaker identification information; and a mouth shape animation corresponding to the voice data is generated according to the mouth shape data. The invention furthermore provides a voice-based mouth shape animation synthesis method and a computer readable storage medium. The device, method and computer readable storage medium solve the technical problem that the prior art cannot display a realistic mouth shape animation that matches the synthesized voice data.
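The order of the steps in the abstract can be sketched as a small pipeline. This is a minimal illustration only, not the patented implementation: the toy dictionary `PRONUNCIATION_DICT`, the class `LipSyncPipeline`, and the three `dummy_*` functions are hypothetical stand-ins for the pronunciation dictionary, the pre-trained deep neural network model, the voice synthesizer, and the pre-trained tensor model, none of which are specified in this excerpt.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical toy pronunciation dictionary: word -> phoneme sequence.
PRONUNCIATION_DICT: Dict[str, List[str]] = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def text_to_phonemes(text: str, dictionary: Dict[str, List[str]]) -> List[str]:
    """Look up each word in the dictionary; unknown words map to '<unk>'."""
    phonemes: List[str] = []
    for word in text.lower().split():
        phonemes.extend(dictionary.get(word, ["<unk>"]))
    return phonemes


@dataclass
class LipSyncPipeline:
    # Each callable is a placeholder for a trained component (assumption).
    acoustic_model: Callable[[List[str]], List[List[float]]]      # DNN stand-in
    vocoder: Callable[[List[List[float]]], List[float]]           # synthesizer stand-in
    mouth_model: Callable[[List[float], str], List[List[float]]]  # tensor model stand-in

    def run(self, text: str, speaker_id: str) -> dict:
        phonemes = text_to_phonemes(text, PRONUNCIATION_DICT)  # text -> phonemes
        acoustic = self.acoustic_model(phonemes)               # phonemes -> acoustic
        voice = self.vocoder(acoustic)                         # acoustic -> voice data
        mouth = self.mouth_model(voice, speaker_id)            # voice + speaker -> mouth
        return {"phonemes": phonemes, "voice": voice, "mouth_frames": mouth}


def dummy_acoustic_model(phonemes: List[str]) -> List[List[float]]:
    # One made-up 2-dimensional feature vector per phoneme.
    return [[float(len(p)), float(i)] for i, p in enumerate(phonemes)]


def dummy_vocoder(features: List[List[float]]) -> List[float]:
    # One made-up audio sample per feature vector.
    return [sum(frame) for frame in features]


def dummy_mouth_model(voice: List[float], speaker_id: str) -> List[List[float]]:
    # One mouth-opening value per sample, scaled by a speaker-dependent factor.
    scale = 1.0 + (len(speaker_id) % 3) * 0.1
    return [[abs(sample) * scale] for sample in voice]


if __name__ == "__main__":
    pipeline = LipSyncPipeline(dummy_acoustic_model, dummy_vocoder, dummy_mouth_model)
    result = pipeline.run("hello world", speaker_id="spk1")
    print(result["phonemes"])
    print(len(result["mouth_frames"]), "mouth shape frames")
```

Passing the three stages in as callables keeps the data flow visible while leaving each trained model replaceable; a final rendering step that turns the mouth shape frames into an animation is omitted here.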

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a voice-based lip animation synthesis device, method and readable storage medium.

Background technique

[0002] Speech synthesis, also known as text-to-speech technology, is a technology that can convert text information into speech and read it aloud. It involves multiple disciplines such as acoustics, linguistics, digital signal processing, and computer science. It is a cutting-edge technology in the field of Chinese information processing. The main problem to be solved is how to convert text information into audible sound information.

[0003] In some application scenarios, such as computer-aided pronunciation training application scenarios, it is necessary to dynamically display the mouth shape changes of the speaker when playing voice data to help users perform pronunciation training. In the prior art, synthetic When there is no real speaker's mouth shape data correspondin...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06N3/08, G06T13/20, G10L13/02, G10L15/02, G10L15/16, G10L25/24, G10L25/30
CPC: G06F40/289, G06N3/084, G06T13/205, G10L13/02, G10L15/02, G10L15/16, G10L25/24, G10L25/30, G10L2015/025, G06N3/08, G10L13/033, G10L13/04, G10L17/00
Inventor: 梁浩, 王健宗, 肖京
Owner: PING AN TECH (SHENZHEN) CO LTD