Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Image generation method and device, equipment and storage medium

An image generation and image technology, applied in the computer field, can solve the problems of inaccurate pronunciation actions and affect the look and feel effect, and achieve the effect of improving the look and feel effect and the accuracy.

Active Publication Date: 2021-08-24
BEIJING SENSETIME TECH DEV CO LTD
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] If the pronunciation action reflected in the pronunciation face image is inaccurate, it may affect the perception effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image generation method and device, equipment and storage medium
  • Image generation method and device, equipment and storage medium
  • Image generation method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Exemplary embodiments will be described in detail below, and examples are illustrated in the drawings. The following description is related to the drawings, unless otherwise indicated, the same numbers in the drawings represent the same or similar elements. The embodiments described in the exemplary embodiments are not meant to all embodiments consistent with the present application. Instead, it is only an example of apparatus and method consistent with some aspects of the present application as detailed in the appended claims.

[0033] The terms used in this application are only for the purpose of describing particular embodiments, not to limit the invention. "One", "one", "one", "one", "" "," and "" "as used in the present application and the appended claims are also intended to include many forms unless otherwise clearly indicated. It should also be understood that the terms "and / or" as used herein refer to any or from any or more of the associated listing items. It sh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an image generation method and device, equipment and a storage medium. The method comprises the following steps: receiving audio data, and extracting text features corresponding to an audio sequence included in the audio data, wherein the text feature represents the text content of the corresponding audio sequence; performing facial feature mapping based on the text features corresponding to the audio sequence to obtain facial features corresponding to the audio sequence, wherein facial features represent pronunciation actions corresponding to the audio sequence; and generating a pronunciation face image corresponding to the audio sequence according to the facial features corresponding to the audio sequence and the received face image.

Description

Technical field [0001] The present application relates to the field of computer technology, and more particularly to an image generation method, apparatus, device, and storage medium. Background technique [0002] The generation of a pronunciation is a very critical technique that speech drive characters and virtual digital people. [0003] The pronunciation of human face image refers to a procedure of a pronunciation of a pronunciation of a sound action when generating an actuated voice based on the received audio data and a face image. [0004] If the pronunciation action embodied in the pronunciation of people, it may affect the effect of the effect. Inventive content [0005] In view of this, the present application discloses an image generation method. The method can include receiving audio data, extracting text features corresponding to the audio sequence included in the audio data; the text feature characterizes the text content of the audio sequence; based on the text ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/10G10L25/24G10L25/30
CPCG10L21/10G10L25/30G10L25/24G10L2021/105
Inventor 吴潜溢吴文岩戴勃王宇欣高娜钱晨
Owner BEIJING SENSETIME TECH DEV CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products