Front man-machine interaction voice recognition method and system based on computer vision assistance

A computer vision and speech recognition technology, applied in speech recognition, computer parts, computing, etc., can solve problems such as wrong commands, inaccurate speech segmentation, and inability to effectively identify target voice commands and voice commands, and achieve the effect of improving accuracy.

Active Publication Date: 2019-03-01
FUJIAN SHIDA COMP EQUIP
View PDF10 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Therefore, traditional speech recognition technology cannot effectively recognize the voice spoken by the initiator of the target voice command when the speaker is in a noisy environment with positive human-computer interaction, different people at the same sound source are speaking alternately, or there are other people talking nearby. Order
At the same time, due to the inaccuracy of the previous speech recognition algorithm for speech segmentation, it may happen that the first half of the sentence is recognized before the sentence is finished, resulting in the execution of the wrong command.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Front man-machine interaction voice recognition method and system based on computer vision assistance
  • Front man-machine interaction voice recognition method and system based on computer vision assistance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0035] It should be pointed out that the following detailed description is exemplary and is intended to provide further explanation to the present application. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

[0036] It should be noted that the terminology used here is only for describing specific implementations, and is not intended to limit the exemplary implementations according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural, and it should also be understood that when the terms "comprising" and / or "comprising" are used in this specification, they mean There are features, steps, operations, means, components and / or combina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a front man-machine interaction voice recognition method and a front man-machine interaction voice recognition system based on computer vision assistance. According to the method and the system, video signal input is added in the traditional voice recognition flow, and then the simultaneous recognition with the voice signal is realized; voice assistance is carried out in human face recognition and human face lip movement recognition, and whether a target to be recognized is speaking or not is judged; and meanwhile, through human face recognition and auxiliary positioning, the orientation of a speaker is judged, and in addition, enhancement processing is carried out on a sound source signal in the specified direction by utilizing the corresponding orientation. Withthe method and the system, the recognition accuracy for voice command and voice input information of clients can be effectively enhanced under the specific environments, including the man-machine interaction use situations in which the clients need to face the device from the just front side, such as self-service retail terminals, bank self-service terminals and insurance self-service terminals.

Description

technical field [0001] The invention relates to the field of speech recognition, in particular to a computer vision-assisted frontal human-computer interaction speech recognition method and system. Background technique [0002] The current speech recognition technologies are all directly recognized based on the input audio. The main method used in the entire audio recognition process is to analyze the input audio to obtain the speech text content in the audio. [0003] Therefore, traditional speech recognition technology cannot effectively recognize the voice spoken by the initiator of the target voice command when the speaker is in a noisy environment with positive human-computer interaction, different people at the same sound source are speaking alternately, or there are other people talking nearby. Order. At the same time, due to the inaccuracy of the previous speech recognition algorithm for speech segmentation, it may happen that the first half of the sentence is recog...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/22G10L15/22G06K9/00G10L21/02
CPCG10L15/22G10L17/22G10L21/02G06V40/16G06V40/171G06V40/20
Inventor 邱霖恺刘维王贤俊高刚强郑文侃宋煌钟
Owner FUJIAN SHIDA COMP EQUIP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products