Speaking recognition method based on video analysis, system and equipment and medium

A technology of video analysis and recognition method, applied in the field of intelligent interaction, can solve the problem that speech recognition is not intelligent enough, and achieve the effect of accurate speech recognition results

Active Publication Date: 2021-07-27
GRG INTELLIGENT TECH SOLUTION CO LTD +1
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to overcome the deficiencies of the prior art, one of the purposes of the present invention is to provide a speech recognition method based on video analysis, which can solve the problem that the speech recognition of the person to be recognized in the traditional intelligent interactive system has certain limitations and is not intelligent enough. The problem
[0004] The second object of the present invention is to provide a speech recognition system based on video analysis, which can solve the problem that the speech recognition of the person to be recognized has certain limitations and is not intelligent enough in the traditional intelligent interactive system
[0005] The third object of the present invention is to provide an electronic device, which can solve the problem that the speech recognition of the person to be recognized has certain limitations and is not intelligent enough in the traditional intelligent interactive system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaking recognition method based on video analysis, system and equipment and medium
  • Speaking recognition method based on video analysis, system and equipment and medium
  • Speaking recognition method based on video analysis, system and equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Below, the present invention will be further described in conjunction with the accompanying drawings and specific implementation methods. It should be noted that, under the premise of not conflicting, the various embodiments described below or the technical features can be combined arbitrarily to form new embodiments. .

[0042] like figure 1 As shown, a method for speech recognition based on video analysis in this embodiment includes the following steps:

[0043] Read video data, read the target video data collected by the camera in the intelligent interactive system.

[0044] Image preprocessing, performing clipping processing and grayscale processing on each video frame in the target video data to obtain an input image corresponding to each video frame. Specifically: performing size cutting on each video frame in the target video data, and performing grayscale processing on the size-cut video frame, converting it into a grayscale image, and using the grayscale image ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech recognition method based on video analysis, and the method comprises the steps: carrying out the cutting and graying of each video frame in target video data, and obtaining an input image corresponding to each video frame; performing face detection processing on all the input images to obtain a face detection frame; screening the face detection frames corresponding to all the input images, and taking the face detection frame corresponding to each video frame conforming to a preset face screening rule as a final face detection frame of the frame; and calculating a feature result corresponding to each final face detection frame according to the lip contour and the face key points in the final face detection frames, inputting a plurality of feature results into a preset speaking recognition model for recognition, and obtaining a speaking recognition result corresponding to the to-be-recognized person. According to the speech recognition method based on video analysis, the obtained speech recognition result is more accurate, and the speech recognition method can adapt to different forms when the person to be recognized speaks.

Description

technical field [0001] The invention relates to the field of intelligent interaction, in particular to a speech recognition method, system, device and medium based on video analysis. Background technique [0002] In the field of intelligent interaction, when starting the intelligent interaction system, it is necessary to first determine whether the person to be recognized is speaking. When the person to be recognized is speaking, the intelligent interaction system starts the sound pickup function and executes the subsequent voice interaction function. At present, in the field of intelligent interaction, the judgment of whether the person to be recognized is speaking is based on lip feature points combined with simple threshold analysis to judge whether to speak or to judge whether to speak through audio analysis combined with lip feature analysis. The threshold analysis of the above speech recognition process cannot achieve the robustness of the model and is not suitable for...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00
CPCG06V40/165G06V40/171G06V40/176Y02D10/00
Inventor 黄欢尹士朝
Owner GRG INTELLIGENT TECH SOLUTION CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products