Intelligent Speech Recognition Method Based on Three-Level Feature Acquisition

An intelligent speech recognition technique based on three-level feature acquisition, applied in the computer field, that addresses insufficient recognition accuracy, difficult recognition, and the inability to accurately distinguish speech sounds with small differences.

Active Publication Date: 2021-04-09
广州仿真机器人有限公司

AI Technical Summary

Problems solved by technology

Traditional speech recognition schemes still fall short in recognition accuracy. For example, they cannot reliably distinguish speech sounds that differ only slightly: for retroflex versus flat-tongue consonants, when the speaker's pronunciation is light or ambiguous, traditional schemes struggle to identify the sound correctly.
The recognition accuracy of traditional speech recognition schemes therefore needs to be improved.



Examples


Detailed Description of Embodiments

[0044] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0045] Referring to Figure 1, an embodiment of the present application provides an intelligent speech recognition method based on three-level feature collection, comprising the following steps:

[0046] S1. Use a preset sound collection device to capture the speaker's voice, so as to obtain a first sound signal within a first time window;

[0047] S2. Use a preset image acquisition device to capture images of the speaker's lips, so as to obtain a second image signal within the first time window;

[0048] S3. Send a signal acquisit...


Abstract

The present application discloses an intelligent speech recognition method, device, computer equipment and storage medium based on three-level feature collection. The method includes: collecting the speaker's voice to obtain a first sound signal; capturing images of the speaker's lips to obtain a second image signal; sending a signal acquisition request to an intraoral sensor cluster; acquiring a third sensing signal set sent by the intraoral sensor cluster; inputting the first sound signal, the second sensing signal subset and the third sensing signal subset together into a first semantic recognition model to obtain a first recognized text; inputting the second image signal, the first sensing signal subset and the second sensing signal subset into a second semantic recognition model to obtain a second recognized text; calculating a text similarity value between the first recognized text and the second recognized text; and, if the text similarity value is greater than a text similarity threshold, using the first recognized text as the intelligent speech recognition result.
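The final decision step of the abstract (compare the two recognized texts, then accept the first one only if they agree closely enough) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the similarity metric (`difflib.SequenceMatcher`), the function names `text_similarity` and `select_result`, and the default threshold of 0.8 are all assumptions, since the visible text names neither a metric nor a threshold value. The two semantic recognition models are omitted; their output texts are passed in directly.

```python
from difflib import SequenceMatcher
from typing import Optional


def text_similarity(first_text: str, second_text: str) -> float:
    """Return a similarity score in [0, 1].

    SequenceMatcher is a stand-in metric chosen for this sketch;
    the source does not say how the similarity value is computed.
    """
    return SequenceMatcher(None, first_text, second_text).ratio()


def select_result(
    first_text: str,
    second_text: str,
    threshold: float = 0.8,  # hypothetical value, not taken from the source
) -> Optional[str]:
    """Return first_text when the two recognized texts agree closely enough.

    Returns None otherwise; the visible text does not describe what
    happens when the similarity is at or below the threshold.
    """
    if text_similarity(first_text, second_text) > threshold:
        return first_text
    return None


if __name__ == "__main__":
    # first_text: output of the model fed with sound + sensor subsets
    # second_text: output of the model fed with lip images + sensor subsets
    print(select_result("turn on the light", "turn on the light"))
    print(select_result("turn on the light", "play some music"))
```

Returning `None` on disagreement is a placeholder choice that leaves the fallback behaviour to the caller.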

Description

Technical field

[0001] The present application relates to the computer field, in particular to an intelligent speech recognition method, device, computer equipment and storage medium based on three-level feature collection.

Background

[0002] Speech recognition technology recognizes collected speech and is widely used in many fields, for example intelligent robots, where it makes voice communication between people and robots possible. However, the recognition accuracy of traditional speech recognition schemes still falls short. For example, they cannot reliably distinguish speech sounds that differ only slightly: for retroflex versus flat-tongue consonants, when the speaker's pronunciation is light or ambiguous, traditional schemes struggle to identify the sound correctly. Therefore, the recognition accuracy of traditional speech reco...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/26, G10L15/16, G10L15/25, G10L15/32, G06F40/194, G06F40/30, G06K9/00, G06K9/62, G06N3/04
CPC: G10L15/26, G10L15/16, G10L15/25, G10L15/32, G06F40/194, G06F40/30, G06V40/20, G06N3/044, G06N3/045, G06F18/251
Inventor: 罗绍远
Owner: 广州仿真机器人有限公司