Intelligent Speech Recognition Method Based on Three-Level Feature Acquisition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An intelligent voice, three-level feature technology, applied in the computer field, can solve the problems of insufficient recognition accuracy, difficult recognition, and inability to accurately distinguish voices with small differences.

Active Publication Date: 2021-04-09

广州仿真机器人有限公司

View PDF14 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, traditional speech recognition schemes still have deficiencies in their recognition accuracy, for example, they cannot accurately distinguish speech with small differences (for example, for retroflex and flat tongue, when the speaker's pronunciation is lighter and more ambiguous, the traditional Speech recognition schemes are difficult to accurately identify)

Therefore, the recognition accuracy of traditional speech recognition schemes needs to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0044] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0045] refer to figure 1 , the embodiment of the present application provides an intelligent speech recognition method based on three-level feature collection, comprising the following steps:

[0046] S1. Using a preset sound collection device to collect and process the speaker's sound, so as to obtain the first sound signal in the first time window;

[0047] S2. Using a preset image acquisition device to perform image acquisition processing on the lips of the speaker, so as to obtain a second image signal within the first time window;

[0048] S3. Send a signal acquisit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present application discloses an intelligent speech recognition method, device, computer equipment and storage medium based on three-level feature collection. The method includes: performing sound collection and processing to obtain a first sound signal; performing image processing on the speaker's lips Acquisition and processing to obtain the second image signal; sending a signal acquisition request to the intraoral sensor cluster; acquiring the third sensing signal set sent by the intraoral sensor cluster; combining the first sound signal, the second sensing signal subset and the third sensing signal set Input the sensing signal subset into the first semantic recognition model to obtain the first recognized text; input the second image signal, the first sensing signal subset and the second sensing signal subset into the second semantic recognition model to obtain second recognition text; calculate the text similarity value between the first recognition text and the second recognition text; if the text similarity value is greater than the text similarity threshold, then use the first recognition text as the intelligent speech recognition result.

Description

technical field [0001] The present application relates to the computer field, in particular to an intelligent speech recognition method, device, computer equipment and storage medium based on three-level feature collection. Background technique [0002] Speech recognition technology is used to recognize collected speech, which has been widely used in various fields, such as in the field of intelligent robots. Due to the application of speech recognition technology, voice communication between natural people and intelligent robots has become possible. However, the recognition accuracy of traditional speech recognition schemes still has deficiencies, such as the inability to accurately distinguish speech with small differences (for example, for retroflex and flat tongue, when the speaker's pronunciation is lighter and more ambiguous, the traditional Speech recognition schemes are difficult to accurately identify). Therefore, the recognition accuracy of traditional speech reco...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/26G10L15/16G10L15/25G10L15/32G06F40/194G06F40/30G06K9/00G06K9/62G06N3/04

CPCG10L15/26G10L15/16G10L15/25G10L15/32G06F40/194G06F40/30G06V40/20G06N3/044G06N3/045G06F18/251

Inventor 罗绍远

Owner 广州仿真机器人有限公司

Intelligent Speech Recognition Method Based on Three-Level Feature Acquisition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology