Speech recognition method and device, electronic equipment and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for speech recognition and target speech, applied in the field of deep learning, can solve the problems of incomplete acquisition of speech information, highly restrictive methods, and low speech recognition accuracy, and achieve the effect of avoiding truncation of speech information and improving accuracy.

Active Publication Date: 2021-02-19

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF4 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] In related technologies, in order to obtain complete voice information, tail point detection is performed on the voice information, that is, to detect the pause duration of the voice information, which can also be understood as the silence duration. When the pause duration reaches a fixed value, it is considered that the complete voice information has been obtained. Voice information. Obviously, this method of determining whether the voice information is complete or not is highly restrictive, which may lead to incomplete acquisition of voice information and low accuracy of voice recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

example 1

[0042] In this example, the correspondence between the semantic completeness and the monitoring duration is preset, so that the preset correspondence is queried to obtain the monitoring duration corresponding to the semantic completeness.

example 2

[0044]In this example, the baseline semantic integrity corresponding to the monitoring duration baseline value is preset. The monitoring duration baseline value can be understood as the preset default monitoring duration, and the semantics of the current target voice information and the voice integrity of the baseline semantic integrity are calculated. Difference, according to the difference to determine the monitoring duration adjustment value, wherein the semantic difference is inversely proportional to the monitoring duration adjustment value, calculate the sum of the monitoring duration adjustment value and the monitoring duration reference value, and use the sum as the monitoring duration .

[0045] Step 104, if no voice information is detected within the monitoring period, perform voice recognition according to the target voice information.

[0046] In this embodiment, if no voice information is detected within the monitoring period, it indicates that the user has finish...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech recognition method and device, electronic equipment and a storage medium, and relates to the technical field of deep learning and the technical field of speech in thetechnical field of artificial intelligence; and the method comprises the steps: obtaining state information and context information of an application corresponding to target speech information in response to the obtained target speech information; calculating semantic integrity of the target voice information according to the state information and the context information; determining a monitoringduration corresponding to the semantic integrity, and monitoring the speech information in the monitoring duration; and if the speech information is not monitored within the monitoring duration, performing speech recognition according to the target speech information. Therefore, the semantic integrity of the acquired speech information is determined according to the multi-dimensional parameters, the duration of detecting the speech information is flexibly adjusted according to the semantic integrity, the speech information is prevented from being cut off, and the speech recognition accuracy isimproved.

Description

technical field [0001] The present application relates to the field of deep learning technology and the field of speech technology in the field of artificial intelligence technology, and in particular to a speech recognition method, device, electronic equipment and storage medium. . Background technique [0002] With the development of artificial intelligence technology, smart home products such as smart speakers and smart robots have also been developed. Users can control the work of related products based on voice information input. The speaker performs operations such as opening a music application. [0003] In related technologies, in order to obtain complete voice information, tail detection is performed on the voice information, that is, to detect the pause duration of the voice information, which can also be understood as the silence duration. When the pause duration reaches a fixed value, it is considered that the complete voice information has been obtained. Voice...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/04G10L15/183G10L15/26G10L15/16

CPCG10L15/04G10L15/183G10L15/16G10L15/22G10L2015/228G10L25/78G10L2025/783G10L15/02G10L15/08G10L15/1815

Inventor吴震周茂仁王知践崔亚峰吴玉芳瞿琴刘兵革家象

OwnerBEIJING BAIDU NETCOM SCI & TECH CO LTD

Speech recognition method and device, electronic equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

example 1

example 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology