
Speech recognition method and device, electronic equipment and storage medium

A speech recognition technology, applied in the field of deep learning, that addresses incomplete acquisition of speech information, highly restrictive methods for judging completeness, and low speech recognition accuracy. It avoids truncating speech information and improves recognition accuracy.

Active Publication Date: 2021-02-19
BEIJING BAIDU NETCOM SCI & TECH CO LTD
4 Cites, 3 Cited by

AI Technical Summary

Problems solved by technology

[0003] In related technologies, to obtain complete voice information, tail point detection is performed on the voice information; that is, the pause duration of the voice information, which can also be understood as the silence duration, is detected. When the pause duration reaches a fixed value, the complete voice information is considered to have been obtained. Obviously, this way of determining whether the voice information is complete is highly restrictive, which may lead to incomplete acquisition of voice information and low speech recognition accuracy.
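The fixed-threshold tail point detection described above can be sketched as follows. This is only an illustration of the general idea, not the related art's actual implementation: the function name, the per-frame `is_speech` flags, the 10 ms frame length, and the 600 ms threshold are all assumptions.

```python
def fixed_tail_point(is_speech, frame_ms=10, pause_threshold_ms=600):
    """Return the index of the frame where the utterance is declared complete.

    is_speech is a sequence of booleans, one per audio frame (assumed to come
    from some voice activity detector). Returns None if the pause never
    reaches the fixed threshold.
    """
    silence_ms = 0
    heard_speech = False
    for i, speech in enumerate(is_speech):
        if speech:
            heard_speech = True
            silence_ms = 0  # any speech resets the pause timer
        elif heard_speech:
            silence_ms += frame_ms
            if silence_ms >= pause_threshold_ms:
                return i  # fixed threshold reached: stop listening here
    return None


# A speaker who pauses for 700 ms mid-sentence is cut off before resuming.
frames = [True] * 30 + [False] * 70 + [True] * 20
cut_at = fixed_tail_point(frames)
```

Because the threshold is the same for every utterance, the 20 frames of speech after the pause are never captured, which is the truncation problem the paragraph describes.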


Examples


Example 1

[0042] In this example, a correspondence between semantic completeness and monitoring duration is preset. The preset correspondence is then queried to obtain the monitoring duration that corresponds to the semantic completeness of the target voice information.
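A preset correspondence of this kind could be sketched as a small lookup table. The bucket boundaries, the durations, the 0 to 1 completeness scale, and the choice that higher completeness maps to a shorter wait are all illustrative assumptions; the text above does not give concrete values.

```python
import bisect

# Hypothetical preset correspondence (values are assumptions, not from the patent):
# lower bound of each completeness bucket -> monitoring duration in milliseconds.
COMPLETENESS_BOUNDS = [0.0, 0.3, 0.6, 0.9]
MONITORING_MS = [1500, 1000, 600, 300]


def lookup_monitoring_duration(completeness):
    """Query the preset table for the monitoring duration of a completeness score."""
    if not 0.0 <= completeness <= 1.0:
        raise ValueError("completeness is assumed to lie in [0, 1]")
    # Find the last bucket whose lower bound does not exceed the score.
    idx = bisect.bisect_right(COMPLETENESS_BOUNDS, completeness) - 1
    return MONITORING_MS[idx]


short_wait = lookup_monitoring_duration(0.95)  # nearly complete utterance
long_wait = lookup_monitoring_duration(0.10)   # clearly unfinished utterance
```

A table keeps the mapping easy to tune per product, at the cost of step changes at the bucket boundaries.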

Example 2

[0044] In this example, a baseline semantic integrity corresponding to a monitoring duration baseline value is preset, where the monitoring duration baseline value can be understood as the preset default monitoring duration. The difference between the semantic integrity of the current target voice information and the baseline semantic integrity is calculated, and a monitoring duration adjustment value is determined from this difference, where the semantic integrity difference is inversely proportional to the monitoring duration adjustment value. The sum of the monitoring duration adjustment value and the monitoring duration baseline value is then calculated and used as the monitoring duration.
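The baseline-plus-adjustment computation might look like the sketch below. The text does not give a formula, so this reads "inversely proportional" loosely as a negative linear relation: the further the semantic integrity exceeds the baseline, the more the duration shrinks. The function name, the default values, the gain, and the lower floor are all assumptions.

```python
def monitoring_duration_from_baseline(
    integrity,
    baseline_integrity=0.5,  # assumed baseline semantic integrity
    baseline_ms=800.0,       # assumed default monitoring duration
    gain_ms=1000.0,          # assumed ms of adjustment per unit of difference
    min_ms=200.0,            # assumed floor so the wait never drops to zero
):
    """Return baseline duration plus an adjustment driven by the integrity difference."""
    difference = integrity - baseline_integrity
    # Assumption: a larger positive difference yields a more negative adjustment.
    adjustment_ms = -gain_ms * difference
    return max(min_ms, baseline_ms + adjustment_ms)


at_baseline = monitoring_duration_from_baseline(0.5)  # no adjustment applied
complete = monitoring_duration_from_baseline(1.0)     # shorter wait
incomplete = monitoring_duration_from_baseline(0.2)   # longer wait
```

Compared with a lookup table, this form changes the duration smoothly and needs only a baseline pair and a gain to configure.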

[0045] Step 104, if no voice information is detected within the monitoring period, perform voice recognition according to the target voice information.

[0046] In this embodiment, if no voice information is detected within the monitoring period, it indicates that the user has finish...
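The check in step 104 can be sketched as a test over the audio frames that follow the target voice information. The function name, the per-frame boolean flags, and the 10 ms frame length are assumptions, and what happens when new voice is detected is left to the caller because the text here does not describe it.

```python
import math


def silence_window_elapsed(frames_after_target, monitoring_ms, frame_ms=10):
    """Return True if the whole monitoring window passed with no voice detected.

    frames_after_target holds one boolean per frame received after the target
    voice information (True means voice was detected in that frame).
    """
    frames_needed = math.ceil(monitoring_ms / frame_ms)
    window = list(frames_after_target[:frames_needed])
    if any(window):
        return False  # voice arrived inside the window: do not recognize yet
    # Only report success once enough silent frames have actually been seen.
    return len(window) >= frames_needed


# The same 400 ms of trailing silence, judged under two monitoring durations.
trailing = [False] * 40 + [True] * 10
recognize_now = silence_window_elapsed(trailing, monitoring_ms=300)
keep_listening = not silence_window_elapsed(trailing, monitoring_ms=600)
```

With a 300 ms window recognition starts on the target voice information, while a 600 ms window catches the resumed speech, which is why the choice of monitoring duration matters.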



Abstract

The invention discloses a speech recognition method and device, electronic equipment and a storage medium, and relates to the technical field of deep learning and the technical field of speech within the technical field of artificial intelligence. The method comprises the steps of: in response to obtained target speech information, obtaining state information and context information of an application corresponding to the target speech information; calculating the semantic integrity of the target speech information according to the state information and the context information; determining a monitoring duration corresponding to the semantic integrity, and monitoring for speech information within the monitoring duration; and if no speech information is detected within the monitoring duration, performing speech recognition according to the target speech information. In this way, the semantic integrity of the acquired speech information is determined from multi-dimensional parameters, and the duration of speech detection is flexibly adjusted according to the semantic integrity, so that the speech information is not cut off and speech recognition accuracy is improved.

Description

Technical field

[0001] The present application relates to the field of deep learning technology and the field of speech technology within the field of artificial intelligence technology, and in particular to a speech recognition method, device, electronic equipment and storage medium.

Background art

[0002] With the development of artificial intelligence technology, smart home products such as smart speakers and smart robots have also been developed. Users can control the operation of related products through voice information input; for example, a speaker performs operations such as opening a music application.

[0003] In related technologies, in order to obtain complete voice information, tail point detection is performed on the voice information; that is, the pause duration of the voice information, which can also be understood as the silence duration, is detected. When the pause duration reaches a fixed value, it is considered that the complete voice information has been obtained. Voice...


Application Information

IPC(8): G10L15/04, G10L15/183, G10L15/26, G10L15/16
CPC: G10L15/04, G10L15/183, G10L15/16, G10L15/22, G10L2015/228, G10L25/78, G10L2025/783, G10L15/02, G10L15/08, G10L15/1815
Inventor 吴震, 周茂仁, 王知践, 崔亚峰, 吴玉芳, 瞿琴, 刘兵, 革家象
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD