Speech recognition result correction method, device, equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and text recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem that the accuracy of acoustic model recognition needs to be further improved, and achieve the effect of improving the accuracy.

Active Publication Date: 2020-11-20

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, the recognition accuracy of the acoustic model needs to be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0030] figure 1 It is a flow chart of a speech recognition result correcting method provided in Embodiment 1 of the present invention. This embodiment is applicable to the situation of correcting speech recognition results, and the method can be executed by the speech recognition result correcting device provided in the embodiment of the present invention , the device may be implemented in software and / or hardware, and the device may be integrated in the terminal device or in an application end of the terminal device. Wherein, the terminal device may be, but not limited to, a mobile terminal (tablet computer or smart phone).

[0031] Wherein, the application end may be a plug-in of a certain client embedded in the terminal device, or a plug-in of the operating system of the terminal device, and the speech recognition result embedded in the terminal device may correct the operation of the client or the terminal device. The voice recognition result correction application progra...

Embodiment 2

[0044] Figure 2A It is a flow chart of a method for correcting speech recognition results provided by Embodiment 2 of the present invention. This embodiment is optimized on the basis of the above-mentioned embodiments. In this embodiment, the step is to further use the neural machine translation NMT model to identify and correct the initial text information, and the final text recognition result is optimized as follows: the initial text information contains The text is segmented to obtain at least one word; the word is encoded into a dense vector by the encoder in the NMT model, and the dense vector is decoded by the decoder in the NMT model to obtain the final text recognition result.

[0045] Correspondingly, such as Figure 2A As shown, the method of this embodiment specifically includes:

[0046] S201. Perform speech recognition on the acquired speech data to obtain initial text information.

[0047] S202. Segment the text included in the initial text information to ob...

Embodiment 3

[0055] Figure 3A It is a flow chart of a method for correcting speech recognition results provided by Embodiment 3 of the present invention. This embodiment is optimized on the basis of the above-mentioned embodiments. In this embodiment, further steps are to encode words into dense vectors through the encoder in the NMT model, and decode the dense vectors through the decoder in the NMT model to obtain The final text recognition result is optimized as follows: convert at least one word into a source hidden state vector through the encoder in the NMT model; input the source hidden state vector into the decoder in the NMT model, and output the target hidden state vector through the decoder in the NMT model State vector; determine the hidden state vector of the attention mechanism according to the target hidden state vector and the source hidden state vector; obtain the final text recognition result according to the hidden state vector of the attention mechanism.

[0056] Corre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method and apparatus for correcting a speech recognition result, a device and a computer-readable storage medium are provided. The method includes performing speech recognition on acquired speech data to obtain initial text information; and recognizing and correcting the initial text information by a neural machine translation NMT model to obtain a final text recognition result.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of speech recognition, and in particular, to a method, device, device, and storage medium for correcting speech recognition results. Background technique [0002] With the rapid improvement of computer processing capabilities, speech recognition technology has been developed rapidly. Speech recognition technology is a technology that converts speech signals into corresponding text or commands through the process of recognition and analysis. The application of speech recognition technology is changing the production and life style of human beings day by day, and is widely used in fields such as speech input system, speech control system and intelligent dialogue query system. [0003] As the most natural way of interaction, voice interaction is increasingly popularized, and the requirements for the accuracy of voice recognition are getting higher and higher. At present, the speech recogn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/22G10L15/26

CPCG10L15/22G10L15/26G10L15/16G10L15/04G10L15/063

Inventor 黄俊李先刚

Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speech recognition result correction method, device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology