Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition model training and speech recognition method and device

A speech recognition model and recognition model technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of low accuracy of streaming recognition, reduce the amount of calculation and delay, and improve recognition efficiency and recognition accuracy. Effect

Active Publication Date: 2021-07-16
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF10 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, compared with non-streaming speech recognition, the accuracy of streaming recognition is relatively lower because it needs to start recognition before a sentence or a paragraph is finished.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition model training and speech recognition method and device
  • Speech recognition model training and speech recognition method and device
  • Speech recognition model training and speech recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0025] figure 1 is a schematic diagram according to the first embodiment of the present disclosure. Such as figure 1 As shown, the training method of the speech recognition model of the present embodiment may specifically include the following steps:

[0026] S101. Obtain training data, the training data includes a plurality of voice data and a label sequence of each...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech recognition model training and speech recognition method and relates to the technical field of deep learning and speech processing. The training method of the speech recognition model comprises steps of obtaining training data; constructing a neural network model comprising a first recognition model and a second recognition model; inputting each voice data as a first input sequence into a first recognition model, and obtaining a second input sequence of each voice data according to a first output sequence and a feature sequence outputted by the first recognition model for each voice data; and training a second recognition model according to the second input sequence and the tag sequence of each piece of voice data till the second recognition model converges, and taking the first recognition model and the trained second recognition model as a voice recognition model. The voice recognition method comprises the following steps of acquiring to-be-recognized voice data; and taking the to-be-recognized voice data as the input of the voice recognition model, and taking the output result of the voice recognition model as the recognition result of the to-be-recognized voice data.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, in particular to the technical fields of deep learning and speech processing. Provided are a speech recognition model training and speech recognition method, device, electronic equipment and readable storage medium. Background technique [0002] Speech recognition is the conversion of sound signals into corresponding texts, which is one of the most important ways to realize human-computer interaction. In recent years, with the great improvement in speech recognition accuracy and the continuous popularization of smart devices, speech input has become one of the main ways of text input, and speech interaction has also been applied in more and more scenarios. The response speed and accuracy of speech recognition are key factors affecting the user experience of speech input and speech interaction. [0003] In terms of scenarios, speech recognition can be divided into streaming scen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06G10L15/26G10L15/02G10L15/28G10L15/16
CPCG10L15/063G10L15/26G10L15/02G10L15/28G10L15/16G10L2015/0631
Inventor 梁鸣心付晓寅邵俊尧
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products