Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech recognition model training and speech recognition method and device

A speech recognition model and recognition model technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of low accuracy of stream recognition, reduce the amount of calculation and delay, and improve recognition efficiency and accuracy Effect

Active Publication Date: 2022-01-28
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, compared with non-streaming speech recognition, the accuracy of streaming recognition is relatively lower because it needs to start recognition before a sentence or a paragraph is finished.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition model training and speech recognition method and device
  • Speech recognition model training and speech recognition method and device
  • Speech recognition model training and speech recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0025] figure 1 is a schematic diagram according to the first embodiment of the present disclosure. Such as figure 1 As shown, the training method of the speech recognition model of the present embodiment may specifically include the following steps:

[0026] S101. Obtain training data, the training data includes a plurality of voice data and a label sequence of each...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The disclosure discloses a speech recognition model training and a speech recognition method, and relates to the technical fields of deep learning and speech processing. The training method of the speech recognition model comprises: obtaining training data; Constructing the neural network model comprising the first recognition model and the second recognition model; Inputting each speech data as the first input sequence into the first recognition model, according to the first recognition model for each The first output sequence and feature sequence of voice data output are obtained the second input sequence of each voice data; according to the second input sequence and label sequence of each voice data, the second recognition model is trained until the second recognition model converges, and the first A recognition model and a trained second recognition model are used as speech recognition models. The speech recognition method includes: acquiring speech data to be recognized; using the speech data to be recognized as an input of a speech recognition model, and using the output result of the speech recognition model as a recognition result of the speech data to be recognized.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, in particular to the technical fields of deep learning and speech processing. Provided are a speech recognition model training and speech recognition method, device, electronic equipment and readable storage medium. Background technique [0002] Speech recognition is the conversion of sound signals into corresponding texts, which is one of the most important ways to realize human-computer interaction. In recent years, with the great improvement in speech recognition accuracy and the continuous popularization of smart devices, speech input has become one of the main ways of text input, and speech interaction has also been applied in more and more scenarios. The response speed and accuracy of speech recognition are key factors affecting the user experience of speech input and speech interaction. [0003] In terms of scenarios, speech recognition can be divided into streaming scen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/26G10L15/02G10L15/28G10L15/16
CPCG10L15/063G10L15/26G10L15/02G10L15/28G10L15/16G10L2015/0631
Inventor 梁鸣心付晓寅邵俊尧
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More