Speech recognition method and device, computer equipment and storage medium

A speech recognition and phoneme recognition technology, which is applied in speech recognition, speech analysis, instruments, etc., can solve the problems of affecting speech recognition accuracy, increasing deletion errors, and increasing error rate in decoding process, so as to improve recognition accuracy and reduce deletion Error, likelihood-reducing effect

Pending Publication Date: 2021-10-22
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the RNN-T model introduces the concept of empty output in the phoneme recognition process, that is, it predicts that a speech frame does not contain a valid phoneme. The introduction of empty outp

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method and device, computer equipment and storage medium
  • Speech recognition method and device, computer equipment and storage medium
  • Speech recognition method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0056] Exemplary embodiments will be described in detail herein, and examples are illustrated in the drawings. The following description is related to the drawings, unless otherwise indicated, the same figures in the different drawings represent the same or similar elements. The embodiment described in the exemplary embodiments is not meant to all embodiments consistent with the present application. Instead, they are only examples of apparatus and methods consistent with some aspects of the present application as detailed in the appended claims.

[0057] Before the various embodiments shown in the present application, the few concepts involved in the present application are introduced:

[0058] 1) Artificial Intelligence (AI)

[0059] AI is the use of digital computer or digital computer controlled machine simulation, extension, and expanding people's intelligence, perceived environments, access to knowledge and use knowledge to achieve best results. In other words, artificial int...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a speech recognition method and device, computer equipment and a storage medium, and relates to the technical field of speech recognition. The method comprises the following steps: processing a speech signal through an acoustic model to obtain a phoneme recognition result corresponding to each speech frame in the speech signal; suppressing and adjusting the probability of null output in the phoneme recognition result corresponding to each speech frame so as to reduce the ratio of the probability of null output in the phoneme recognition result to the probability of each phoneme; and inputting the adjusted phoneme recognition result corresponding to each speech frame into a decoded image to obtain a recognition text sequence corresponding to the speech signal. According to the scheme, the recognition accuracy of the model can be improved in a speech recognition scene in the field of artificial intelligence.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition method, device, computer equipment and storage medium. Background technique [0002] Speech recognition is a technology for recognizing speech as text, which has a wide range of applications in various artificial intelligence (AI) scenarios. [0003] The speech recognition framework usually includes an acoustic model part and a decoding part, wherein the acoustic model part is used to recognize the phonemes of each speech frame in the input speech signal, and the decoding part outputs the text of the speech signal through the recognized phonemes of each speech frame sequence. In related technologies, implementing an acoustic model through a Recurrent Neural Network Transducer (RNN-T) is one of the focuses of research in the industry. [0004] However, the RNN-T model introduces the concept of empty output in the phoneme recognition proc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/06G10L15/26
CPCG10L15/02G10L15/063G10L15/26G10L2015/025G10L15/16G10L15/20G10L15/183G10L15/187
Inventor 孙思宁
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products