Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition method and device, computer equipment and storage medium

A speech recognition and phoneme recognition technology, which is applied in speech recognition, speech analysis, instruments, etc., can solve the problems of affecting speech recognition accuracy, increasing deletion errors, and increasing error rate in decoding process, so as to improve recognition accuracy and reduce deletion Error, likelihood-reducing effect

Pending Publication Date: 2021-10-22
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the RNN-T model introduces the concept of empty output in the phoneme recognition process, that is, it predicts that a speech frame does not contain a valid phoneme. The introduction of empty output will lead to an increase in the error rate of the subsequent decoding process in some application scenarios. , especially leading to an increase in deletion errors, affecting the accuracy of speech recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method and device, computer equipment and storage medium
  • Speech recognition method and device, computer equipment and storage medium
  • Speech recognition method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

[0057] Before describing the various embodiments shown in this application, several concepts involved in this application are firstly introduced:

[0058] 1) Artificial Intelligence (AI)

[0059] AI is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a speech recognition method and device, computer equipment and a storage medium, and relates to the technical field of speech recognition. The method comprises the following steps: processing a speech signal through an acoustic model to obtain a phoneme recognition result corresponding to each speech frame in the speech signal; suppressing and adjusting the probability of null output in the phoneme recognition result corresponding to each speech frame so as to reduce the ratio of the probability of null output in the phoneme recognition result to the probability of each phoneme; and inputting the adjusted phoneme recognition result corresponding to each speech frame into a decoded image to obtain a recognition text sequence corresponding to the speech signal. According to the scheme, the recognition accuracy of the model can be improved in a speech recognition scene in the field of artificial intelligence.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition method, device, computer equipment and storage medium. Background technique [0002] Speech recognition is a technology for recognizing speech as text, which has a wide range of applications in various artificial intelligence (AI) scenarios. [0003] The speech recognition framework usually includes an acoustic model part and a decoding part, wherein the acoustic model part is used to recognize the phonemes of each speech frame in the input speech signal, and the decoding part outputs the text of the speech signal through the recognized phonemes of each speech frame sequence. In related technologies, implementing an acoustic model through a Recurrent Neural Network Transducer (RNN-T) is one of the focuses of research in the industry. [0004] However, the RNN-T model introduces the concept of empty output in the phoneme recognition proc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L15/26
CPCG10L15/02G10L15/063G10L15/26G10L2015/025G10L15/16G10L15/20G10L15/183G10L15/08G10L15/187
Inventor 孙思宁
Owner TENCENT TECH (SHENZHEN) CO LTD
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More