Speech recognition method and device, computer equipment and storage medium
A speech recognition and phoneme recognition technology, applied in speech recognition, speech analysis, instruments, etc., which can solve the problems that the empty output increases deletion errors and the error rate in the decoding process and thereby affects speech recognition accuracy, so as to achieve the effects of improving recognition accuracy and reducing deletion errors.
Pending Publication Date: 2021-10-22
TENCENT TECH (SHENZHEN) CO LTD
AI-Extracted Technical Summary
Problems solved by technology
[0004] However, the RNN-T model introduces the concept of the empty output into the phoneme recognition process, that is, it may predict that a speech frame contains no valid phoneme. The introduction of the empty outp...
Method used
Through the scheme shown in this application, on the one hand, the Transducer-based end-to-end model does not need frame-level alignment information during training, which greatly simplifies the modeling process; on the other hand, the decoding graph is simplified, reducing the search space: because phoneme modeling is used, only the lexicon (L) and the language model (G) are needed to build the decoding graph, so the search space is greatly reduced. Finally, phoneme modeling combined with a custom decoding graph can meet flexible customization requirements: for different business scenarios, only the language model needs to be customized to adapt to the respective business scene, without changing the acoustic model.
[0126] In the above Transducer model, in order to describe the model's historical information, the Encoder and Predictor networks generally adopt a recurrent neural network (RNN) structure, such as an LSTM or a gated recurrent unit (GRU). However, on embedded devices with limited computing resources, a recurrent neural network entails a large amount of calculation and occupies a large amount of CPU resources. On the other hand, the content of vehicle-mounted offline speech recognition consists mainly of query and control instructions, and the sentences are relatively short, without overly long historical information. In this regard, this scheme uses an FSMN-based Encoder and a one-dimensional-convolution-based Predictor network. On the one hand, model parameters can be compressed; on the other hand, computing resources can be greatly saved, computing speed can be improved, and real-time performance of speech recognition can be...
Abstract
The invention relates to a speech recognition method and device, computer equipment, and a storage medium, and relates to the technical field of speech recognition. The method comprises the following steps: processing a speech signal through an acoustic model to obtain a phoneme recognition result corresponding to each speech frame in the speech signal; suppressing and adjusting the probability of the empty output in the phoneme recognition result corresponding to each speech frame, so as to reduce the ratio of the probability of the empty output to the probability of each phoneme in the phoneme recognition result; and inputting the adjusted phoneme recognition result corresponding to each speech frame into a decoding graph to obtain a recognized text sequence corresponding to the speech signal. According to the scheme, the recognition accuracy of the model can be improved in speech recognition scenarios in the field of artificial intelligence.
Examples
Experimental program (1)
Example Embodiment
[0056] Exemplary embodiments are described in detail here, with examples illustrated in the drawings. Where the following description refers to the drawings, the same numbers in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all embodiments consistent with the present application; rather, they are merely examples of apparatus and methods consistent with some aspects of the present application, as detailed in the appended claims.
[0057] Before describing the various embodiments of the present application, a few concepts involved in the present application are introduced:
[0058] 1) Artificial Intelligence (AI)
[0059] AI is a theory, method, technology, and application system that uses a digital computer or a machine controlled by a digital computer to simulate, extend, and expand human intelligence, perceive the environment, acquire knowledge, and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technology of computer science that attempts to understand the essence of intelligence and produce a new kind of intelligent machine that can react in a manner similar to human intelligence. Artificial intelligence studies the design principles and implementation methods of various intelligent machines, so that machines have the capabilities of perception, reasoning, and decision-making.
[0060] Artificial intelligence technology is a comprehensive discipline involving a wide range of fields, including both hardware-level and software-level technologies. Basic artificial intelligence technologies generally include sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. Artificial intelligence software technologies mainly include computer vision, speech processing, natural language processing, and machine learning / deep learning.
[0061] 2) Speech Technology (ST)
[0062] The key technologies of speech technology include Automatic Speech Recognition (ASR), Text-To-Speech (TTS), and voiceprint recognition. Enabling computers to listen, see, speak, and feel is the development direction of future human-computer interaction, and speech is regarded as one of the most promising modes of such interaction.
[0063] 3) Machine Learning (ML)
[0064] Machine learning is a multi-field interdisciplinary subject involving probability theory, statistics, approximation theory, convex analysis, algorithm complexity theory, and other disciplines. It specializes in studying how computers simulate or implement human learning behaviors to acquire new knowledge or skills and reorganize existing knowledge structures to continuously improve their own performance. Machine learning is the core of artificial intelligence and the fundamental way to make computers intelligent; it is applied in all fields of artificial intelligence. Machine learning and deep learning usually include techniques such as artificial neural networks, belief networks, reinforcement learning, transfer learning, inductive learning, and learning from instruction.
[0065] The scheme provided by the embodiments of the present application applies to scenarios involving artificial intelligence technologies such as speech technology and machine learning, to recognize user speech as corresponding text.
[0066] Please refer to Figure 1, which shows a system configuration diagram of a speech recognition system according to various embodiments of the present application. As shown in Figure 1, the system includes a sound acquisition assembly 120 and a speech recognition device 140.
[0067] The sound acquisition assembly 120 and the speech recognition device 140 are connected in a wired or wireless manner.
[0068] The sound acquisition assembly 120 can be implemented as a microphone, a microphone array or a pickup. The sound acquisition component 120 is used to collect voice data when the user speaks.
[0069] The speech recognition device 140 is configured to recognize the voice data acquired by the sound acquisition assembly 120 to obtain a recognized text sequence.
[0070] Optionally, the speech recognition device 140 can also perform natural language processing on the recognized text sequence to respond to the user's speech.
[0071] The sound acquisition assembly 120 and the speech recognition device 140 can be implemented as two independent hardware devices. For example, the sound acquisition assembly 120 is a microphone provided on a vehicle steering wheel, and the speech recognition device 140 can be an in-vehicle smart device; or the sound acquisition assembly 120 is a microphone disposed on a remote control, and the speech recognition device 140 can be a smart home device controlled by the remote control (such as a smart TV, set-top box, or air conditioner).
[0072] Alternatively, the sound acquisition assembly 120 and the speech recognition device 140 can be implemented as the same hardware device. For example, the speech recognition device 140 can be a mobile terminal such as a smartphone, a tablet, a smart watch, or smart glasses, and the sound acquisition assembly 120 can be a microphone built into the speech recognition device 140.
[0073] In a possible implementation, the speech recognition system may further include a server 160.
[0074] The server 160 can be used to deploy and update the speech recognition model in the speech recognition device 140. Alternatively, the server 160 can provide a speech recognition service to the speech recognition device 140, i.e., recognize the voice data transmitted by the speech recognition device 140 and return the recognition result to the speech recognition device 140. Alternatively, the server 160 can also cooperate with the speech recognition device 140 to complete the recognition of the voice data and the response to the voice data.
[0075] The server 160 is one server, a cluster composed of several servers, a virtualization platform, or a cloud computing service center.
[0076] The server can be a stand-alone physical server, a server cluster or distributed system composed of multiple physical servers, or a cloud server providing basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, middleware services, domain name services, security services, content delivery networks (CDN), and big data and artificial intelligence platforms.
[0077] The server 160 is connected to the voice recognition device 140 through a communication network. Optionally, the communication network is a wired network or a wireless network.
[0078] Optionally, the system can also include a management device (not shown in Figure 1) connected to the server 160 through a communication network. Optionally, the communication network is a wired or wireless network.
[0079] Optionally, the above wireless or wired network uses standard communication technologies and/or protocols. The network is usually the Internet, but can also be any other network, including but not limited to any combination of a local area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), a mobile, wired, or wireless network, a dedicated network, or a virtual private network. In some embodiments, the data exchanged over the network is represented using techniques and/or formats including HyperText Markup Language (HTML) and Extensible Markup Language (XML). In addition, conventional encryption techniques such as Secure Sockets Layer (SSL), Transport Layer Security (TLS), Virtual Private Network (VPN), and Internet Protocol Security (IPsec) can be used to encrypt all or some of the links. In other embodiments, custom and/or dedicated data communication techniques can also be used in place of, or in addition to, the above data communication techniques.
[0080] Please refer to Figure 2, which is a flowchart of a speech recognition method according to an exemplary embodiment. The method can be performed by a computer device; for example, the computer device can be the speech recognition apparatus 140 or the server 160 in the system shown in Figure 1, or the computer device may include both the speech recognition device 140 and the server 160 in that system. As shown in Figure 2, the speech recognition method can include the following steps:
[0081] Step 21: Process the speech signal through the acoustic model to obtain a phoneme recognition result corresponding to each speech frame in the speech signal; the phoneme recognition result indicates the probability distribution of the corresponding speech frame over the phoneme space; the phoneme space contains each phoneme and an empty output; the acoustic model is trained on speech signal samples and the actual phonemes of each speech frame in the speech signal samples.
[0082] A phoneme is the minimum speech unit divided according to the natural attributes of speech. It is analyzed according to the articulatory actions within a syllable, with one action constituting one phoneme. Phonemes are divided into two categories: vowels and consonants. For example, the Chinese syllable ā has only one phoneme, ài (love) has two phonemes, and dài (generation) has three phonemes.
[0083] A phoneme is the smallest unit constituting a syllable, or the smallest speech segment, i.e., the smallest linear speech unit divided from the perspective of sound quality. A phoneme is a concrete physical phenomenon. The International Phonetic Alphabet (developed by the International Phonetic Association to uniformly transcribe the speech sounds of all languages) is also known as the "International Phonetic Symbols" or "Universal Phonetic Symbols".
[0084] In the embodiments of the present application, for each speech frame in the speech signal, the acoustic model can perform phoneme recognition on the speech frame to obtain the probability that the speech frame belongs to each preset phoneme or to the empty output.
[0085] For example, in a possible implementation, the above phoneme space contains 212 phonemes and one empty output (indicating that the corresponding speech frame contains no user pronunciation); that is, for an input speech frame, the acoustic model of the embodiments of the present application can output the probabilities of the 212 phonemes and the empty output, respectively.
[0086] Step 22: Suppress and adjust the probability of the empty output in the phoneme recognition result of each speech frame, so as to reduce the ratio of the probability of the empty output to the probability of each phoneme in the phoneme recognition result.
[0087] Step 23: Input the adjusted phoneme recognition result corresponding to each speech frame into the decoding graph to obtain the recognized text sequence corresponding to the speech signal.
[0088] In the embodiments of the present application, after the phoneme recognition result is input into the decoding graph, the decoding graph determines, based on the probability of each element of the phoneme space in the phoneme recognition result, whether the result corresponds to a certain phoneme or to the empty output, and determines the corresponding text according to the determined phoneme; if the result is the empty output, it is determined that the speech frame corresponding to the phoneme recognition result contains no user pronunciation, i.e., it corresponds to no text.
[0089] Since the empty output is included in the above phoneme recognition result, the recognition error rate may rise; for example, a speech frame containing pronunciation may be misrecognized as an empty output (this is also referred to as a deletion error), which affects the accuracy of speech recognition. Therefore, in the scheme shown in the present application, after the acoustic model outputs the phoneme recognition result, the probability of the empty output in the phoneme recognition result is suppressed; as the probability of the empty output is suppressed, the probability that the recognition result is identified as one of the phonemes rises relatively, which can effectively reduce the likelihood that speech frames containing pronunciation are erroneously identified as the empty output.
[0090] In summary, in the scheme shown in the present application, for a phoneme recognition result that indicates the probability distribution of a speech frame over each phoneme and the empty output, before the phoneme recognition result is input into the decoding graph, the probability of the empty output in the phoneme recognition result is first suppressed, thereby reducing the likelihood that the speech frame is identified as an empty output, that is, reducing the deletion errors of the model, and thereby increasing the recognition accuracy of the model.
[0091] Please refer to Figure 3, which is a flowchart of a speech recognition method according to an exemplary embodiment. The method can be performed by a computer device; for example, the computer device can be the speech recognition apparatus 140 or the server 160 in the system shown in Figure 1, or the computer device may include both the speech recognition device 140 and the server 160 in that system. As shown in Figure 3, the speech recognition method can include the following steps:
[0092] Step 301: Acquire a speech signal, the speech signal comprising speech frames obtained by dividing the original voice.
[0093] In the embodiments of the present application, the original voice is the user's speech acquired by the sound acquisition component and sent to the computer device, for example, to the speech recognition device; the speech recognition device segments the original voice to obtain a number of speech frames.
[0094] In a possible implementation, the speech recognition device can divide the original voice into short, fixed-length speech clips. For example, for a sampling rate of 16 kHz, a single frame length of 25 ms and an inter-frame overlap of 15 ms can be used; this process is also known as "framing".
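The framing step described above can be sketched in a few lines; a minimal pure-Python illustration using the exact parameters quoted in the text (16 kHz sampling, 25 ms frames, 15 ms overlap, i.e. a 10 ms shift). The function name `frame_signal` is illustrative, not from the original.

```python
SAMPLE_RATE = 16000
FRAME_LEN = int(0.025 * SAMPLE_RATE)    # 400 samples per 25 ms frame
FRAME_SHIFT = int(0.010 * SAMPLE_RATE)  # 160 samples per 10 ms shift (15 ms overlap)

def frame_signal(samples):
    """Return a list of overlapping frames (each a list of samples)."""
    frames = []
    start = 0
    while start + FRAME_LEN <= len(samples):
        frames.append(samples[start:start + FRAME_LEN])
        start += FRAME_SHIFT
    return frames

# One second of audio yields floor((16000 - 400) / 160) + 1 = 98 frames.
frames = frame_signal([0.0] * SAMPLE_RATE)
```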
[0095] Step 302: Process the speech signal through the acoustic model to obtain a phoneme recognition result corresponding to each speech frame in the speech signal.
[0096] Wherein, the phoneme recognition result indicates the probability distribution of the corresponding speech frame over the phoneme space; the phoneme space contains each phoneme and an empty output; the acoustic model is trained on speech signal samples and the actual phonemes of each speech frame in the speech signal samples.
[0097] In the embodiments of the present application, the acoustic model is an end-to-end machine learning model whose input data includes a speech frame in the speech signal (e.g., a feature vector of the speech frame) and whose output data is the predicted probability distribution of the speech frame over the phoneme space, i.e., the phoneme recognition result.
[0098] For example, the above phoneme recognition result can be represented as a probability vector:
[0099] (p_0, p_1, p_2, ..., p_212)
[0100] In the above probability vector, p_0 indicates the probability that the speech frame corresponds to the empty output, and p_1 indicates the probability that the speech frame corresponds to the first phoneme; the entire phoneme space contains 212 phonemes plus one empty output.
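As an illustration of such a probability vector, the sketch below builds a 213-dimensional distribution (index 0 for the empty output, indices 1 to 212 for the phonemes) from toy scores with a softmax, which is how the joint network described later in the text produces its distribution. All numbers here are placeholders.

```python
import math

def softmax(scores):
    """Turn raw scores into a probability distribution (sums to 1)."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Toy 213-dimensional output: index 0 is the empty output, indices
# 1..212 are the phonemes, matching the phoneme space in the text.
scores = [0.0] * 213
scores[0] = 2.0  # the model leans towards the empty output here
probs = softmax(scores)
```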
[0101] In a possible implementation, processing the speech signal through the acoustic model to obtain the phoneme recognition result corresponding to each speech frame in the speech signal includes:
[0102] performing feature extraction on a target speech frame to obtain a feature vector of the target speech frame, the target speech frame being any one of the speech frames;
[0103] inputting the feature vector of the target speech frame into the encoder in the acoustic model to obtain an acoustic hidden-layer representation vector of the target speech frame;
[0104] inputting the historical recognized text of the target speech frame into the predictor in the acoustic model to obtain a text hidden-layer representation vector of the target speech frame; the historical recognized text of the target speech frame is the recognition result, by the decoding graph, of the first N non-empty-output speech frames preceding the target speech frame; N is an integer greater than or equal to 1;
[0105] inputting the acoustic hidden-layer representation vector of the target speech frame and the text hidden-layer representation vector of the target speech frame into the joint network to obtain the phoneme recognition result of the target speech frame.
[0106] In the embodiments of the present application, the above acoustic model can be realized by a Transducer model. The Transducer model is described below:
[0107] Given an input sequence:
[0108] x = (x_1, x_2, ..., x_T)
[0109] and an output sequence:
[0110] y = (y_1, y_2, ..., y_U)
[0111] where x belongs to X*, the set of all input sequences, and y belongs to Y*, the set of all output sequences, with X and Y denoting the input and output spaces, respectively. For example, in this scheme, the Transducer model is used to perform phoneme recognition: the input sequence x is a feature vector sequence, such as filter bank (FBank) features or Mel-frequency cepstral coefficients (MFCC), where x_t denotes the feature vector at time t; the output sequence y is a phoneme sequence, where y_u denotes the u-th phoneme.
[0112] Define an extended output space Ȳ = Y ∪ {∅}, where ∅ denotes the empty output symbol, representing that the model produces no output. After the empty output symbol is introduced, a sequence a over the extended space is equivalent to the output sequence y obtained by removing its empty outputs. In this scheme, because of the introduction of the empty output, a sequence with empty outputs inserted can be aligned with the input sequence; the elements of the set of such sequences are therefore called "alignments". Given an arbitrary input sequence, the Transducer model defines a conditional distribution P(a | x); this conditional distribution is used to calculate the probability of outputting the sequence y given the input sequence x:
[0113] P(y | x) = Σ_{a ∈ B⁻¹(y)} P(a | x)    (1)
[0114] where B denotes the mapping that removes the empty outputs from an alignment sequence, and B⁻¹(y) denotes the set of all alignment sequences generated by adding empty outputs to the output sequence y. As can be seen from formula (1), in order to calculate the probability of the output sequence y, it is necessary to sum the conditional probabilities of all possible alignments a corresponding to the sequence y. Please refer to Figure 4, which shows a schematic diagram of an alignment process according to an embodiment of the present application. Figure 4 gives an example to illustrate formula (1).
[0115] In Figure 4, U = 3 and T = 5; every path from the lower-left corner to the upper-right corner is an alignment. The bold arrows mark one of the possible paths. When the model moves upward, it outputs a non-empty symbol (a phoneme); when the model moves rightward, it outputs the empty symbol (i.e., the empty output), indicating that no output is generated for that frame. At the same time, the model allows multiple outputs at the same time step.
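In the lattice picture above, every alignment is a sequence of T rightward (empty output) moves and U upward (phoneme) moves, so the number of alignments that formula (1) sums over can be counted directly. A small sketch, where `num_alignments` is a hypothetical helper name:

```python
from math import comb

def num_alignments(T, U):
    """Count monotonic paths from the lower-left to the upper-right
    corner of the alignment lattice: choose where the U upward
    (phoneme) moves fall among the T + U total steps."""
    return comb(T + U, U)

paths = num_alignments(5, 3)  # the Figure 4 example: U = 3, T = 5 -> 56 alignments
```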
[0116] To model P(a | x), three sub-networks are generally used for joint modeling. Please refer to Figure 5, which shows a schematic structural diagram of an acoustic model according to an embodiment of the present application. As shown in Figure 5, the acoustic model includes an encoder 51, a predictor 52, and a joint network 53.
[0117] The encoder 51 (Encoder) can be a recurrent neural network, such as a long short-term memory (LSTM) network; it receives the audio feature input at time t and outputs an acoustic hidden-layer representation vector.
[0118] The predictor 52 (Predictor) can be a recurrent neural network, such as an LSTM; it receives the model's historical non-empty output labels and outputs a text hidden-layer representation vector.
[0119] The joint network 53 (Joint Network) can be a fully connected neural network, such as a linear layer plus an activation unit; it linearly transforms and combines the two hidden-layer representation vectors to output a hidden unit representation z_i, which is finally converted into a probability distribution by a softmax function.
[0120] In Figure 5 above, with the acoustic representation produced by the encoder and the text representation produced by the predictor fed into the joint network, the calculation of formula (1) becomes:
[0121] P(y | x) = Σ_{a ∈ B⁻¹(y)} Π_i P(a_i | x_{1:t_i}, y_{1:u_i})    (2)
[0122] Calculating formula (2) directly requires traversing all alignments, which causes a large amount of calculation. During model training, the probability in formula (2) can instead be computed efficiently with a forward-backward (dynamic programming) algorithm.
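The dynamic-programming idea mentioned above (summing over all alignments without enumerating them) can be sketched with a simple forward recursion over the lattice. The probability tables below are toy placeholders rather than real model outputs, and the interface is illustrative only, assuming position-dependent blank/label probabilities:

```python
def transducer_forward(T, U, p_blank, p_label):
    """alpha[t][u]: total probability of all partial alignments that
    have consumed t frames and emitted u labels. p_blank[t][u] and
    p_label[t][u] are the empty-output / next-label probabilities at
    lattice node (t, u)."""
    alpha = [[0.0] * (U + 1) for _ in range(T + 1)]
    alpha[0][0] = 1.0
    for t in range(T + 1):
        for u in range(U + 1):
            if t == 0 and u == 0:
                continue
            total = 0.0
            if t > 0:  # arrive by consuming a frame via an empty output
                total += alpha[t - 1][u] * p_blank[t - 1][u]
            if u > 0:  # arrive by emitting the next target label
                total += alpha[t][u - 1] * p_label[t][u - 1]
            alpha[t][u] = total
    return alpha[T][U]

# Toy check: T = 2, U = 1, every probability 0.5. The three possible
# alignments each have probability 0.5 ** 3, so the sum is 0.375.
half = [[0.5] * 2 for _ in range(3)]
total = transducer_forward(2, 1, half, half)
```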
[0123] In a possible implementation, the encoder is a feedforward sequential memory network (FSMN).
[0124] In a possible implementation, the predictor is a one-dimensional convolution network.
[0125] The scheme shown in the embodiments of the present application can be applied in scenarios such as an in-vehicle offline speech recognition system. Vehicle-mounted equipment places high demands on the model parameter amount and computation amount, and its central processor is limited, so the requirements on the model parameter amount and model structure are high. In order to reduce the computation amount and adapt to such computing-power-limited application scenarios, the scheme shown in this application uses the fully feedforward neural network FSMN as the Encoder and uses a one-dimensional convolution network to replace the commonly used long short-term memory network (LSTM) as the Predictor.
[0126] In the above Transducer model, in order to capture the model's historical information, the Encoder and Predictor networks generally use a recurrent neural network (RNN) structure, such as an LSTM or a gated recurrent unit (GRU). However, on embedded devices with limited resources, recurrent neural networks bring a large amount of computation and occupy a large amount of CPU resources. On the other hand, the content of in-vehicle offline speech recognition consists mainly of query and control instructions, and the sentences are relatively short, so overly long historical information is unnecessary. For this reason, this scheme uses an FSMN-based Encoder and a one-dimensional-convolution-based Predictor network. On the one hand, the model parameters can be compressed; on the other hand, computational resources can be greatly saved, the computation speed can be improved, and the real-time performance of speech recognition can be ensured.
[0127] In this scheme, an FSMN-based Encoder structure is adopted. The FSMN network has been applied to large-vocabulary speech recognition tasks. The FSMN structure employed in this scheme can be a structure with a projection layer and residual connections.
[0128] For the Predictor network, a one-dimensional convolution network is used in this scheme, and the current output is generated based on a limited history of predicted outputs. Please refer to Figure 6, which shows a network structure diagram of a predictor according to an embodiment of the present application. As shown in Figure 6, the Predictor network uses four non-empty historical outputs to predict the current output frame. That is, the four non-empty historical outputs 61 preceding the current input are passed through a vector (embedding) mapping and then input into the one-dimensional convolution network 62 to obtain the text hidden-layer representation vector.
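A rough, hypothetical sketch of such a predictor: the last four non-empty labels are embedded and passed through a single one-dimensional convolution window. All weights are random placeholders, and the dimensions are invented for illustration; the real network's layer sizes are not specified in this excerpt.

```python
import random

random.seed(0)
EMB_DIM, HIDDEN, CONTEXT = 8, 16, 4  # CONTEXT = 4 non-empty history outputs

# Placeholder embedding table for the 212 phonemes plus the blank (213 ids).
embed = {p: [random.gauss(0, 1) for _ in range(EMB_DIM)] for p in range(213)}
# One convolution window spanning the whole 4-token context -> HIDDEN dims.
kernel = [[random.gauss(0, 1) for _ in range(CONTEXT * EMB_DIM)]
          for _ in range(HIDDEN)]

def predictor(history):
    """Map the last CONTEXT non-empty labels to a text hidden-layer
    representation vector; short histories are padded with a preset
    id (0 here), as the training description below also mentions."""
    padded = ([0] * CONTEXT + history)[-CONTEXT:]
    window = [v for tok in padded for v in embed[tok]]
    return [sum(w * x for w, x in zip(row, window)) for row in kernel]

h = predictor([17, 42])  # padded to [0, 0, 17, 42]
```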
[0129] In the embodiments of the present application, the acoustic model can be trained on preset speech samples and the actual phonemes of each speech frame in the speech signal samples. For example, in the training process, an audio frame in a speech sample is input into the FSMN-based Encoder network of the acoustic model, and the actual phonemes of the first four non-empty speech frames preceding that speech frame (when there are no historical non-empty speech frames, or not enough of them, they can be replaced by preset phonemes) are input into the one-dimensional-convolution-based Predictor network. Based on the results of the acoustic model processing the input data, the parameters of the three parts of the acoustic model (the Encoder, the Predictor, and the joint network) are updated so that the sum of the probabilities over all possible alignment paths, i.e., the result of formula (2) above, is maximized, thereby training the acoustic model.
[0130] Step 303: Suppress and adjust the probability of the empty output in the phoneme recognition result of each speech frame, so as to reduce the ratio of the probability of the empty output to the probability of each phoneme in the phoneme recognition result.
[0131] In a possible implementation, suppressing and adjusting the probability of the empty output in the phoneme recognition result of each speech frame includes:
[0132] adjusting the phoneme recognition result corresponding to each speech frame by at least one of the following adjustment methods:
[0133] decreasing the probability of the empty output in the phoneme recognition result of each speech frame;
[0134] and increasing the probability of each phoneme in the phoneme recognition result of each speech frame.
[0135] In a possible implementation, decreasing the probability of the empty output in the phoneme recognition result corresponding to each speech frame includes:
[0136] multiplying the probability of the empty output in the phoneme recognition result corresponding to each speech frame by a first weight, the first weight being less than 1 and greater than 0.
[0137] In the embodiments of the present application, when suppressing the probability of the empty output in the phoneme recognition result, only the probability of the empty output may be reduced, for example, by multiplying the probability of the empty output in the phoneme recognition result by a number between 0 and 1. With the probabilities of the phonemes in the phoneme recognition result unchanged, the ratio between the probability of the empty output and the probability of each phoneme is thus reduced.
[0138] In a possible implementation, increasing the probability of each phoneme in the phoneme recognition result corresponding to each speech frame includes:
[0139] multiplying the probability of each phoneme in the phoneme recognition result corresponding to each speech frame by a second weight, the second weight being greater than 1.
[0140] In the embodiments of the present application, when suppressing the probability of the empty output in the phoneme recognition result, only the probabilities of the phonemes may be increased, for example, by multiplying the probability of each phoneme in the phoneme recognition result by a number greater than 1. With the probability of the empty output in the phoneme recognition result unchanged, the ratio between the probability of the empty output and the probability of each phoneme can likewise be reduced.
[0141] In another exemplary scheme, the computer device can also increase the probability of each phoneme in the phoneme recognition result while reducing the probability of the empty output, for example, by multiplying the probability of the empty output in the phoneme recognition result by a number between 0 and 1 and, at the same time, multiplying the probability of each phoneme by a number greater than 1.
[0142] In this scheme, in order to obtain an alignment between the input and output, the acoustic model needs to insert the empty output symbol ∅ into the output phoneme sequence; that is, the ∅ symbol is predicted by the model together with the other phonemes. Assuming that the total number of non-empty phonemes is P, the output dimension of the final model is P + 1, where dimension 0 usually represents the empty output ∅. Experiments found that the introduction of the empty output raises the model's deletion errors, which means that a large number of phonemes are erroneously identified as empty outputs. In order to solve the problem of an excessively large empty output probability, this application reduces deletion errors by adjusting the weight of the empty output probability during Transducer decoding.
[0143] Taking multiplying the probability of the empty output in the phoneme recognition result of each speech frame by the first weight as an example, suppose the probability of the empty output is P_blank. To reduce it, this scheme divides the original empty-output probability by a weight α, where α > 1, and the adjusted empty-output probability is:
[0144] P'_blank = P_blank / α (3)
[0145] In general, the logarithmic probability is used as the final value in the decoding score calculation, so taking the logarithm of equation (3) gives:
[0146] log P'_blank = log P_blank − log α (4)
[0147] The result of equation (4) can be used as the adjusted empty-output probability in subsequent decoding.
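The adjustment in equations (3) and (4) can be sketched in a few lines of Python. The weight α = 2.0 and the blank index 0 below are illustrative assumptions, not values fixed by this scheme:

```python
import math

def suppress_blank(log_probs, blank_id=0, alpha=2.0):
    """Divide the blank probability by a weight alpha (alpha > 1),
    which in the log domain means subtracting log(alpha), as in
    equation (4). Other phoneme log-probabilities are unchanged."""
    adjusted = list(log_probs)
    adjusted[blank_id] -= math.log(alpha)
    return adjusted

# Example: a frame where the blank has probability 0.7.
frame = [math.log(p) for p in (0.7, 0.2, 0.1)]
out = suppress_blank(frame)
```

With α = 2, the blank probability of 0.7 is halved to 0.35 while the phoneme probabilities stay put, shrinking the blank-to-phoneme ratio exactly as paragraph [0143] describes.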
[0148] In a possible implementation, the first weight or the second weight is preset in the computer device; for example, the first weight or the second weight can be preset by a developer in the speech recognition model.
[0149] In step 304, among the phoneme recognition results corresponding to each speech frame, those whose empty-output probability satisfies a specified condition are input into the decoding graph to obtain the recognized text sequence corresponding to the speech signal.
[0150] In a possible implementation, inputting the adjusted phoneme recognition results corresponding to each speech frame into the decoding graph to obtain the recognized text sequence corresponding to the speech signal includes:
[0151] in response to the probability of the empty output in a target phoneme recognition result satisfying the specified condition, inputting the target phoneme recognition result into the decoding graph to obtain the recognized text corresponding to the target phoneme recognition result;
[0152] wherein the target phoneme recognition result is any one of the phoneme recognition results corresponding to each speech frame.
[0153] In a possible implementation, the specified condition includes:
[0154] the probability of the empty output in the target phoneme recognition result being less than a probability threshold.
[0155] Experiments found that the output of the Transducer model has a more pronounced spike effect than the DNN-HMM model; that is, at certain moments, the model outputs a prediction with very high confidence. Using this spike effect, the frames for which the model predicts the empty output with high probability can be skipped during decoding; that is, those probability distributions do not participate in the decoding-graph search. Because this scheme uses phonemes as the modeling unit and skips blank frames during decoding, the number of search steps in the decoding graph is related to the number of phonemes; this is called phone-synchronous decoding (PSD). The following algorithm gives the entire process of the PSD and empty-output weight adjustment proposed in this scheme:
[0156] Algorithm 1: the PSD algorithm (the listing itself appears as an image in the original publication and is not reproduced here).
[0159] In the algorithm, the sixth line performs the weight adjustment of the empty-output probability (the blank weight β in the algorithm). The remaining lines implement the PSD algorithm proposed in this scheme: only when the probability of the empty output is less than a certain threshold γ does the probability distribution output by the network participate in the subsequent decoding-graph search.
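The PSD loop described above can be sketched as follows, reusing the log-probability representation from equation (4); the values α = 2.0 and γ = 0.4 are hypothetical tuning choices, not values prescribed by this scheme:

```python
import math

def psd_decode(frames, blank_id=0, alpha=2.0, gamma=0.4):
    """Phone-synchronous decoding sketch: suppress each frame's blank
    log-probability by log(alpha); only frames whose adjusted blank
    probability falls below the threshold gamma are passed on to the
    decoding-graph search, the rest are treated as blank and skipped."""
    kept = []
    for t, log_probs in enumerate(frames):
        adjusted = list(log_probs)
        adjusted[blank_id] -= math.log(alpha)
        if math.exp(adjusted[blank_id]) < gamma:
            kept.append((t, adjusted))  # participates in graph search
        # else: spike on blank -> frame skipped entirely
    return kept
```

A frame whose blank probability stays at or above γ even after suppression never reaches the decoding graph, which is what ties the number of search steps to the number of phonemes rather than the number of frames.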
[0160] In a possible implementation, the probability threshold is preset in the computer device by a developer or administrator; for example, the probability threshold can be preset by the developer in the speech recognition model.
[0161] In a possible implementation, before inputting the adjusted phoneme recognition results corresponding to each speech frame into the decoding graph to obtain the recognized text sequence corresponding to the voice signal, the method further includes:
[0162] obtaining threshold influence parameters, the threshold influence parameters including at least one of ambient sound intensity, the number of speech recognition failures within a specified time period, and user setting information;
[0163] determining the probability threshold based on the threshold influence parameters.
[0164] In this application embodiment, the probability threshold can also be adjusted by the computer device during the speech recognition process. That is, the computer device can acquire the parameters that may affect the value of the probability threshold and flexibly set the probability threshold according to those parameters.
[0165] For example, ambient sound can interfere with recognition of the user's voice, so when the ambient sound intensity is strong, the computer device can raise the probability threshold so that more phoneme recognition results are input into the decoding graph, thereby ensuring recognition accuracy. Conversely, when the ambient sound intensity is weak, the computer device can lower the probability threshold so that more phoneme recognition results are skipped, thereby ensuring recognition efficiency.
[0166] For another example, the accuracy of decoding affects the success rate of speech recognition. When the number of speech recognition failures within a specified time period (such as a period before the current time, for example, 5 minutes) is too large, the computer device can raise the probability threshold so that more phoneme recognition results are input into the decoding graph, thereby ensuring recognition accuracy; conversely, when the number of speech recognition failures within the specified time period is small, the computer device can lower the probability threshold so that more phoneme recognition results are skipped, thereby ensuring recognition efficiency.
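The threshold adaptation of paragraphs [0164] to [0166] could look like the following sketch; the base value, the 60 dB noise cutoff, and the failure-count cutoff of 3 are invented for illustration and are not given by this scheme:

```python
def adapt_threshold(base=0.6, ambient_db=None, recent_failures=0,
                    user_offset=0.0):
    """Raise the blank-probability threshold when the environment is
    noisy or recent recognitions failed, so that more phoneme results
    reach the decoding graph; otherwise keep it low so that more
    blank-dominated frames are skipped."""
    thr = base
    if ambient_db is not None and ambient_db > 60:  # assumed noise cutoff
        thr += 0.1
    if recent_failures > 3:                         # assumed failure cutoff
        thr += 0.1
    thr += user_offset                              # user setting information
    return min(max(thr, 0.1), 0.95)                 # clamp to a sane range
```

Since a frame is decoded only when its adjusted blank probability is below the threshold, raising the threshold admits more frames into the search (favoring accuracy) and lowering it skips more frames (favoring efficiency), matching the two examples above.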
[0167] In a possible implementation, the decoding graph is obtained by composing a phoneme dictionary and a language model.
[0168] The decoding graph used in this scheme is composed of two sub-graphs, each a weighted finite state transducer (Weighted Finite State Transducer, WFST):
[0169] Phoneme dictionary WFST: the mapping from phoneme sequences to Chinese characters or words. Given an input phoneme sequence, this WFST outputs the corresponding Chinese characters or words. Usually this WFST is independent of the text domain and is a common component across different recognition tasks;
[0170] Language model WFST: this WFST is typically converted from an N-gram language model, which is built from training data with statistical methods and is used to compute the probability of a sentence. Text from different domains, such as news and spoken conversation, differs greatly, so when performing speech recognition in different domains, replacing the language model WFST achieves domain adaptation.
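As a toy illustration of how the two WFSTs cooperate (the actual scheme composes full WFST graphs; the lexicon entry and bigram probability below are invented examples, not the patent's data):

```python
import math

# Toy stand-ins for the phoneme dictionary WFST and the N-gram
# language model WFST (invented example entries).
LEXICON = {("n", "i", "h", "ao"): "你好"}       # phoneme sequence -> word
BIGRAM_LOGP = {("<s>", "你好"): math.log(0.5)}  # log P(word | history)

def decode_word(phonemes, history="<s>"):
    """Map a phoneme sequence to a word via the lexicon, then score it
    with the bigram language model; returns (word, log-prob), or None
    if the lexicon has no entry for the sequence."""
    word = LEXICON.get(tuple(phonemes))
    if word is None:
        return None
    return word, BIGRAM_LOGP.get((history, word), math.log(1e-6))
```

Swapping in a different `BIGRAM_LOGP` while keeping `LEXICON` fixed mirrors the domain adaptation described above: the lexicon WFST is shared across tasks and only the language model WFST is replaced.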
[0171] Please refer to Figure 7, which shows the model training and application flow according to an embodiment of the present application. As shown in Figure 7, taking a vehicle-mounted device as an example, after training, the model is quantized and deployed via Libtorch. The Android version of Libtorch uses the QNNPACK library for int8 matrix computation, which greatly accelerates matrix operations. The model is trained in the Python environment 71 and then quantized; that is, the model parameters are quantized to int8 so that int8 matrix multiplication can accelerate the computation. The quantized model is then exported and used for forward inference in the C++ environment 72, where it is tested with test data.
[0172] Through the scheme shown in this application, on the one hand, the Transducer-based end-to-end model needs no frame-level alignment information during training, which greatly simplifies the modeling process. Secondly, the decoding graph is simplified and the search space reduced: because phoneme modeling is used, the decoding graph only needs L (the lexicon) and G (the language model), so the search space is greatly reduced. Finally, phoneme modeling combined with a custom decoding graph enables flexible customization: for different business scenarios, only the language model needs to be customized, without changing the acoustic model, to adapt to the respective business scenes.
[0173] Compared with offline recognition systems in the related art, this scheme has advantages in both recognition rate and CPU usage:
[0174] In terms of recognition rate, the system shown in this scheme improves significantly compared with the DNN hidden Markov model (Hidden Markov Model, HMM) system (the DNN-HMM model).
[0175] In terms of CPU usage, even when the model parameter count of the system shown in this scheme is about 3 times that of the DNN-HMM system (2.1M vs. 0.7M), it still has a CPU usage similar to that of the DNN-HMM system model.
[0176] Speech recognition rate comparison:
[0177] Table 1 below compares the character error rate (CER) of the DNN-HMM system and the Transducer system proposed in this scheme.
[0178] Table 1
[0179]
Model        Parameters   Test set 1 CER (%)   Test set 2 CER (%)
DNN-HMM      0.7M         14.88                19.77
Transducer1  0.8M         12.1                 16.09
Transducer2  1.9M         9.76                 13.4
Transducer3  2.1M         8.93                 13.18
[0180] From Table 1 it can be seen that, at a comparable parameter count, the Transducer1 model achieves relative CER reductions of 18.7% and 18.6% over DNN-HMM on the two test sets, respectively. Meanwhile, when the model parameters are increased, Transducer3 reaches character error rates of 8.93% and 13.18%.
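The relative reductions quoted above follow directly from the Table 1 figures:

```python
def relative_cer_reduction(baseline_cer, system_cer):
    """Relative character-error-rate reduction, in percent."""
    return (baseline_cer - system_cer) / baseline_cer * 100

# Transducer1 (0.8M) vs. DNN-HMM (0.7M), per Table 1:
r1 = relative_cer_reduction(14.88, 12.1)   # test set 1 -> about 18.7
r2 = relative_cer_reduction(19.77, 16.09)  # test set 2 -> about 18.6
```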
[0181] CPU usage comparison:
[0182] Table 2
[0183]
Model        Parameters   CPU usage (peak)
DNN-HMM      0.7M         16%
Transducer1  0.8M         18%
Transducer2  1.9M         20%
Transducer3  2.1M         20%
[0184] Comparing Transducer1 and DNN-HMM at a comparable parameter count, the peak CPU usage of the Transducer1 model is 2% higher than that of the DNN-HMM model, but the peak usage of the Transducer models does not change significantly as the parameter count increases: CPU usage remains at a low level even as the model grows and the recognition error rate drops.
[0185] In summary, in the scheme shown in the present application, for phoneme recognition results containing the probability distribution of a speech frame over each phoneme and the empty output, the probability of the empty output is first suppressed before the phoneme recognition results are input into the decoding graph. This reduces the likelihood that a speech frame is recognized as an empty output, that is, it reduces the model's deletion errors, thereby improving the recognition accuracy of the model.
[0186] The embodiment shown in Figure 3 of the present application applies both the empty-output weight adjustment (step 303) and the decoding frame skipping (step 304) as an example. In other implementations, the empty-output weight adjustment and the decoding frame skipping can also be applied independently. For example, in an exemplary embodiment of the present application, the scheme can be as follows:
[0187] obtaining a voice signal, the speech signal including the speech frames obtained by dividing the original voice;
[0188] processing the voice signal through the acoustic model to obtain the phoneme recognition result corresponding to each speech frame; the phoneme recognition result is used to indicate the probability distribution of the corresponding speech frame in the phoneme space; the phoneme space contains each phoneme and an empty output; the acoustic model is trained with speech signal samples and the actual phonemes of each speech frame in the speech signal samples;
[0189] among the phoneme recognition results corresponding to each speech frame, inputting those whose empty-output probability satisfies a specified condition into the decoding graph to obtain the recognized text sequence corresponding to the speech signal.
[0190] In summary, in the scheme shown in the present application, for phoneme recognition results containing the probability distribution of a speech frame over each phoneme and the empty output, only those whose empty-output probability satisfies the condition are input into the decoding graph. This reduces the number of phonemes that need to be decoded and skips unnecessary decoding steps, thereby effectively improving speech recognition efficiency.
[0191] Please refer to Figure 8, which is a framework diagram of a speech recognition system according to an exemplary embodiment. As shown in Figure 8, the audio acquisition device 81 is connected to the speech recognition device 82, and the speech recognition device 82 includes an acoustic model 82a, a probability adjustment unit 82b, a decoding graph input unit 82c, a decoding graph 82d, and a feature extraction unit 82e. The decoding graph 82d consists of a phoneme dictionary and a language model.
[0192] In application, the audio acquisition device 81 collects the user's original voice and transmits it to the feature extraction unit 82e in the speech recognition device 82, which divides the voice into speech frames and performs feature extraction on each of them. The speech features of a speech frame, together with the text recognized from the previous four non-empty speech frames of that frame, are input into the FSMN and the one-dimensional convolutional network in the acoustic model 82a, respectively, and the acoustic model 82a outputs the phoneme recognition result of the speech frame.
[0193] The phoneme recognition result is input into the probability adjustment unit 82b, which adjusts the probability of the empty output; the adjusted phoneme recognition result is then examined by the decoding graph input unit 82c. When the adjusted empty-output probability is less than the threshold, it is determined that decoding is needed, and the decoding graph input unit 82c inputs the adjusted phoneme recognition result into the decoding graph 82d, which decodes it into text; conversely, if the adjusted empty-output probability is not less than the threshold, it is determined that decoding is not needed, and the adjusted phoneme recognition result is discarded.
[0194] The decoding graph decodes the adjusted phoneme recognition results of each speech frame and outputs the text sequence to a natural language processing component, which responds to the voice input by the user.
[0195] Figure 9 is a structural block diagram of a speech recognition apparatus according to an exemplary embodiment. The speech recognition apparatus can implement all or some of the steps in the methods provided by the embodiments shown in Figure 2 or Figure 3. The speech recognition apparatus can include:
[0196] a voice signal processing module 901, configured to process the voice signal through the acoustic model to obtain the phoneme recognition result corresponding to each speech frame in the voice signal; the phoneme recognition result is used to indicate the probability distribution of the corresponding speech frame in the phoneme space; the phoneme space contains each phoneme and an empty output; the acoustic model is trained with speech signal samples and the actual phonemes of each speech frame in the speech signal samples;
[0197] a probability adjustment module 902, configured to perform suppression adjustment on the probability of the empty output in the phoneme recognition result of each speech frame, so as to reduce the ratio between the probability of the empty output and the probability of each phoneme in the phoneme recognition result;
[0198] a decoding module 903, configured to input the adjusted phoneme recognition results corresponding to each speech frame into the decoding graph to obtain the recognized text sequence corresponding to the speech signal.
[0199] In a possible implementation, the probability adjustment module 902 is configured to adjust the phoneme recognition results corresponding to each speech frame by at least one of the following adjustment modes:
[0200] reducing the probability of the empty output in the phoneme recognition result corresponding to each speech frame;
[0201] as well as,
[0202] improving the probability of each phoneme in the phoneme recognition result corresponding to each speech frame.
[0203] In a possible implementation, the probability adjustment module 902 is configured to multiply the probability of the empty output in the phoneme recognition result corresponding to each speech frame by the first weight, the first weight being less than 1 and greater than 0.
[0204] In a possible implementation, the probability adjustment module 902 is configured to multiply the probability of each phoneme in the phoneme recognition result corresponding to each speech frame by the second weight, the second weight being greater than 1.
[0205] In a possible implementation, the decoding module 903 is configured to:
[0206] in response to the probability of the empty output in a target phoneme recognition result satisfying the specified condition, input the target phoneme recognition result into the decoding graph to obtain the recognized text corresponding to the target phoneme recognition result;
[0207] wherein the target phoneme recognition result is any one of the phoneme recognition results corresponding to each speech frame.
[0208] In a possible implementation, the specified condition comprises:
[0209] the probability of the empty output in the target phoneme recognition result being less than the probability threshold.
[0210] In a possible implementation, the apparatus further comprises:
[0211] a parameter acquisition module, configured to obtain threshold influence parameters, the threshold influence parameters including at least one of ambient sound intensity, the number of speech recognition failures within a specified time period, and user setting information;
[0212] a threshold determination module, configured to determine the probability threshold based on the threshold influence parameters.
[0213] In a possible implementation, the speech signal processing module 901 is configured to:
[0214] perform feature extraction on a target speech frame to obtain a feature vector of the target speech frame; the target speech frame is any one of the speech frames;
[0215] input the feature vector of the target speech frame into the encoder in the acoustic model to obtain an acoustic hidden-layer representation vector of the target speech frame;
[0216] input the history recognized text of the target speech frame into the predictor in the acoustic model to obtain a text hidden-layer representation vector of the target speech frame; the history recognized text of the target speech frame is the text recognized by the decoding graph from the phoneme recognition results of the previous N non-empty outputs of the target speech frame; N is an integer greater than or equal to 1;
[0217] input the acoustic hidden-layer representation vector of the target speech frame and the text hidden-layer representation vector of the target speech frame into an integration network to obtain the phoneme recognition result of the target speech frame.
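The integration (joint) network step above can be sketched as follows; the tiny dimensions, weights, and tanh/softmax choices are placeholders for illustration, not the trained model's actual architecture:

```python
import math

def joint_network(h_enc, h_pred, W_e, W_p, b):
    """Combine the encoder's acoustic hidden vector and the predictor's
    text hidden vector through linear projections and a tanh, then
    softmax over the P phonemes plus the empty output."""
    z = [math.tanh(sum(w * x for w, x in zip(row_e, h_enc)) +
                   sum(w * x for w, x in zip(row_p, h_pred)) + bi)
         for row_e, row_p, bi in zip(W_e, W_p, b)]
    m = max(z)                             # stabilize the softmax
    exps = [math.exp(v - m) for v in z]
    s = sum(exps)
    return [v / s for v in exps]           # probability distribution
```

The returned distribution over P + 1 classes is the phoneme recognition result on which the empty-output weight adjustment and PSD of the earlier steps operate.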
[0218] In a possible implementation, the encoder is a feedforward sequential memory network (FSMN).
[0219] In a possible implementation, the predictor is a one-dimensional convolutional network.
[0220] In a possible implementation, the decoding graph is composed of a phoneme dictionary and a language model.
[0221] In summary, in the scheme shown in the present application, for phoneme recognition results containing the probability distribution of a speech frame over each phoneme and the empty output, the probability of the empty output is first suppressed before the phoneme recognition results are input into the decoding graph. This reduces the likelihood that a speech frame is recognized as an empty output, that is, it reduces the model's deletion errors, thereby improving the recognition accuracy of the model.
[0222] Figure 10 is a structural diagram of a computer device according to an exemplary embodiment. The computer device can be implemented as the computer device in the various method embodiments described above. The computer device 1000 includes a central processing unit 1001, a system memory 1004 comprising a random access memory (RAM) 1002 and a read-only memory (ROM) 1003, and a system bus 1005 connecting the system memory 1004 and the central processing unit 1001. The computer device 1000 also includes a basic input/output system 1006 that helps transmit information between the various devices within the computer, and a mass storage device 1007 for storing an operating system 1013, an application 1014, and other program modules 1015.
[0223] The mass storage device 1007 is connected to the central processing unit 1001 through a mass storage controller (not shown) connected to the system bus 1005. The mass storage device 1007 and its associated computer readable medium provide non-volatile storage for the computer device 1000. That is, the mass storage device 1007 can include a computer readable medium (not shown) such as a hard disk or a compact disc read-only memory (CD-ROM) drive.
[0224] Without loss of generality, the computer readable medium can include a computer storage medium and a communication medium. Computer storage media include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storing information such as computer readable instructions, data structures, program modules, or other data. Computer storage media include RAM, ROM, flash memory or other solid-state storage, CD-ROM or other optical storage, tape cassettes, magnetic tape, disk storage, or other magnetic storage devices. Of course, those skilled in the art know that the computer storage medium is not limited to the above. The system memory 1004 and the mass storage device 1007 described above can be collectively referred to as a memory.
[0225] The computer device 1000 can be connected to the Internet or other network devices through a network interface unit 1011 connected to the system bus 1005.
[0226] The memory further includes at least one computer instruction stored in the memory, and the processor implements all or some of the steps of the method shown in Figure 2 or Figure 3 by loading and executing the at least one computer instruction.
[0227] In an exemplary embodiment, a non-transitory computer readable storage medium including instructions is also provided, such as a memory including a computer program (instructions) executable by the processor of the computer device to complete the methods shown in the various embodiments of this application. For example, the non-transitory computer readable storage medium can be a read-only memory (ROM), a random access memory (RAM), a compact disc read-only memory (CD-ROM), a magnetic tape, a floppy disk, an optical data storage device, or the like.
[0228] In an exemplary embodiment, a computer program product or computer program is also provided, which includes computer instructions stored in a computer readable storage medium. The processor of the computer device reads the computer instructions from the computer readable storage medium and executes them, so that the computer device performs the methods shown in the various embodiments described above.
[0229] Other embodiments of the present application will be readily apparent to those skilled in the art after considering the specification and practicing the invention disclosed herein. The present application is intended to cover any variations, uses, or adaptive changes of the present application; these variations, uses, or adaptive changes follow the general principles of the present application and include common knowledge or customary techniques in the art not disclosed herein. The specification and examples are considered exemplary only, with the true scope and spirit of the present application indicated by the claims.
[0230] It should be understood that the present application is not limited to the exact structures described above and illustrated in the drawings, and various modifications and changes can be made without departing from its scope. The scope of the present application is limited only by the appended claims.