An adaptive method and system for speech recognition based on a cached language model

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A language model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low frequency word recognition, low recognition accuracy of low frequency words, and complex recognition system, so as to improve field relevance and improve recognition accuracy. rate effect

Active Publication Date: 2021-09-03

HANGZHOU YIWISE INTELLIGENT TECH CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] In order to solve the problem that the existing speech recognition system based on single-sentence tasks cannot adapt to domain information to recognize low-frequency words, resulting in low recognition accuracy of low-frequency words or too complicated recognition system, the present invention proposes a speech recognition system based on a cached language model. Identify adaptive methods and systems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0102] In order to demonstrate the experimental effect of the present invention, this embodiment provides a comparative experiment, the experimental method is the same as the process described above, only the specific implementation details are given here, and the repeated process will not be repeated.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice recognition self-adaptive method and system based on a cached language model, belonging to the field of voice recognition. In the present invention, by receiving the continuous voice signal input by the user, the continuous voice signal is divided into multiple short voices based on the voice activity detection technology VAD, and the short voices are sequentially recognized based on the general language model, and a corresponding identification is generated for each short voice. As a result, the associated vocabulary is obtained based on keyword search, and the associated vocabulary is processed through the cache model to obtain a language model that adapts to local changes in the distribution of historically recognized texts. Based on the modified language model, the subsequent short speech continues to be recognized. After partial modification, the language model has a better similarity with the historical recognition content, which improves the recognition accuracy of continuous long speech. In addition, users can actively correct misrecognized low-frequency words during the recognition process to improve the subsequent recognition accuracy of low-frequency words.

Description

technical field [0001] The invention relates to the field of speech recognition, in particular to an adaptive method and system for speech recognition based on a cached language model. Background technique [0002] After decades of development, speech recognition has become a relatively mature technology. In practical applications, Siri, Cortana, etc. have high recognition accuracy under ideal conditions. [0003] The performance of a speech recognition system largely depends on the similarity between the language model (LM) used and the task to be processed. This similarity is especially important in cases where the statistical properties of language change over time, such as in application scenarios involving spontaneous and multi-domain speech. Topic Identification (TI) based on information retrieval is a key technology. The topic under discussion can be obtained through semantic analysis of historical recognition results, so as to adjust the language model and realize d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/04G10L15/183G10L15/26

CPCG10L15/04G10L15/183

Inventor黄俊杰

OwnerHANGZHOU YIWISE INTELLIGENT TECH CO LTD

An adaptive method and system for speech recognition based on a cached language model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology