An adaptive method and system for speech recognition based on a cached language model
A language model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low frequency word recognition, low recognition accuracy of low frequency words, and complex recognition system, so as to improve field relevance and improve recognition accuracy. rate effect
Active Publication Date: 2021-09-03
HANGZHOU YIWISE INTELLIGENT TECH CO LTD
View PDF0 Cites 0 Cited by
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
[0006] In order to solve the problem that the existing speech recognition system based on single-sentence tasks cannot adapt to domain information to recognize low-frequency words, resulting in low recognition accuracy of low-frequency words or too complicated recognition system, the present invention proposes a speech recognition system based on a cached language model. Identify adaptive methods and systems
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreImage
Smart Image Click on the blue labels to locate them in the text.
Smart ImageViewing Examples
Examples
Experimental program
Comparison scheme
Effect test
Embodiment
[0102] In order to demonstrate the experimental effect of the present invention, this embodiment provides a comparative experiment, the experimental method is the same as the process described above, only the specific implementation details are given here, and the repeated process will not be repeated.
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More PUM
Login to View More
Abstract
The invention discloses a voice recognition self-adaptive method and system based on a cached language model, belonging to the field of voice recognition. In the present invention, by receiving the continuous voice signal input by the user, the continuous voice signal is divided into multiple short voices based on the voice activity detection technology VAD, and the short voices are sequentially recognized based on the general language model, and a corresponding identification is generated for each short voice. As a result, the associated vocabulary is obtained based on keyword search, and the associated vocabulary is processed through the cache model to obtain a language model that adapts to local changes in the distribution of historically recognized texts. Based on the modified language model, the subsequent short speech continues to be recognized. After partial modification, the language model has a better similarity with the historical recognition content, which improves the recognition accuracy of continuous long speech. In addition, users can actively correct misrecognized low-frequency words during the recognition process to improve the subsequent recognition accuracy of low-frequency words.
Description
technical field [0001] The invention relates to the field of speech recognition, in particular to an adaptive method and system for speech recognition based on a cached language model. Background technique [0002] After decades of development, speech recognition has become a relatively mature technology. In practical applications, Siri, Cortana, etc. have high recognition accuracy under ideal conditions. [0003] The performance of a speech recognition system largely depends on the similarity between the language model (LM) used and the task to be processed. This similarity is especially important in cases where the statistical properties of language change over time, such as in application scenarios involving spontaneous and multi-domain speech. Topic Identification (TI) based on information retrieval is a key technology. The topic under discussion can be obtained through semantic analysis of historical recognition results, so as to adjust the language model and realize d...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More Application Information
Patent Timeline
Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/04G10L15/183G10L15/26
CPCG10L15/04G10L15/183
Inventor 黄俊杰
Owner HANGZHOU YIWISE INTELLIGENT TECH CO LTD
Who we serve
- R&D Engineer
- R&D Manager
- IP Professional
Why Patsnap Eureka
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com