Method for automatically correcting identification error of repeated words in Chinese pronunciation identification

A speech recognition and recurring technology, applied in speech recognition, speech analysis, special data processing applications, etc., can solve problems such as low correction efficiency

Active Publication Date: 2012-01-18
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF5 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Regardless of the early correction method or the later candidate selection correction method, the same word recognition error in different sen...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically correcting identification error of repeated words in Chinese pronunciation identification
  • Method for automatically correcting identification error of repeated words in Chinese pronunciation identification
  • Method for automatically correcting identification error of repeated words in Chinese pronunciation identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings according to the embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0043] In Chinese speech recognition, the speech content to be recognized basically revolves around a certain topic, so some keywords related to the topic will appear in multiple sentences before and after. Due to the differences in context and each pronunciation, the same keyword may be misrecognized multiple times in different sentences. Even if it is correctly recognized when it appears for the first time, it may be misrecognized when it appears later. If for each recurring keyword, using the corrected recognition result when it appears for the first time, the system can automatically...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for automatically correcting an identification error of repeated words in Chinese pronunciation identification. The method comprises the following steps of: (1) performing similarity matching on word confusion networks which are obtained after identification of each sentence, word groups in a word group library and intermediate identification results, and searching the repeated word groups, wherein each word confusion network is a set of all possible identification results and comprises an optimum identification result, namely the original optimum identification result, and the intermediate identification result which corresponds to each word in the optimum identification result, and the word group library comprises the word groups and the intermediate identification results which correspond to the word groups; (2) according to word group information which is obtained by searching, re-calculating a similar probability value and a word identification probability value; (3) according to a new probability value, sorting the word confusion networks according to the size of the probability value; and (4) replacing the optimum identification results and the intermediate identification results of the word confusion networks by using a sorting result. The method has the advantages that: by using experience knowledge in the corrected identification result, the identification error of the repeated words in the current identification sentence is automatically corrected, so the correction efficiency and correction speed of the identification error are improved.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to an automatic correction method for repeated word recognition errors in Chinese speech recognition. Background technique [0002] Speech recognition technology is a technology that uses computer and digital signal processing technology to accurately identify the content of human speech. At present, small and medium-sized vocabulary speech recognition technology for special applications has been practically applied. However, due to the limitations of factors such as background noise, dialect accent, colloquial natural speech, and semantic understanding, continuous speech recognition technology with large vocabulary and speaker-independent Still in the exploratory stage. Since speech recognition cannot achieve 100% recognition accuracy, it is indispensable to correct the recognition errors in the recognition results. [0003] Recognition error correction means that afte...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/26G06F17/30
Inventor 李新辉王向东钱跃良林守勋
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products