Voice recognition text error correction method in specific field

A speech recognition and field-specific technology, applied in the field of data processing, can solve problems such as poor error correction effect, and achieve the effects of avoiding mishandling, expanding the scope of adaptation, and improving accuracy

Active Publication Date: 2020-07-03
网经科技(苏州)有限公司
View PDF6 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the deficiencies in the prior art and provide a text error correction method for speech recognition in a specific field, aiming to solve the problem of poor error correction effect after speech recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition text error correction method in specific field
  • Voice recognition text error correction method in specific field
  • Voice recognition text error correction method in specific field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to have a clearer understanding of the technical features, purposes and effects of the present invention, specific implementations are now described in detail.

[0062] A kind of speech recognition text error correction method of the specific field of the present invention, such as figure 1 shown, including the following steps:

[0063] S101: Obtain words, word-level language models and pinyin language models using correct domain corpus statistics;

[0064] Before performing specific text error correction tasks, collect a certain scale of text materials in this field, including various types of documents such as regulations, systems, guidelines, manuals, etc.; the field corpus used for statistics must not contain typos and words The correct text can also be manually proofread to ensure the accuracy of the language model statistics.

[0065] Use statistical methods to obtain the required language model, so as to obtain the probability of each basic unit. The s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a voice recognition text error correction method in a specific field, wherein the method comprises the following steps: firstly, performing statistics by using correct field corpora to obtain a character and word level language model and a pinyin language model; then, receiving a text sequence to be subjected to error correction, and performing clause processing on more than one sentence; determining suspected wrong words by using a word, word and pinyin language model; determining a candidate word list of the suspected wrong words according to a language model vocabulary and a pronunciation-prone dictionary; and finally, substituting candidate words into the original text sequence, and selecting and outputting the most reasonable sentence in combination with macroscopic and microcosmic scores. Basic units with different granularities and dimensions such as characters, words, pinyin and initial and final consonants are selected to construct a language model, and word segmentation error interference caused by wrong characters is reduced; isolated character disorder is processed by adopting a word language model, and continuous recognition errors caused by pronunciation deviation is distinguished by adopting the pinyin language model; and candidate sentences after the wrong words are replaced are comprehensively evaluated by macroscopic and microcosmic scores, and the smoothness degree of the replaced sentences are measured.

Description

technical field [0001] The invention relates to a speech recognition text error correction method in a specific field, belonging to the technical field of data processing. Background technique [0002] With breakthroughs in technology in recent years, the error rate of speech recognition has been greatly reduced. At present, it has many application scenarios and has achieved good market response. For example, the voice input method on smartphones, the voice processing of smart speakers, and the networked transcription function of recording equipment, etc., are all inseparable from the assistance of voice recognition. For a typical human-computer interaction system with voice interface, voice recognition is the front-end module, and only the recognized text can be understood and processed in natural language, so as to generate corresponding interaction and return it to the user. [0003] However, there are still inevitable errors in the results of speech recognition. Due to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26G10L15/04G10L15/16G10L15/02G10L15/22G06F40/232G06F40/289
CPCG10L15/26G10L15/04G10L15/16G10L15/02G10L15/22G10L2015/223Y02D10/00
Inventor 刘继明金宁洪爱金孟亚磊陈浮
Owner 网经科技(苏州)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products