Unlock instant, AI-driven research and patent intelligence for your innovation.

Error correction, recognition and classification equipment for Chinese texts with errors

A text error correction, recognition and classification technology, applied in the fields of word feature extraction, text error correction and text classification, can solve the problem of poor automatic error correction ability of Chinese text, etc., to enhance anti-interference ability, reduce deviation, and improve accuracy sexual effect

Pending Publication Date: 2022-03-08
ZHEJIANG UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In order to overcome the poor automatic error correction ability of current equipment for Chinese texts and the shortage and huge demand for Chinese text classification research, the purpose of the present invention is to provide a method that can automatically extract rich Chinese texts by integrating Chinese morphology, pronunciation and semantic information. Features to complete Chinese text error correction and classification tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Error correction, recognition and classification equipment for Chinese texts with errors
  • Error correction, recognition and classification equipment for Chinese texts with errors
  • Error correction, recognition and classification equipment for Chinese texts with errors

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be described in detail below according to the accompanying drawings.

[0064] figure 1 Shown is a functional module diagram of the Chinese text error correction recognition classification device proposed by the present invention, including: Chinese text database 1, Chinese feature extraction module 2, text error correction module 3, text event element extraction and classification module 4. The device uses the following process to correct and classify error-containing Chinese texts:

[0065] 1) Chinese text database A large number of error-free Chinese texts are legally obtained from publishing houses, newspaper offices, news websites, and scientific research institutions, and stored in the database. In this module, it is also possible to segment a large number of collected texts to obtain the Chinese word set W:

[0066] W = {w 1 ,w 2 ,...,w n}

[0067] where w i ,i=1,2,...,n represents the divided words or words in the set, and n is t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an error-containing Chinese text error correction, recognition and classification device. The device is composed of a Chinese text database, a Chinese feature extraction module, a text error correction module and a text event element extraction and classification module. The Chinese text database collects a large number of Chinese texts, then the Chinese feature extraction module is responsible for encoding Chinese characters and words and converting the Chinese characters and words into high-dimensional embedded vectors, and then the text error correction module carries out error correction on error-containing texts. And finally, a text event element extraction and classification module extracts each event element in the text and classifies the text. According to the method, rich features in the Chinese text can be automatically extracted according to the form, pronunciation and semanteme of Chinese so as to automatically complete error correction and classification tasks of the Chinese text, and the defects that an existing achievement is poor in Chinese text automatic error correction capability and insufficient in Chinese text classification model research are overcome; and significant synergy is brought to application of a text classification technology in natural language processing to Chinese texts.

Description

technical field [0001] The invention relates to the field of word feature extraction, text error correction and text classification, in particular, to the technology of error correction and classification for Chinese text containing errors, and is a new Chinese text classification device. Background technique [0002] Artificial intelligence technology is rising day by day, and natural language processing technology, as an important technology in the field of artificial intelligence, has also been flourishing. Natural language processing refers to the use of computers to process information such as the form, sound, and semantics of natural language, so as to achieve the goal of communicating between humans and computers in natural language to complete specific tasks. A very important application aspect of natural language processing is the classification of text. In the text classification task, the computer is required to be able to identify the key intention contained in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/289G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F16/35G06F40/289G06F40/30G06N3/084G06N3/047G06N3/045G06F18/2415
Inventor 刘兴高刘昭然刘静王文海张志猛张泽银
Owner ZHEJIANG UNIV