Document correcting marking method

A marking method and document technology, applied in the field of information processing, can solve problems such as poor support and difficult implementation of document error correction marks, and achieve the effect of increasing support

Active Publication Date: 2018-06-08
KUNMING UNIV OF SCI & TECH
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the traditional document error correction marking method is mainly used to mark wrong English words, and the support for non-English languages ​​such as Chinese is not good, because English does not involve word segmentation, and the independence between words is strong, while Unlike English, Chinese only needs to separate words based on spaces, and it can be judged whether the word is correct by looking up the English word database.
Chinese involves word segmentation, whether it is from a statistical or grammatical point of view, there is no certain rule between words, which makes it difficult to implement error correction marks for Chinese documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document correcting marking method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] Embodiment 1: The present invention thinks that after the Chinese sentence is normally segmented, if a plurality of consecutive words are all independent words, then this does not conform to the logical structure of the normal sentence, so an error must be made here. Based on this, the present invention first divides the document to be corrected into small particle sets with separators, performs word segmentation operations on all set elements through a specific word segmentation algorithm, and then searches and records whether there is a spelling error of an English word according to the English word database. Then calculate the word length of all word sets. If there are multiple consecutive independent words, that is, if there are multiple consecutive words whose length is 1, then record them in the wrong word set, and finally use the data in the wrong word set to process the error correction document. Error flagging, error-corrected documentation generated and exporte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a document correcting marking method, and belongs to the technical field of information processing. A document to be corrected is divided into the form of a granule set through a separator, through a specific word splitting algorithm, all set elements are subjected to word splitting, afterwards, according to an English word database, whether an English word is wrongly spelled or not is searched for, mistakes are recorded, then the word length of all word sets is calculated, if multiple independent words exist continuously, that is, the words of which the length is 1 continuously exist, the words are recorded into an wrong word set, and finally, the mistakes for the document to be corrected are marked through data in the wrong word set to generate a corrected document and importing the corrected document. Compared with the prior art, the phenomena in the prior art that the supporting performance of other languages except English is poor, particularly correctingmarks aiming at a Chinese document are imperfect, and the supporting performance is poor are avoided, and people stride to improve the supporting performance when currently, a computer is relied on tomark the mistakes for the Chinese document.

Description

technical field [0001] The invention relates to a document error correction marking method, which belongs to the technical field of information processing. Background technique [0002] Document error correction mark is a very important and commonly used technology in information processing technology. Usually, the WORD or WPS we use has a document error correction mark function, which is mainly used to remind document editors that there may be word spelling errors somewhere in the document. Or word logic errors, etc. [0003] At present, the traditional document error correction marking method is mainly used to mark wrong English words, and the support for non-English languages ​​such as Chinese is not good, because English does not involve word segmentation, and the independence between words is strong, while Unlike English, Chinese only needs to separate words based on spaces, and it can be judged whether the word is correct by looking up the English word database. Chin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/21G06F17/27
CPCG06F40/117G06F40/232
Inventor 龙华祁俊辉毕丹红唐菁敏
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products