Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Chinese text proofreading method based on a knowledge graph

A knowledge map and text technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as incomplete sentence components, contradictory sentence definitions, unrecognizable syntax and semantic errors of sentences, etc.

Active Publication Date: 2019-06-21
ZHEJIANG GONGSHANG UNIVERSITY
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the thesaurus is not updated in time, the proofreading effect will be affected, and due to excessive reliance on the thesaurus, often only the word errors in the text can be proofread, and the syntax and semantic errors in the sentence cannot be identified, such as incomplete components of the sentence, gaps between sentences, etc. definition contradiction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Chinese text proofreading method based on a knowledge graph
  • A Chinese text proofreading method based on a knowledge graph
  • A Chinese text proofreading method based on a knowledge graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0052] In order to facilitate the understanding and implementation of the present invention by those skilled in the art, a specific implementation example of the method described in the present invention is now given. The core idea of ​​providing Chinese text proofreading is to use the knowledge graph to compare the text to be proofread with the reference text to search for syntax and semantic errors in the text to be proofread, thereby providing a specific implementation plan for Chinese text proofreading.

[0053] Focusing on text proofreading for building university data structure textbooks, a case is used below to describe this embodiment.

[0054] The data of the case comes from Wikipedia and the teaching materials of a data structure course in a university. The Wikipedia corpus is taken from the website: https: / / dumps.wikimedia.org / zhwiki / latest / zhwiki-latest-pages-articles.xml.bz2 .

[0055] Since all files in the Wikipedia corpus are web pages, it is first necessary t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese text proofreading method based on a knowledge graph. Firstly, an entity extraction technology is used for extracting to obtain an entity in a text statement, and thenaccording to a matching result of the entity and a relation rule, a syntactic semantic error type is searched and discovered in a knowledge graph. According to the method, dependence on a large-scaleword library can be avoided, and semantic proofreading is conducted on texts from the three aspects of wrongly written characters, component deletion and definition contradiction. Compared with an existing Chinese automatic proofreading system, the proofreading method has high recall ratio for proofreading various semantic errors in the limited field. The method can effectively improve the accuracy and recall ratio of text proofreading, and is helpful for text workers to improve the text quality.

Description

technical field [0001] The invention relates to the field of text proofreading, and relates to a Chinese text proofreading method based on a knowledge graph. Background technique [0002] The wide application of computers has given birth to automatic proofreading tools for Chinese text, which replace the time-consuming and laborious traditional manual proofreading. The most common one is the Chinese automatic proofreading tool Office Proofing Tools that comes with Office. Other widely used proofreading tools include proofreading assistants, small Red pen, dark horse proofreading system, etc. However, the existing proofreading tools can only proofread words based on a large-scale lexicon, and it is difficult to perform syntactic and semantic proofreading. Moreover, these proofreading tools are paid software, and users need to pay a relatively expensive fee to use them. [0003] The existing text automatic proofreading technologies mainly include local language features based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27
CPCY02D10/00
Inventor 董黎刚邵红蒋献汤柳君吴梦莹索同鹏
Owner ZHEJIANG GONGSHANG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products