Cluster-based text duplicate checking method
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- CHINA ACAD OF LAUNCH VEHICLE TECH
- Publication Date
- 2017-02-22
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to the technical field of text data analysis and mining, in particular to a clustering-based text plagiarism checking method. Background technique
[0002] In recent years, with the frequent occurrence of fraudulent incidents in academia and the increasing calls for intellectual property protection, the research on text plagiarism checking technology has gradually become a research hotspot for relevant experts and scholars. At present, some scholars at home and abroad have proposed text plagiarism checking methods, which can be mainly divided into the following categories after induction:
[0003] 1. Text plagiarism checking method based on the sememe space of HowNet.
[0004] In this method, the text is firstly segmented, and then the split words are further divided into smaller semantic units "sememes". "HowNet" is based on sememes, and uses a formalized language (similar to ontology description language) to organize sememes t...