Article duplicate checking detection method, device and equipment, and storage medium

A detection method and article technology, applied in the field of information processing, can solve the problems of interfering with the duplication checking system, failure of the paper duplication checking system, affecting the accuracy of duplication checking and detection, etc., to achieve the effect of improving the accuracy

Pending Publication Date: 2019-11-19
SHANGHAI XIAOI ROBOT TECH CO LTD
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing paper plagiarism checking system can find the similarity between the papers to be checked and the papers uploaded by other people on the Internet by comparing texts, but some cheating software replaces a large number of synonyms to make plagiarism detection by comparing texts. The plagiarism check system of the paper is invalid, and the order of the content of the original text is artificially changed, which will also interfere with the above plagiarism check system, thus affecting the accuracy of the plagiarism check

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Article duplicate checking detection method, device and equipment, and storage medium
  • Article duplicate checking detection method, device and equipment, and storage medium
  • Article duplicate checking detection method, device and equipment, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] figure 1 It is a flow chart of a method for checking and detecting plagiarism of an article in Embodiment 1 of the present invention. The technical solution of this embodiment is suitable for performing the analysis based on the key sentences extracted from the article to be checked and the key descriptive features extracted from the reference article. In the case of article plagiarism detection, the method can be executed by the article plagiarism detection device, which can be implemented by software and / or hardware, and can be integrated in various general-purpose computer equipment, specifically comprising the following steps:

[0028] Step 110, perform semantic analysis on the article to be checked for duplicates, and determine at least one key sentence set corresponding to the article to be checked for duplicates, and the key sentences in the same set of key sentences correspond to the same article point of view;

[0029] Among them, the set of key sentences is a ...

Embodiment 2

[0038] figure 2 It is a flow chart of a method for detecting plagiarism of an article provided by Embodiment 2 of the present invention. This embodiment is further refined on the basis of the above-mentioned embodiment, and provides a semantic analysis of the article to be checked for plagiarism, and determines the content of the article to be checked. Concrete steps of at least one key sentence set corresponding to the article. Combine below figure 2 A method for checking and detecting articles for plagiarism provided in Embodiment 2 of the present invention will be described, including the following steps:

[0039] Step 210: Filter the sentences included in the article to be checked for duplicates according to preset conditions to obtain a set of candidate key sentences.

[0040] In this embodiment, in order to extract the core point of view of the article to be checked for plagiarism, the article to be checked for plagiarism is first split in units of sentences to obtai...

Embodiment 3

[0064] image 3 It is a flow chart of a method for checking plagiarism of an article in Embodiment 3 of the present invention. This embodiment is further refined on the basis of the above-mentioned embodiment, and provides semantic analysis of the article to be checked for plagiarism, and determines the content of the article to be checked. Specific steps before at least one key sentence set corresponding to the main article and specific steps after obtaining at least one key descriptive feature respectively corresponding to at least one reference article. Combine below image 3 A method for checking and detecting articles for plagiarism provided in Embodiment 3 of the present invention will be described, including the following steps:

[0065] Step 310, perform semantic analysis on the reference article, and determine at least one key sentence set corresponding to the reference article as a comparison key sentence set.

[0066] In this embodiment, in order to accurately mat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses an article duplicate checking detection method and device, equipment and a storage medium. The duplicate checking detection method for the article comprises the steps of conducting semantic analysis on the article to be subjected to duplicate checking, and determining at least one key sentence set corresponding to the article to be subjected to duplicate checking; obtaining at least one key description feature corresponding to the at least one reference article; and respectively matching each key sentence set of the to-be-duplicated article with each key description feature of each reference article, and determining key feature similarity between the to-be-duplicated article and each reference article according to a matching result so as to carry out duplicate checking detection on the to-be-duplicated article. According to the technical scheme of the embodiment of the invention, the core viewpoint in the to-be-duplicated article is matched with the core viewpoint of the reference article, so that the influence on the duplicate checking detection result due to synonym replacement or article content sequence change is avoided, and the duplicate checking detection accuracy of the article is improved.

Description

technical field [0001] Embodiments of the present invention relate to information processing technologies, and in particular to a method, device, equipment and storage medium for plagiarism checking and detection of articles. Background technique [0002] With the rapid development of network technology, network users can easily obtain research results and dissertations published by others on the network. Nowadays, there is a need to write papers in many jobs, such as teachers, doctors, and students' graduation defenses. In order to verify the originality of the papers, it is usually necessary to check the papers for plagiarism. [0003] The existing paper plagiarism checking system can find the similarity between the papers to be checked and the papers uploaded by other people on the Internet by comparing texts, but some cheating software replaces a large number of synonyms to make plagiarism detection by comparing texts. The plagiarism checking system of the paper is inva...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22G06F17/27
Inventor 李陟
Owner SHANGHAI XIAOI ROBOT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products