A citation-based paper originality detection method

A citation-based originality detection method, applied in special-purpose data processing, that addresses the problem of inaccurate detection results and is intended to help raise the level of scientific research.

Active Publication Date: 2019-01-29
HARBIN ENG UNIV

AI Technical Summary

Problems solved by technology

[0010] The purpose of the present invention is to solve the problem of inaccurate detection results described above and to provide a citation-based method for detecting the originality of papers.



Examples


Embodiment Construction

[0033] The present invention will be described in more detail below in conjunction with the accompanying drawings.

[0034] With reference to Figures 1 to 5, the present invention comprises the following steps:

[0035] 1. Corpus processing. Articles are searched for and located with a web search engine using heuristic rules. Downloaded articles require format conversion; for the convenience of experiments, they are converted to UTF-8 encoded plain text. Each plain-text file is first checked to determine whether it is a valid scientific document, that is, whether it contains references. Documents containing incomplete or incorrect citations are also removed from the experimental document set. The text is then normalized, and a simple baseline method is used to identify citations pointing to the same article and group them together. This method needs to go through all the bibliographies and then arrange them from lo...
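The corpus-processing step can be illustrated with a minimal Python sketch. It is not the patent's implementation: the heading pattern, the `(first author surname, year)` grouping key, and the function names `split_references`, `citation_key`, and `group_citations` are all assumptions made for illustration. It only shows one plausible way to check that a plain-text document has a reference section, drop incomplete entries, and group entries that appear to point to the same article.

```python
import re
import unicodedata
from collections import defaultdict

# Assumed heading pattern; real documents may use other headings.
REF_HEADING = re.compile(
    r"^[ \t]*(?:references|bibliography)[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)
YEAR = re.compile(r"\b(?:19|20)\d{2}\b")


def split_references(text):
    """Return (body, reference_lines); the list is empty if no heading is found."""
    matches = list(REF_HEADING.finditer(text))
    if not matches:
        return text, []
    last = matches[-1]  # the reference list sits at the end of the document
    body = text[:last.start()]
    refs = [ln.strip() for ln in text[last.end():].splitlines() if ln.strip()]
    return body, refs


def normalize(s):
    """Unicode-normalize, lowercase, and collapse whitespace."""
    s = unicodedata.normalize("NFKC", s).lower()
    return re.sub(r"\s+", " ", s).strip()


def citation_key(entry):
    """Hypothetical key (first author surname, year); None if the entry is incomplete."""
    entry = re.sub(r"^\[\d+\]\s*", "", normalize(entry))  # drop a leading "[12]" label
    year = YEAR.search(entry)
    author = re.match(r"[^\W\d_]+", entry)  # first run of letters
    if not year or not author:
        return None
    return (author.group(0), year.group(0))


def group_citations(entries):
    """Group reference entries that share a key; incomplete entries are skipped."""
    groups = defaultdict(list)
    for e in entries:
        key = citation_key(e)
        if key is not None:
            groups[key].append(e)
    return dict(groups)
```

A document for which `split_references` returns an empty list would be treated as not being a valid scientific document under this sketch. The author-and-year key is deliberately crude: two different papers by the same first author in the same year would be merged, so a real system would need a stricter comparison.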


Abstract

The invention relates to a citation-based method for detecting the originality of papers, in the field of paper retrieval and comparison. It studies plagiarism from the perspective of citations and designs citation features of the text with which to analyze them. The body text is separated from the references at the end of the document, the reference string is segmented, and a bibliographic list is created from the author and year of publication of each entry, as extracted by a parser. A text in the experimental set is selected if the number of citations it shares exceeds a threshold. In the next stage, the longest common reference sequence of the selected documents is analyzed, and documents whose value is less than a certain threshold are eliminated from the experimental text set. Citation analysis is then carried out on the texts that passed the first two stages, and the maximum overlap number of the reference blocks is used to measure the degree of plagiarism. The invention is of significance for the detection of academic misconduct and is conducive to standardizing the academic atmosphere and raising the level of scientific research.
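The first two screening stages described in the abstract can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the patented procedure: each document is assumed to be already reduced to the ordered list of citation keys appearing in its text, the "longest common reference sequence" is interpreted here as a classic longest common subsequence (gaps allowed), and the threshold values and the names `shared_citations`, `longest_common_citation_sequence`, and `screen` are hypothetical. The third stage (maximum overlap of reference blocks) is not defined precisely enough in this excerpt to sketch.

```python
def shared_citations(a, b):
    """Stage 1 measure: number of distinct citations two documents have in common."""
    return len(set(a) & set(b))


def longest_common_citation_sequence(a, b):
    """Stage 2 measure: length of the longest common subsequence of two citation lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            if x == y:
                cur.append(prev[j - 1] + 1)
            else:
                cur.append(max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def screen(suspect, candidates, shared_threshold=1, lcs_threshold=2):
    """Return {name: lcs_length} for candidates that pass both stages.

    suspect: ordered list of citation keys for the document under test.
    candidates: dict mapping a document name to its ordered citation keys.
    Both thresholds are placeholder values, not taken from the patent.
    """
    kept = {}
    for name, cites in candidates.items():
        # Stage 1: keep only documents whose shared citations exceed the threshold.
        if shared_citations(suspect, cites) <= shared_threshold:
            continue
        # Stage 2: eliminate documents whose common sequence is below the threshold.
        lcs = longest_common_citation_sequence(suspect, cites)
        if lcs < lcs_threshold:
            continue
        kept[name] = lcs
    return kept
```

For example, a candidate that cites the same four sources as the suspect but in reverse order passes the first stage and is eliminated by the second, because its longest common sequence with the suspect has length 1. This shows why the order-sensitive stage adds information beyond the simple count of shared citations.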

Description

Technical field

[0001] The invention relates to the field of paper retrieval and comparison, in particular to a citation-based method for detecting the originality of papers.

Background technique

[0002] The concept of bibliographic coupling is very practical as a measure of subject similarity. Two documents are considered bibliographically coupled if they have at least one reference in common. Coupling strength is characterized by the number of shared references.

[0003] The bibliographic coupling approach characterizes the relationship between documents based on the earlier documents that each author chose to cite. This relationship is static and intrinsic to the coupled documents, since it depends only on the respective cited works and does not change over time.

[0004] Some researchers have questioned the validity of bibliographic coupling as a measure of similarity. Bibliographic coupling can only represent the probability of correlati...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22
CPC: G06F40/194, Y02D10/00
Inventor: 刘刚, 王贺飞, 杨笑笑
Owner: HARBIN ENG UNIV