Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for automatically detecting academic misconduct literature

An automatic detection and document technology, applied in the direction of instruments, calculations, electrical digital data processing, etc., can solve the problems of wasting matching time, unable to create features in tables, etc., achieve rapid detection, improve the accuracy and completeness of the effect

Active Publication Date: 2010-09-15
TONGFANG KNOWLEDGE NETWORK TECH CO LTD (BEIJING)
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above detection process is only the detection of single-layer features, and cannot create features for the tables in the documents; the matching is not a matching of one document against multiple documents at the same time, but a match between one document and two documents (such as figure 1 shown) wastes the time of matching; and the content is only a process of detecting plagiarism

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatically detecting academic misconduct literature
  • Method and system for automatically detecting academic misconduct literature
  • Method and system for automatically detecting academic misconduct literature

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0025] This embodiment provides a method for automatically detecting academic misconduct documents such as figure 2 As shown, the method includes:

[0026] Step 101 creates features for the hierarchical content of the document to be detected and the table data in the document.

[0027] Step 102 creates features for the stored document level content and the table data in the document;

[0028] The above-mentioned documents to be detected and the existing documents refer to arbitrary documents. The documents are processed hierarchically, and unique features are created according to the levels of chapters, paragraphs, and sentences.

[0029] Step 103 matches the hierarchical content features of the documents to be detected and the table data features in the documents to be detected with the hierarchical content features of the stored documents and the table features in the stored documents;

[0030] The first is to perform feature matching at the chapter level. If the entire chapter level...

Embodiment 2

[0041] Such as Image 6 Shown is the structure diagram of the system for detecting academic misconduct documents, including the feature area of ​​the documents to be detected, the comparison resource area of ​​the documents to be tested, the hierarchical feature matching area, and the judging area of ​​misconduct academic documents and types, among which the feature area of ​​the documents to be tested, Create features for the level content of the received documents to be tested and the table data in the documents; the comparison resource area of ​​the documents to be tested is used to create features for the stored document level content and the table data in the documents; the source of the documents to be tested can be It is freely designated by the user, and real-time generation of multi-layer content features of the document is added to the document feature library; the documents in the resource area of ​​the document comparison resource area to be tested can be documents i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a system for automatically detecting an academic misconduct literature. The method comprises the following steps of: establishing characteristics of a hierarchical content of a literature to be detected and table data in the literature; establishing characteristics of hierarchical contents of stored literatures and table data in the literatures; matching the hierarchical content characteristics of the literature to be detected and the table data characteristics in the literature to be detected with the hierarchical content characteristics of all the stored literatures and the table characteristics in the stored literatures; and judging whether the literature to be detected contains academic misconduct contents , academic misconduct table data and types of the academic misconduct contents or not. The system comprises a characteristic region of the literature to be detected, a comparison resource region of the literature to be detected, a hierarchical content characteristic region and a misconduct academic literature and type judgement region. Through a hierarchical multi-stage characteristic structure, the invention can fast detect an ultra-long literature, satisfy the detection of a minimum characteristic granularity short verse of the literature and enhance the pertiency factor and the recall ratio. In addition, the invention also supports to establish the table data characteristics in the literature and disposable matching for matching all the literatures.

Description

Technical field [0001] The invention relates to the fields of intelligent information processing and computer technology, and in particular to a method and system for automatically detecting academic misconduct documents and table data in the documents. Background technique [0002] With the rapid development and rapid popularization of the Internet, electronic texts currently published on the Internet have become a focus of current intellectual property protection. Because electronic texts are easy to copy and download, they have become the object of research and quotation by many people. Some electronic texts are copied on large pages and are considered plagiarism from time to time. The current electronic text protection measures on the Internet are mainly through blocking and detection methods. [0003] At present, there are also methods for plagiarism of electronic text content. For example, the patent application number is "200810232309.8 A method for detecting and locating e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 张振海孙雄勇
Owner TONGFANG KNOWLEDGE NETWORK TECH CO LTD (BEIJING)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products