Science and technology project duplicate checking method for automatically realizing field weight allocation based on deep learning algorithm

A technology project and deep learning technology, applied in the field of data retrieval and comparison, can solve the problem of low efficiency of duplicate checking

Pending Publication Date: 2020-03-31
广西壮族自治区科学技术情报研究所
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a method for plagiarism checking of scientific and technological items based on a deep learnin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Science and technology project duplicate checking method for automatically realizing field weight allocation based on deep learning algorithm
  • Science and technology project duplicate checking method for automatically realizing field weight allocation based on deep learning algorithm
  • Science and technology project duplicate checking method for automatically realizing field weight allocation based on deep learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below through specific embodiments in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0029] The present invention provides a scientific and technological item checking method based on a deep learning algorithm to automatically realize field weight distribution, including the following steps:

[0030] Step 1: Extract the target text in the specified field of the target file, and divide the target text into keywords; for example, select the target file, set the specified field to "technical content", and in the "technical content" field of the target file Extracted the target text of "applying the game engine UDK technology to virtualize and digitize the extracted charac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a science and technology project duplicate checking method for automatically realizing field weight allocation based on a deep learning algorithm, which comprises the followingsteps: extracting a target text from a specified field of a target file, and segmenting the target text into keywords; retrieving a to-be-queried file containing a single keyword in a database, and setting a weight value of the keyword; utilizing a neural network to establish a weight evaluator to evaluate and sort the to-be-checked files containing the keywords; selecting a to-be-queried file with the highest relevancy, and extracting a comparison text from a specified field of the to-be-queried file; establishing a comparison matrix, and calculating the similarity between the target text andthe comparison text according to the scale of the sub-matrix; according to the science and technology project duplicate checking method for automatically realizing field weight distribution based onthe deep learning algorithm, learning training is conducted on related samples through the neural network, and after training is completed, a file similarity comparison (duplicate checking) task can be efficiently and rapidly completed.

Description

technical field [0001] The invention belongs to the technical field of data retrieval and comparison, and in particular relates to a scientific and technological item checking method for automatically realizing field weight distribution based on a deep learning algorithm. Background technique [0002] At present, the duplication rate detection of papers / projects mainly uses detection systems such as PaperPass, Wanfang, and HowNet, and uses string matching algorithms to calculate the similarity ratio of the files to be detected relative to the target files in the file library. The string matching algorithm uses a piece of text to be completely consistent as the standard for measuring the repetition of papers. However, due to the complexity of the Chinese language and the diversity of expressions, for two paragraphs of text with the same substantive content, there are often some meaningless gaps in the middle. "Stop words" or function words or situations such as subject-verb-o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/903G06F40/194
CPCG06F16/90344
Inventor 谢积鉴陈旭红粟月萍钟雪梅胡婷婷玉泉陈金平李荣陈怡玲卢琳玲
Owner 广西壮族自治区科学技术情报研究所
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products