Article similarity detection method and device

A detection method and similarity technology, applied in the field of information processing, can solve the problems of reduced detection efficiency, false detection, missed detection, etc., to achieve the effect of ensuring accuracy, improving detection efficiency, and reducing comparisons

Pending Publication Date: 2022-01-04
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD +1
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The keyword-based detection method does not consider the position of the keyword in the sentence, which will lead to false detection and missed detection.
Although the detection method based on sentence granularity has a high detection accuracy, if the length of the article is long or the number of articles to be detected is large, the detection efficiency will decrease.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Article similarity detection method and device
  • Article similarity detection method and device
  • Article similarity detection method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] The technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present disclosure. Obviously, the described embodiments are only a part of the embodiments of the present disclosure, but not all of the embodiments.

[0016] With the rapid development of the Internet and the rapid dissemination of information, a large number of literary and novel creation websites have been formed. With the development of users' payment habits, it brings a lot of income to creators and a virtuous circle of creation ecology. However, driven by interests, some criminals use manual or tools to process and plagiarize the content of literature and novels. The methods are not limited to changing the names of main characters and locations, uniform replacement of sentence granularity, and imitation of plots. In order to protect the interests of creators, it is necessary to pe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an article similarity detection method and device. The method comprises the following steps of: extracting text features of a to-be-detected first article and a to-be-detected second article by using a neural network model; determining the similarity of the text features of the first article and the second article; and under the condition that the text features of the first article and the second article are similar, performing sentence granularity comparison on the first article and the second article to determine the similarity of the first article and the second article. In addition, the similarity detection result of the first article and the second article can be uploaded to a block chain system for evidence storage.

Description

technical field [0001] The present disclosure relates to the technical field of information processing, and in particular, to a method and device for detecting similarity of articles. Background technique [0002] With the rapid development of the Internet and the rapid dissemination of information, the dissemination of literary works and novels has become faster and faster, and more and more infringements have followed. In order to protect the interests of creators, it is necessary to perform similarity detection on articles. [0003] There are currently two detection methods, one is a keyword-based detection method, and the other is a sentence-granularity-based detection method. The keyword-based detection method does not consider the position of the keyword in the sentence, which will lead to the problem of false detection and missed detection. Although the detection method based on sentence granularity has a high detection accuracy, if the length of the article is long...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/194G06F40/211G06F40/295G06F16/33G06F16/383G06K9/62G06N3/04G06N3/08
CPCG06F40/194G06F40/211G06F40/295G06F16/3344G06F16/383G06N3/04G06N3/08G06F18/22
Inventor 黄凯明张晓博
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products