
Similar document retrieval method and device, electronic equipment and storage medium

A similar document retrieval technology, applied in the fields of document retrieval devices, electronic equipment and storage media, that solves the problems of high time cost and low efficiency in existing similar document retrieval methods, thereby reducing time cost and improving retrieval efficiency.

Pending Publication Date: 2022-03-01
E-SURFING DIGITAL LIFE TECH CO LTD

AI Technical Summary

Problems solved by technology

[0007] The invention provides a similar document retrieval method, device, electronic equipment and storage medium, which are used to solve the technical problems of high time cost and low efficiency in existing similar document retrieval methods.



Embodiment Construction

[0063] The embodiments of the present invention provide a similar document retrieval method, device, electronic equipment and storage medium, which are used to solve the technical problems of high time cost and low efficiency in the existing similar document retrieval method.

[0064] In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention are described clearly and completely below in conjunction with the accompanying drawings. The embodiments described below are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0065] See Figure 1. Figure 1 is a flowchart of steps o...


Abstract

The invention discloses a similar document retrieval method and device, electronic equipment and a storage medium, which solve the technical problems of high time cost and low efficiency in existing similar document retrieval methods. The method comprises: obtaining a training document library, where the training document library comprises a plurality of documents and each document has a corresponding document ID; constructing a training data set from the documents; training a neural network with the training data set to obtain a target neural network; receiving a target document and generating a target training data set from it; inputting the target training data set into the target neural network to obtain a target vector of the target document; and calculating a difference value between the target vector and each comparison vector in a preset database, and taking the document corresponding to any comparison vector whose difference value is smaller than a preset threshold as a similar document.
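The final comparison step of the abstract can be sketched as follows. This is a minimal illustration only, not the patent's implementation: the abstract does not say how the "difference value" is computed, so Euclidean distance is assumed here, and the function names, document IDs, vectors and threshold are all hypothetical.

```python
import math
from typing import Dict, List, Sequence


def euclidean_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Difference value between two vectors.

    Assumption: the source does not name the metric; Euclidean distance
    is used here purely for illustration.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def find_similar_documents(
    target_vector: Sequence[float],
    comparison_vectors: Dict[str, Sequence[float]],
    threshold: float,
) -> List[str]:
    """Return the IDs of documents whose difference value is below the threshold."""
    return [
        doc_id
        for doc_id, vector in comparison_vectors.items()
        if euclidean_distance(target_vector, vector) < threshold
    ]


# Hypothetical stand-in for the preset database: document ID -> comparison vector.
database = {
    "doc-1": [0.10, 0.90, 0.30],
    "doc-2": [0.80, 0.10, 0.50],
    "doc-3": [0.12, 0.88, 0.33],
}

# Hypothetical target vector, as if produced by the target neural network.
target = [0.11, 0.91, 0.31]

similar = find_similar_documents(target, database, threshold=0.1)
print(similar)
```

In this sketch the database is scanned linearly; the threshold controls how strict the notion of "similar" is, with smaller values returning fewer documents.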

Description

Technical field

[0001] The present invention relates to the technical field of document retrieval, in particular to a similar document retrieval method, device, electronic equipment and storage medium.

Background

[0002] With the advancement of the information society, there are more and more documents (such as academic papers, novels and news). With current technology, computers can still quickly retrieve documents that meet specific conditions (such as specific titles or specific keywords) from a huge document library. However, retrieving by title or keyword alone is not enough to support all application scenarios. Sometimes it is necessary to retrieve the set of documents similar to a target document. For example, a publication review system needs to detect whether an article marked as original is really original, and similar documents need to be eliminated during corpus construction. However, how to quickly retrieve a set of similar documents to a target document from a h...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/332, G06F16/35, G06F40/289, G06N3/04
CPC: G06F16/332, G06F16/35, G06F40/289, G06N3/04
Inventors: 杨珉, 孙立奋, 毛绍嵘
Owner: E-SURFING DIGITAL LIFE TECH CO LTD