Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for determining text retrieval and ranking

A technology for retrieval sorting and determining methods, applied in the field of data processing, can solve the problem that the arrangement method cannot obtain the best retrieval results, etc., and achieve the effect of improving retrieval efficiency and accuracy.

Active Publication Date: 2020-05-22
BEIJING HEXIANG WISDOM TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Therefore, the present invention provides a method and system for determining the retrieval, selection and ranking of documents, which overcomes the inadequacy of the inability to obtain the best retrieval results caused by the different arrangements of document retrieval in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for determining text retrieval and ranking
  • Method and system for determining text retrieval and ranking
  • Method and system for determining text retrieval and ranking

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] An embodiment of the present invention provides a method for determining text retrieval and sorting, which can be applied to electronic devices, and the electronic device can be a server or a terminal, such as figure 1 As shown, the method includes the following steps:

[0032] Step S1: Obtain the target text to be retrieved and a set of candidate texts.

[0033] In practical applications, the target text to be retrieved includes but is not limited to technical documents, patents, academic papers, etc. In the embodiment of the present invention, the target text is described using patents as an example, and the candidate text set may be a candidate patent set. The server can receive the target patents to be searched input by the user on the user terminal, and obtain the candidate patent collection from the patent database. According to the usage scenario, it may be the patent of the whole database, or it may be a patent collection customized by other means. For example,...

Embodiment 2

[0142] An embodiment of the present invention provides a system for determining the retrieval order of texts, such as Figure 7 As shown, the system includes:

[0143] The target text and candidate text set acquisition module 1 is configured to acquire the correlation metric value between the target text and each text in the candidate text set. This module executes the method described in step S1 in Embodiment 1, which will not be repeated here.

[0144] An association metric acquisition module 2, configured to acquire an association metric between the target text and each text in the candidate text set. This module executes the method described in step S2 in Embodiment 1, which will not be repeated here.

[0145] The first text set construction module 3 is used to sort each text in the candidate text set according to the first preset rule by using the correlation measure value, and construct the first text set according to the first preset filter condition; The module exec...

Embodiment 3

[0149] An embodiment of the present invention provides a computer device, such as Figure 8 As shown, it includes: at least one processor 401 , such as a CPU (Central Processing Unit, central processing unit), at least one communication interface 403 , memory 404 , and at least one communication bus 402 . Wherein, the communication bus 402 is used to realize connection and communication between these components. Wherein, the communication interface 403 may include a display screen (Display) and a keyboard (Keyboard), and the optional communication interface 403 may also include a standard wired interface and a wireless interface. The memory 404 may be a high-speed RAM memory (Ramdom Access Memory, volatile random access memory), or a non-volatile memory (non-volatile memory), such as at least one disk memory. Optionally, the memory 404 may also be at least one storage device located away from the aforementioned processor 401 . where processor 401 can execute figure 1 In the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text retrieval sorting determination method and system. The method comprises the following steps of obtaining a to-be-retrieved target text and a candidate text set; obtaining an association degree measurement value of the target text and each text in the candidate text set; sorting each text in the candidate text set according to a first preset rule by utilizing the correlation measurement value, and constructing a first text set according to a first preset screening condition; and sorting each text in the first text set according to a second preset rule to obtain aretrieval sorting result of the target text. According to the embodiment provided by the invention, the advantages of multiple algorithms are integrated, the accuracy of a patent retrieval result is improved, and the retrieval efficiency of a user is improved.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a method and system for determining text retrieval and sorting. Background technique [0002] In the prior art, when retrieving documents (such as journal papers, patents, etc.), multiple existing similarity calculation methods (such as structural analysis, semantic analysis, keyword analysis, etc.) are used to search for candidate documents. Different sorting results can be obtained after sorting; in addition, for the same type of similarity calculation method, there may also be different results. For example, taking semantic analysis as an example, for the same pair of patent original There are also differences in the similarity calculation results between them. Therefore, for the same target patent, for different solutions, the similarity of candidate patents can be arranged in various ways, and each method has its own sorting rules, and the sorting results obtained may be quit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/338
Inventor 郭永红
Owner BEIJING HEXIANG WISDOM TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products