Unlock instant, AI-driven research and patent intelligence for your innovation.

Text retrieval method and device

A text and text similarity technology, applied in the direction of unstructured text data retrieval, text database query, special data processing applications, etc., can solve the problems of poor accuracy of retrieval results, achieve accuracy improvement, and eliminate high-frequency irrelevant words Interference, the effect of ensuring accuracy

Inactive Publication Date: 2019-07-16
BEIJING GRIDSUM TECH CO LTD
View PDF10 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the currently used algorithms are generally based on some screening rules, such as the same cause of action, consistent applicable laws, etc., to retrieve other documents similar to the input legal documents, and the retrieval results obtained by this retrieval method are often less accurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text retrieval method and device
  • Text retrieval method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0055] Such as figure 1 As shown, a text retrieval method provided by an embodiment of the present invention may include:

[0056] Step 101: Perform word segmentation on the search text to obtain a set of search terms.

[0057] Specifically, the present invention can adopt at least one of a word segmentation method based on thesaurus matching, a word segmentation method based on word frequency statistics, a word...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text retrieval method and device. The method comprises the following steps: carrying out word segmentation on a retrieval text to obtain a retrieval word set; for each word in the retrieval word set, calculating a TextRank value of each word by adopting a TextRank algorithm; selecting a preset number of words as a keyword set according to the TextRank value of each word;determining a word vector of each word in the keyword set; obtaining a text word set corresponding to at least one to-be-retrieved text, and determining a word vector of each word in the text word setcorresponding to the at least one to-be-retrieved text; calculating the similarity between the word vector of each word in the keyword set and the word vector of each word in the text word set corresponding to the at least one to-be-retrieved text; and sorting and outputting the at least one to-be-retrieved text according to the similarity. The accuracy of the retrieval result is improved.

Description

Technical field [0001] The present invention relates to the technical field of text retrieval, in particular to a text retrieval method and device. Background technique [0002] Pushing legal documents refers to inputting a legal document, using a certain algorithm to obtain a series of other documents similar to the imported legal document, so as to quickly find the historical documents (also called Historical cases). [0003] However, the algorithms currently used are generally based on some screening rules, such as the same case and the same applicable legal provisions, to retrieve other documents similar to the input legal documents. The retrieval results obtained by this retrieval method are often less accurate. Summary of the invention [0004] In view of the above problems, the present invention is proposed to provide a text retrieval method and device that overcomes the above problems or at least partially solves the above problems. The technical solutions are as follows: ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F17/27
CPCG06F40/289G06F40/30G06F16/3344
Inventor 戴威
Owner BEIJING GRIDSUM TECH CO LTD