
Text retrieval method and device

A text retrieval method and device that address the poor accuracy of retrieval results. The technique improves retrieval accuracy by eliminating the interference of high-frequency irrelevant words.

Inactive
Publication Date: 2019-07-16
BEIJING GRIDSUM TECH CO LTD
11 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0003] However, the algorithms currently in use generally rely on screening rules, such as the same cause of action or consistent applicable laws, to retrieve other documents similar to the input legal document, and the retrieval results obtained this way are often inaccurate.



Examples


Detailed Description of the Embodiments

[0052] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0053] As shown in Figure 1, a text retrieval method provided by an embodiment of the present invention may include:

[0054] Step 101, perform word segmentation on the search text to obtain a set of search terms.

[0055] Specifically, the present invention can use at least one of the word segmentation method based on thesaurus matching, the word segmentation method based on word frequency statistics, the word segmentation m...
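
As a rough illustration of the thesaurus-matching option mentioned in [0055], the sketch below uses greedy forward maximum matching, which is one common lexicon-based segmentation technique; the source does not say which matching strategy the patent actually uses. The function name `forward_max_match`, the `max_len` parameter, and the sample lexicon are all hypothetical.

```python
def forward_max_match(text, lexicon, max_len=4):
    """Greedy longest-match segmentation against a lexicon.

    Hypothetical helper: one possible form of thesaurus-based
    word segmentation, not the patent's stated implementation.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical lexicon of legal terms (sale, contract, sale contract, dispute).
lexicon = {"买卖", "合同", "买卖合同", "纠纷"}
search_terms = forward_max_match("买卖合同纠纷", lexicon)
print(search_terms)
```

Because the longest candidate is tried first, "买卖合同" is kept as one term instead of being split into "买卖" and "合同".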


Abstract

The invention discloses a text retrieval method and device. The method comprises the following steps: performing word segmentation on a retrieval text to obtain a retrieval word set; for each word in the retrieval word set, calculating the entropy sum of the word's left entropy and right entropy; selecting a preset number of words as a keyword set according to the entropy sums of the words; determining a word vector for each word in the keyword set; obtaining a text word set corresponding to at least one to-be-retrieved text, and determining a word vector for each word in that text word set; calculating the similarity between the word vector of each word in the keyword set and the word vector of each word in the text word set corresponding to the at least one to-be-retrieved text; and sorting and outputting the at least one to-be-retrieved text according to the similarity. The accuracy of the retrieval result is thereby improved.
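
The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under several assumptions that the source does not confirm: left and right entropy are computed from neighbouring tokens within the search text itself, the words with the highest entropy sum are kept, the word vectors come from a toy hand-written table, and a text's score is the mean over keywords of the best cosine similarity against its words. All function names and sample data are hypothetical.

```python
import math
from collections import Counter, defaultdict


def entropy(counter):
    """Shannon entropy (natural log) of a neighbour-count distribution."""
    total = sum(counter.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counter.values())


def entropy_sums(tokens):
    """Left entropy plus right entropy for every distinct token."""
    left, right = defaultdict(Counter), defaultdict(Counter)
    for i, tok in enumerate(tokens):
        if i > 0:
            left[tok][tokens[i - 1]] += 1   # neighbour on the left
        if i + 1 < len(tokens):
            right[tok][tokens[i + 1]] += 1  # neighbour on the right
    return {t: entropy(left[t]) + entropy(right[t]) for t in set(tokens)}


def select_keywords(tokens, k, highest=True):
    """Keep k words by entropy sum. Assumption: highest sums are kept."""
    sums = entropy_sums(tokens)
    ranked = sorted(sums, key=lambda t: (-sums[t] if highest else sums[t], t))
    return ranked[:k]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def score(keywords, text_words, vectors):
    """Assumed aggregation: mean over keywords of the best-matching word."""
    best = [
        max((cosine(vectors[k], vectors[w]) for w in text_words if w in vectors),
            default=0.0)
        for k in keywords if k in vectors
    ]
    return sum(best) / len(best) if best else 0.0


def rank_texts(keywords, candidates, vectors):
    """Sort candidate text names by descending similarity score."""
    return sorted(candidates,
                  key=lambda name: score(keywords, candidates[name], vectors),
                  reverse=True)


# Hypothetical, already-segmented search text and toy 2-d word vectors.
tokens = ["loan", "contract", "dispute", "loan", "interest", "dispute", "loan"]
vectors = {"loan": [1.0, 0.0], "dispute": [0.0, 1.0], "debt": [0.9, 0.1],
           "lawsuit": [0.1, 0.9], "weather": [0.5, 0.5]}
candidates = {"doc_a": ["debt", "lawsuit"], "doc_b": ["weather"]}

keywords = select_keywords(tokens, k=2)
ranking = rank_texts(keywords, candidates, vectors)
print(keywords, ranking)
```

In practice the word vectors would come from a trained embedding model and the entropy statistics from a larger corpus; both choices are left open here because the excerpt does not specify them.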

Description

Technical field

[0001] The invention relates to the technical field of text retrieval, in particular to a text retrieval method and device.

Background technique

[0002] Pushing legal documents refers to inputting a legal document and using a certain algorithm to obtain a series of other documents similar to the input legal document, so as to quickly find historical documents related to the currently input legal document (also called historical cases).

[0003] However, the algorithms currently used are generally based on some screening rules, such as the same cause of action, consistent applicable laws, etc., to retrieve other documents similar to the input legal documents, and the retrieval results obtained by this retrieval method are often inaccurate.

Contents of the invention

[0004] In view of the above problems, the present invention is proposed in order to provide a text retrieval method and device that overcomes the above problems or at least partially solves th...


Application Information

IPC(8): G06F16/33, G06F16/36
CPC: G06F16/334, G06F16/36
Inventor: 戴威
Owner: BEIJING GRIDSUM TECH CO LTD