Text retrieval method and device

A text retrieval technology, applied in the fields of instruments, computing, and electronic digital data processing. It addresses the problems that extracted keywords may not meet the user's retrieval requirements, that the retrieval results then fail to meet those requirements, and that retrieval accuracy is reduced.

Active Publication Date: 2014-06-25
STATE GRID CORP OF CHINA +3
View PDF3 Cites 53 Cited by

AI Technical Summary

Problems solved by technology

[0004] From the above technical solutions, it can be seen that existing text segmentation cannot fully capture the user's retrieval needs. The keywords extracted during segmentation may be invalid words, and even keywords that are not invalid words may not meet the user's retrieval requirements. As a result, fuzzy full-text retrieval based on these keywords returns texts that do not meet the retrieval requirements, and retrieval accuracy is reduced.


Embodiment Construction

[0053] In existing keyword-based text retrieval, the keywords obtained by word segmentation of the original text are used directly for retrieval. For example, segmenting the original text "an image matching device based on an image recognition method" yields the keywords "a kind of, based on, image recognition, method, image matching, device". Of these, "a kind of, based on, method, device" are clearly invalid words that contribute little to retrieval. When retrieval is based on these keywords, most of the retrieved texts do not meet the user's retrieval needs, which reduces retrieval accuracy.
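The effect described in [0053] can be illustrated with a small sketch. The keyword list is taken from the example above; the mini-corpus and the helper name `document_frequency` are assumptions introduced only for illustration and do not come from the patent. The sketch measures how many texts each keyword matches, which suggests why the invalid words pull in unrelated results.

```python
# Keywords produced by segmenting the example text in [0053].
segmented = ["a kind of", "based on", "image recognition",
             "method", "image matching", "device"]

# Hypothetical mini-corpus standing in for a text database (assumption).
corpus = [
    "a kind of power cable laying method based on trench robots",
    "a kind of device and method for transformer oil sampling",
    "image matching device based on an image recognition method",
    "a kind of insulator cleaning device based on water jets",
]


def document_frequency(term, texts):
    """Return the fraction of texts that contain the term as a substring."""
    hits = sum(1 for text in texts if term in text)
    return hits / len(texts)


# Share of the corpus matched by each keyword.
df = {term: document_frequency(term, corpus) for term in segmented}
for term, value in df.items():
    print(f"{term!r}: {value:.2f}")
```

In this toy corpus the four invalid words each match three of the four texts, while "image recognition" and "image matching" match only the one relevant text.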

[0054] For this reason, the text retrieval method provided by the embodiment of the present invention filters the search terms obtained by word segmentation according to the user's retrieval needs to obtain keywords, so that when text is retrieved based on the combined keywords, the retrieved text is more in line with the user's...


Abstract

The embodiment of the invention provides a text retrieval method and device. The text retrieval method includes the following steps: an original text input by a user is acquired; search terms are acquired from the original text; the search terms are filtered according to the user's retrieval requirement to acquire keywords; the keywords are combined, and texts in a text database are retrieved according to the combined keywords to acquire at least one retrieved text; the retrieved texts are displayed in descending order of relevance, with the keywords highlighted in them, where relevance represents the degree of correlation between the original text and a retrieved text. Because the keywords are acquired by filtering the search terms according to the user's retrieval requirement, the probability that a keyword is an invalid word is reduced, and the keywords meet the retrieval requirement better than search terms taken directly from the original text. The texts retrieved with the combined keywords therefore meet the retrieval requirement better, and retrieval accuracy is improved.
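The steps in the abstract can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not the patented implementation: the invalid-word set, the `user_excluded` parameter, the OR-style combination of keywords, the relevance measure (share of keywords found in a text), and the `**` highlight markers are all hypothetical choices, since the text shown here does not specify them.

```python
import re

# Assumed stand-in for "invalid words"; the source does not define this set.
INVALID_WORDS = {"a kind of", "based on", "method", "device"}


def filter_terms(search_terms, user_excluded=frozenset()):
    """Drop invalid words and any terms the user marks as irrelevant."""
    return [t for t in search_terms
            if t not in INVALID_WORDS and t not in user_excluded]


def relevance(keywords, text):
    """Assumed relevance: share of keywords that occur in the text."""
    if not keywords:
        return 0.0
    return sum(1 for k in keywords if k in text) / len(keywords)


def highlight(keywords, text):
    """Wrap every keyword occurrence in ** markers using one regex pass."""
    if not keywords:
        return text
    ordered = sorted(keywords, key=len, reverse=True)  # prefer longer matches
    pattern = "|".join(re.escape(k) for k in ordered)
    return re.sub(pattern, lambda m: f"**{m.group(0)}**", text)


def retrieve(search_terms, database, user_excluded=frozenset()):
    """Filter terms, match texts on any keyword, rank by descending relevance."""
    keywords = filter_terms(search_terms, user_excluded)
    scored = [(relevance(keywords, text), text) for text in database]
    matched = [(score, text) for score, text in scored if score > 0]
    matched.sort(key=lambda pair: pair[0], reverse=True)
    return [(score, highlight(keywords, text)) for score, text in matched]


# Hypothetical text database and the search terms from the patent's example.
database = [
    "image recognition for insulator defects",
    "a kind of device for transformer oil sampling",
    "image matching device based on an image recognition method",
]
terms = ["a kind of", "based on", "image recognition",
         "method", "image matching", "device"]

results = retrieve(terms, database)
for score, text in results:
    print(f"{score:.2f}  {text}")
```

With these assumptions, the transformer text is no longer returned, because it matched only invalid words, and the text containing both remaining keywords is ranked first.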

Description

Technical field

[0001] The invention relates to the technical field of text mining, and in particular to a text retrieval method and device.

Background

[0002] As the name suggests, text retrieval extracts valuable information from text and displays it to the user on the screen of an electronic device. At present, text retrieval works by performing a matching search directly after text segmentation. There are two common text retrieval methods: a classified-browsing search method, and a keyword-based full-text search method.

[0003] Both retrieval methods obtain the keywords to match by segmenting the text input by the user, but the keywords obtained by this simple segmentation include invalid words, that is, words that appear in most texts and therefore inflate the number of search results. For example, the text entered by the user is: an image matching device based on an im...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/3331
Inventor: 杨芳盛兴李蔚君彭珍赵鹏贾辉辉
Owner: STATE GRID CORP OF CHINA