A method and device for determining the position information of a search term in a document

A technique for determining position information, applied in the field of information retrieval. It addresses the low efficiency of determining position information and the resulting loss of retrieval efficiency; it reduces the amount of data read and improves retrieval efficiency.

Active Publication Date: 2016-05-04
NEW FOUNDER HLDG DEV LLC +1

AI Technical Summary

Problems solved by technology

[0016] In summary, to read a term's position information in the preliminary hit documents, an existing full-text retrieval system must sequentially read that term's position information in every document that contains at least one of the terms obtained by dividing the search term. Reading this redundant or invalid information makes it inefficient to determine the position information of the search terms in the preliminary hit documents, which in turn lowers retrieval efficiency.

Examples

Embodiment 1

[0039] Embodiment 1 of the present invention provides a method for determining the position information of a search term in a document. The method can be applied in a full-text retrieval system to address the problem that reading the position information of terms in the preliminary hit documents is inefficient, which lowers retrieval efficiency.

[0040] Figure 2 shows a schematic flow chart of determining the position information of a search term in a document, as provided by Embodiment 1 of the present invention. Specifically, Embodiment 1 describes how to determine the position information, in the preliminary hit documents, of the terms obtained by dividing the search term, taking one of the divided terms as the example. As shown in Figure 2, the process of determining the position information of the term in the preliminary hit documents mainly includes the following steps:

[0041...

Embodiment 2

[0079] The second embodiment provides an application scenario of a method for determining position information of a search term in a document.

[0080] The application scenario in Embodiment 2 is illustrated with a full-text retrieval system retrieving the search term "method for accelerating digital information processing". Specifically, the system divides the search term into the terms "digital", "information", "processing", "acceleration", "of" and "method" and then performs the retrieval, obtaining the 7 documents listed in Table 1 above. Each of the seven documents includes at least one of these terms. Documents 4 and 7 include all of the terms obtained by dividing the search term, so they are the preliminary hit documents.
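
The distinction in [0080] between documents that contain at least one term and preliminary hit documents that contain every term can be sketched as follows. This is only an illustration under stated assumptions: Table 1 is not reproduced here, so the `POSTINGS` data is hypothetical and was chosen so that documents 4 and 7 contain all six terms, and the helper names `candidate_documents` and `preliminary_hits` are not taken from the patent.

```python
# Hypothetical postings: term -> set of document numbers containing it.
# These values are invented for illustration; they are not Table 1.
POSTINGS = {
    "digital": {1, 4, 7},
    "information": {2, 4, 5, 7},
    "processing": {3, 4, 7},
    "acceleration": {4, 6, 7},
    "of": {1, 2, 3, 4, 5, 6, 7},
    "method": {4, 5, 7},
}


def candidate_documents(terms, postings):
    """Return the sorted documents that contain at least one term."""
    return sorted(set().union(*(postings.get(t, set()) for t in terms)))


def preliminary_hits(terms, postings):
    """Return the sorted documents that contain every term."""
    doc_sets = [postings.get(t, set()) for t in terms]
    if not doc_sets:
        return []
    return sorted(set.intersection(*doc_sets))


terms = ["digital", "information", "processing", "acceleration", "of", "method"]
print(candidate_documents(terms, POSTINGS))  # documents with at least one term
print(preliminary_hits(terms, POSTINGS))     # documents with all terms
```

With this sample data the candidate set is all seven documents, while only documents 4 and 7 survive the intersection, which mirrors the scenario described in the paragraph above.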

[0081] This technical solution will be described by takin...

Embodiment 3

[0093] The third embodiment provides a device for determining the position information of a search term in a document. The device can be applied in a full-text retrieval system to address the problem that reading the position information of terms in the preliminary hit documents is inefficient, which lowers retrieval efficiency.

[0094] Specifically, Figure 8 shows a schematic structural diagram of a device for determining the position information of a search term in a document, as provided by Embodiment 3 of the present invention. As shown in Figure 8, the device includes:

[0095] A search term division unit 801 and a location information reading unit 802, wherein:

[0096] A search term division unit 801, configured to divide the search term into a plurality of terms;

[0097] The location information reading unit 802 is configured to perform, for each term obtained by dividing the search term by the search term division ...

Abstract

The invention discloses a method and a device for determining the position information of a search term in documents. The method performs the following for each term obtained by dividing the search term: the storage location of the term's position information in each preliminary hit document is determined, and the term's position information in the preliminary hit documents is read according to the determined storage locations. A preliminary hit document is a document that contains every term obtained by dividing the search term. This technical scheme eliminates the reading of each term's position information in documents that are not preliminary hits. The amount of data read is therefore reduced, the efficiency of determining the position information of the search term in the documents is improved, and retrieval efficiency is improved in turn.
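
The idea of reading a term's position information only at the storage locations of the preliminary hit documents can be sketched as follows. This is a minimal illustration, not the patented layout: the visible text does not say how storage locations are determined, so the per-term directory mapping a document number to a byte offset and count, the 4-byte integer encoding, and the function names are all assumptions made for this sketch.

```python
import io
import struct


def build_position_file(positions_by_doc):
    """Pack one term's positions as 4-byte ints, document after document.

    Returns the packed bytes and a directory doc_id -> (byte offset, count).
    The directory layout is an assumption made for this sketch.
    """
    buf = io.BytesIO()
    directory = {}
    for doc_id in sorted(positions_by_doc):
        positions = positions_by_doc[doc_id]
        directory[doc_id] = (buf.tell(), len(positions))
        buf.write(struct.pack(f"<{len(positions)}I", *positions))
    return buf.getvalue(), directory


def read_positions(data, directory, hit_docs):
    """Read the term's positions only for the preliminary hit documents."""
    stream = io.BytesIO(data)
    result = {}
    bytes_read = 0
    for doc_id in hit_docs:
        offset, count = directory[doc_id]
        stream.seek(offset)           # jump straight to the storage location
        raw = stream.read(4 * count)  # read only this document's positions
        bytes_read += len(raw)
        result[doc_id] = list(struct.unpack(f"<{count}I", raw))
    return result, bytes_read


# Hypothetical positions of one term in the documents that contain it.
term_positions = {1: [3, 18], 4: [0, 9, 27], 7: [5]}
data, directory = build_position_file(term_positions)
positions, bytes_read = read_positions(data, directory, hit_docs=[4, 7])
print(positions)              # positions for documents 4 and 7 only
print(bytes_read, len(data))  # bytes read versus total bytes stored
```

In this toy data the positions for document 1 are never read, because document 1 is not a preliminary hit. A sequential scan would have had to read past them to reach documents 4 and 7.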

Description

Technical field

[0001] The present invention relates to the technical field of information retrieval, in particular to a method and a device for determining the position information of a search term in a document.

Background

[0002] Full-text retrieval systems are currently very popular. Such a system mainly determines the documents that match the search terms submitted by a user terminal based on a pre-established inverted index file.

[0003] At present, a full-text retrieval system creates an inverted index file as follows: an index program scans each term in a document and builds an index entry for each term, where the index entry identifies the position information at which the corresponding term appears in the document; the inverted index file is then created from the index entries built for each term in the document. After the inverted index f...
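
The index-building process outlined in [0003] can be sketched as follows. This is a simplified illustration under assumptions the source does not state: terms are obtained by lowercasing and splitting on whitespace, a term's position is its word offset within the document, and the index is kept in memory rather than written to an inverted index file. The function name `build_inverted_index` is hypothetical.

```python
from collections import defaultdict


def build_inverted_index(documents):
    """Map each term to {doc_id: [word offsets where the term appears]}."""
    index = defaultdict(dict)
    for doc_id, text in documents.items():
        # Scan every term in the document and record where it occurs.
        for position, term in enumerate(text.lower().split()):
            index[term].setdefault(doc_id, []).append(position)
    return dict(index)


# Hypothetical two-document collection.
docs = {
    1: "digital information processing",
    2: "method of processing information",
}
index = build_inverted_index(docs)
print(index["processing"])
print(index["information"])
```

Each entry plays the role of an index entry in the sense of [0003]: it identifies, per document, where the term appears.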

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 童征宇, 徐剑波, 闫进兵
Owner: NEW FOUNDER HLDG DEV LLC