Information storage and query method based on vertical search engine and device thereof

A vertical search engine and information storage technology, applied in the fields of instruments, calculations, electrical digital data processing, etc., can solve the problems of occupying more memory resources and reducing the search rate

Active Publication Date: 2013-06-19
ALIBABA GRP HLDG LTD
View PDF4 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The above described in detail the specific implementation of data search using vertical search engine technology, because the positive list needs to be stored in the memory, and in the data file of the positive list, the stored attribute values ​​​​of the indexed documents, There will be more duplicate storage of the same file attribute value, for example in the above Figure 4 In the document, the document attribute value with document ID 0 and the document attribute value with document ID 2 are exactly the same, but in the data file of the positive list, it has to be stored twice, and this repetitive storage will take up more time. Memory resources, so that when users use vertical search engines to query relevant information, the search rate will be reduced due to insufficient memory resources of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information storage and query method based on vertical search engine and device thereof
  • Information storage and query method based on vertical search engine and device thereof
  • Information storage and query method based on vertical search engine and device thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 2

[0063] Furthermore, the deduplication dictionary can have various storage forms. In the second embodiment of the present application, the document identification information and attribute related value information stored in the deduplication dictionary is taken as an example, and then another vertical search based on the embodiment of the present application is explained in detail. The engine information storage method, the specific process is as follows:

[0064] S21~S22, according to the configuration information, initialize the header information file Fieldname.pfl.info in the forward list. After the initial configuration of the forward list header information file, for each document to be stored, according to the specified The attribute value contained in the attribute field determines the attribute related value of the specified attribute field of the document to be stored. Wherein, for the specific implementation process of S21-S22, please refer to the detailed descripti...

Embodiment 3

[0079] The information storage method based on the vertical search engine proposed in the first and second embodiments above is a full storage operation for all documents to be stored, that is, a full storage operation for one document to be stored. However, in the information storage process of the vertical search engine, the attributes of the documents are not static. Within a certain period of time, the attributes of each stored document may change, that is, for the stored documents, in the specified attribute field, Its attribute value may change. For example, the attribute value corresponding to the specified attribute field of a stored document increases or decreases, or some attribute values ​​are different from the stored attribute value. At this time, it is necessary to make changes to the changed document. The corresponding update is to perform incremental storage operations on the changed documents. Based on this, the third embodiment here proposes a method for stor...

Embodiment 4

[0089] Furthermore, based on the information storage method based on the vertical search engine proposed in the first to third embodiments above, the fourth embodiment of the present application proposes a corresponding information query method based on the vertical search engine, such as Figure 10 As shown, the specific process is as follows:

[0090]Step 101, splitting the search term input by the user, and performing an inverted index based on the split search term based on the posting table, that is, according to the split search term, respectively searching in the dictionary and the split search term The dictionary information corresponding to each search term is then queried in the posting list according to the found dictionary information, and the identification information of at least one document where the split search term appears is obtained. Embodiment 4 of the present application Here, the search term input by the user is dell computer as an example, and the info...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an information storage and query method based on a vertical search engine and a device of the information storage and query method based on the vertical search engine. The method comprises the steps of confirming attribute correlation values of each document to be stored according to attribute values included in a specified attribute field, judging whether numerical values identical with the attribute correlation values are stored in a message dictionary or not, writing offset and the number of the attribute values of an initial position stored in the message dictionary in a positive row table index file if the numerical values identical with the attribute correlation values are stored in the message dictionary, confirming offset and the number of the attribute values of the initial position of the attribute values in the positive row table index file if the numerical values identical with the attribute correlation values are not stored in the message dictionary, storing the confirmed attribute correlation values, the confirmed offset and the confirmed number of the attribute values in the message dictionary, writing the confirmed offset and the confirmed number of the attribute values in the positive row table index file, and starting to write the attribute values included in the specified attribute field of the document to be stored. Therefore, occupation of internal memory resources is reduced, and the rate that a user uses the vertical search engine to inquire relevant information is improved.

Description

technical field [0001] The present application relates to the technical field of search engines, in particular to an information storage method and device based on a vertical search engine, and an information search method and device based on a vertical search engine. Background technique [0002] Vertical search engine is a new search engine service model proposed relative to the problems of general search engines, such as large amount of information, inaccurate query, and insufficient depth. Provide valuable information and related services for a specific group of people or a specific need. The vertical search engine integrates a certain type of special information in the webpage library, extracts the required data by orientation and field, processes the data and returns it to the user in some form. [0003] The basic architecture of a vertical search engine is as follows: figure 1 As shown, the indexing system database of the vertical search engine mainly includes three...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 孙权程丽敏
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products