Full-text search method based on block method

A full-text search method based on the block method, applied in the field of full-text search. It addresses outdated document search methods, low work efficiency, and fast retrieval methods whose cost makes them unsuitable for enterprises and individuals. It achieves a simple and practical workflow, high search efficiency, and fast use.

Active Publication Date: 2020-01-07
吴生友

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is that current document search methods are outdated, inefficient, and consume too many resources, while fast retrieval methods are too costly to suit enterprises and individuals.

Examples

Embodiment

[0028] Embodiment: a full-text search method based on the block method, characterized in that it comprises the following steps:

[0029] (1) Establish the block index. On the first search, the block data and block information tables must be generated: extract the text content of each file and write it into the block data file, then write the file name, the file's MD5 value, and the start and end positions of the file's region in the block data into the block information table. The size of each block is calculated from the total size of the file index data;

[0030] (2) Update the block data. Obtain the MD5 value of the file corresponding to each file name in the block index and compare it with the MD5 value stored in the index. If any value has changed, the block data has changed and the entire block needs to be updated. When the MD5 values corresponding to every file in the block index table are equal, it means that the f...
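Steps (1) and (2) can be sketched roughly as follows. This is a minimal in-memory illustration, not the patented implementation: the names `FileEntry`, `build_blocks`, and `stale_blocks` are hypothetical, files are modelled as a dictionary of name to text instead of being read from disk, and `block_size` is passed in as a parameter because the source does not give the formula used to derive it.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class FileEntry:
    """One row of a block information table (hypothetical layout)."""
    name: str   # file name
    md5: str    # MD5 of the file's text content
    start: int  # start offset of the file's text in the block data
    end: int    # end offset (exclusive) in the block data


def md5_of(text):
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def build_blocks(files, block_size):
    """Pack file texts into blocks; return (blocks, tables)."""
    blocks, tables = [], []
    data, table = "", []
    for name, text in files.items():
        # Close the current block if adding this file would exceed block_size.
        if data and len(data) + len(text) > block_size:
            blocks.append(data)
            tables.append(table)
            data, table = "", []
        start = len(data)
        data += text
        table.append(FileEntry(name, md5_of(text), start, len(data)))
    if table:
        blocks.append(data)
        tables.append(table)
    return blocks, tables


def stale_blocks(files, tables):
    """Return indices of blocks containing a missing or modified file."""
    return [
        i
        for i, table in enumerate(tables)
        if any(e.name not in files or md5_of(files[e.name]) != e.md5 for e in table)
    ]
```

A block reported by `stale_blocks` would be rebuilt as a whole, matching the statement that a changed MD5 value requires the entire block to be updated.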

Abstract

The invention discloses a full-text search method based on a block method. The method comprises the following steps: (1) establishing a block index; (2) updating the block data; and (3) searching for keywords. Compared with the prior art, the method performs full-text search over multiple documents more quickly. When fewer documents contain the keyword, the block method is more efficient because fewer keyword comparisons are needed during the search. A conventional search must compare every document no matter how many contain the keyword, so its search time stays the same. When every document contains the search keyword, the two methods perform the same number of searches, but the block method opens and reads files fewer times, so the efficiency gap between the two methods is at its smallest. The workflow is simple and practical, and the method is fast to use.
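The text available here does not spell out step (3), so the following is only one possible reading of the keyword search, written as a sketch. It assumes each block is tested once as a whole and per-file comparisons are made only inside blocks that match. The function name `block_search` and the tuple layout `(name, start, end)` for the block information table are hypothetical.

```python
def block_search(blocks, tables, keyword):
    """Return (names of files containing keyword, number of comparisons made)."""
    hits, comparisons = [], 0
    for data, table in zip(blocks, tables):
        comparisons += 1
        if keyword not in data:
            continue  # the whole block is ruled out with a single comparison
        # The block matched, so check each file's region individually.
        # This also filters out matches that straddle two adjacent files.
        for name, start, end in table:
            comparisons += 1
            if keyword in data[start:end]:
                hits.append(name)
    return hits, comparisons


# Two blocks holding three files (illustrative data only).
blocks = ["alpha betagamma", "delta epsilon"]
tables = [
    [("a.txt", 0, 10), ("b.txt", 10, 15)],
    [("c.txt", 0, 13)],
]
print(block_search(blocks, tables, "delta"))
```

Under this reading, a keyword that appears in few files lets most blocks be skipped after one comparison each, which is consistent with the abstract's claim that the advantage is largest when few documents contain the keyword.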

Description

Technical field

[0001] The invention relates to the field of computer office work, in particular to a full-text search method based on a block method.

Background

[0002] Finding the documents that contain a certain keyword among many documents requires a full-text search over multiple documents. The traditional method is to open each file, search it, and return the documents that contain the keyword. Searching 10,000 ordinary documents takes about 3 minutes and searching 100,000 documents takes about 30 minutes, so a method is needed to solve the search speed problem.

[0003] Currently, a classification method is used: documents are first classified, the category of the keyword is then determined, and the search is carried out only in that category, which narrows the search range and shortens the search time. However, this is only suitable when the categories are clear and the category of the keyword can be determined. There...

Application Information

IPC(8): G06F16/31
CPC: G06F16/316
Inventor 吴生友
Owner 吴生友