Text search method and device and storage medium

A text search technology, applied in the field of search engines, that addresses problems such as long read/write delays, written indexes that cannot be read in real time, and indexes that cannot be updated in real time, with the effect of more comprehensive search results.

Pending Publication Date: 2021-10-12
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] During research on and practice of the existing technology, the inventors of the embodiments of the present application found that, with the above method of updating the retrieval index, a written index cannot be read in real time: the data must be written to a disk file before it can be served to users for retrieval, so there is a long read/write delay.
It can be seen that current search engines cannot guarantee real-time index updates.



Embodiment Construction

[0090] The terms "first" and "second" in the description and claims of the embodiments of the present application and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances, such that the embodiments described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or modules is not necessarily limited to the expressly listed steps or modules, but may include other steps or modules that are not expressly listed or that are inherent to the process, method, product or device. The division of modules that appear in the embodiments of the present application is ...


Abstract

The embodiment of the invention provides a text search method, device, and storage medium. The method comprises: receiving search information; determining at least one piece of candidate inverted index information matching the search information, based on historical inverted index information and the inverted index information of a cache region, wherein the inverted index information of the cache region is obtained by updating based on a historical document and at least one target document, and the target document is a document updated from the historical document; combining the at least one piece of candidate inverted index information to obtain target inverted index information; and outputting the target inverted index information as a search result. With this scheme, the index can be updated in real time, the write efficiency of partial field updates is improved, and the retrieval performance of the search engine is improved.
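
The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the use of document-id sets as postings, the AND semantics across query terms, and the `superseded_ids` set (ids of historical documents that have an updated target document in the cache region) are all assumptions made for illustration.

```python
from collections import defaultdict


def tokenize(text):
    # Assumption: whitespace tokenization stands in for real word segmentation.
    return text.lower().split()


def build_inverted_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def lookup(index, terms):
    """Return ids of documents that contain every query term (AND semantics)."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


def search(query, historical_index, cache_index, superseded_ids):
    """Combine candidates from the historical index and the cache-region index."""
    terms = tokenize(query)
    candidates = [
        # Historical hits, minus documents replaced by a newer cached version.
        lookup(historical_index, terms) - superseded_ids,
        # Hits from the cache region, which holds the updated target documents.
        lookup(cache_index, terms),
    ]
    return sorted(set().union(*candidates))


# Hypothetical data: document 2 was updated after the historical index was built.
historical_docs = {1: "real time index update", 2: "disk segment flush"}
target_docs = {2: "real time segment flush"}

historical_index = build_inverted_index(historical_docs)
cache_index = build_inverted_index(target_docs)
superseded_ids = set(target_docs)

print(search("real time", historical_index, cache_index, superseded_ids))
print(search("disk", historical_index, cache_index, superseded_ids))
```

In this sketch the updated version of document 2 is searchable as soon as it enters the cache-region index, and its stale historical postings are masked rather than rewritten.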

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of search engines, and in particular to a text search method, device, and storage medium.

Background

[0002] A search engine generally updates its retrieval index as follows. The engine performs word segmentation on the text content of a received document to obtain a word list. It then maintains an in-memory data structure called the data table (DWPT), which includes the word list, the inverted list of each word, the document content, and word frequency statistics. When the data size of the DWPT reaches a set threshold, such as 10 MB, the engine persists the in-memory data to disk to form a segment. Each segment has multiple files, which respectively record the segment's dictionary (trie tree[8] / bkd tree[7]), the inverted list, the forward (document) content, word frequency statistics, and so on.

[0003] During the research and practice of the exist...
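
The conventional update path described in [0002] can be sketched as follows. This is a simplified illustration under stated assumptions, not any particular engine's code: the class `BufferedIndexWriter`, its methods, and the in-memory `segments` list that stands in for on-disk segment files are all hypothetical. The sketch shows why a written document is not searchable until the buffer is flushed.

```python
from collections import Counter, defaultdict


class BufferedIndexWriter:
    """Hypothetical writer: buffers index data in memory, then flushes a segment."""

    def __init__(self, flush_threshold_bytes=10 * 1024 * 1024):
        self.flush_threshold_bytes = flush_threshold_bytes
        self.segments = []  # Assumption: a list stands in for on-disk segments.
        self._reset_buffer()

    def _reset_buffer(self):
        self.postings = defaultdict(set)  # word -> ids of documents containing it
        self.term_freq = Counter()        # word frequency statistics
        self.documents = {}               # document content
        self.buffered_bytes = 0

    def add_document(self, doc_id, text):
        terms = text.lower().split()  # stand-in for word segmentation
        for term in terms:
            self.postings[term].add(doc_id)
        self.term_freq.update(terms)
        self.documents[doc_id] = text
        self.buffered_bytes += len(text.encode("utf-8"))
        if self.buffered_bytes >= self.flush_threshold_bytes:
            self.flush()

    def flush(self):
        """Persist the buffer as one immutable segment and start a new buffer."""
        if not self.documents:
            return
        self.segments.append({
            "postings": {t: sorted(ids) for t, ids in self.postings.items()},
            "term_freq": dict(self.term_freq),
            "documents": dict(self.documents),
        })
        self._reset_buffer()

    def search(self, term):
        """Read flushed segments only; buffered documents are not visible."""
        hits = set()
        for segment in self.segments:
            hits.update(segment["postings"].get(term, []))
        return sorted(hits)


writer = BufferedIndexWriter(flush_threshold_bytes=32)  # tiny threshold for the demo
writer.add_document(1, "inverted index")
print(writer.search("index"))  # document 1 is still only in the memory buffer
writer.add_document(2, "index segment on disk")  # crosses the threshold, flushes
print(writer.search("index"))
```

Because `search` consults only flushed segments, the first query misses document 1 even though it has already been written, which is the read/write delay the application sets out to remove.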


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/33, G06F16/31, G06F16/332
CPC: G06F16/3329, G06F16/319, G06F16/334
Inventor: 曹希保, 曾楚伟, 李斌
Owner: TENCENT TECH (SHENZHEN) CO LTD