Real-time search method, device and system

A real-time search technology, applied in the field of network search, that addresses problems such as data inconsistency, heavy memory usage, and degraded retrieval performance, and achieves the effect of reducing crawling and improving speed.

Inactive Publication Date: 2012-05-09
SHENZHEN AIGU TECH

AI Technical Summary

Problems solved by technology

[0010] 2. The mechanism for merging the incremental index into the full index database is complicated and difficult to control.
If a single incremental index and a single full index are used, the full index grows too large over long-term operation, so the merge process becomes extremely slow, which also degrades retrieval performance.
If multi-level incremental indexes and multi-level full index libraries are used instead, the update and delete operations on existing data held in the incremental index are spread across multiple full index libraries, and an additional management mechanism is needed to assist with merging. This greatly increases the complexity of the system and is prone to causing data inconsistencies.
[0011] 3. Traditional indexes are usually created for one specific application. Each index and its supporting resources (such as tokenizers and similarity calculators) are independent, so supporting resources cannot be shared among multiple indexes.
For example, the thesaurus of a tokenizer takes up a lot of memory. If multiple indexes are deployed on the same server, each index must load its own copy of the thesaurus, which wastes a large amount of memory.
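The memory problem in item 3 can be illustrated with a minimal sketch. It shows one common way to avoid duplicate loading, a shared resource pool, and is not taken from the patent text. The names `ResourcePool`, `Tokenizer`, and `Index` are hypothetical, and the whitespace tokenizer is a stand-in for a real thesaurus-based one.

```python
from typing import Dict, FrozenSet, Iterable, List, Set


class Tokenizer:
    """Toy tokenizer: keeps only words found in its thesaurus (assumption)."""

    def __init__(self, thesaurus: FrozenSet[str]) -> None:
        self.thesaurus = thesaurus

    def tokenize(self, text: str) -> List[str]:
        return [w for w in text.lower().split() if w in self.thesaurus]


class ResourcePool:
    """Hypothetical pool that loads each named tokenizer once and shares it."""

    def __init__(self) -> None:
        self._tokenizers: Dict[str, Tokenizer] = {}
        self.loads = 0  # counts how many times a thesaurus was actually loaded

    def get_tokenizer(self, name: str, words: Iterable[str]) -> Tokenizer:
        if name not in self._tokenizers:
            self.loads += 1
            thesaurus = frozenset(w.lower() for w in words)
            self._tokenizers[name] = Tokenizer(thesaurus)
        return self._tokenizers[name]


class Index:
    """Minimal inverted index that borrows its tokenizer from the pool."""

    def __init__(self, tokenizer: Tokenizer) -> None:
        self.tokenizer = tokenizer
        self.postings: Dict[str, Set[int]] = {}

    def add(self, doc_id: int, text: str) -> None:
        for term in self.tokenizer.tokenize(text):
            self.postings.setdefault(term, set()).add(doc_id)

    def search(self, term: str) -> List[int]:
        return sorted(self.postings.get(term.lower(), set()))
```

In this sketch, two indexes created with `pool.get_tokenizer("default", words)` hold a reference to the same `Tokenizer` object, so the thesaurus is held in memory once regardless of how many indexes run on the server.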




Embodiment Construction

[0051] To describe the technical content, structural features, objectives and effects of the present invention in detail, a description is given below in conjunction with the embodiments and the accompanying drawings.

[0052] The geometric growth of the Internet and the lack of standardization on the World Wide Web make network information retrieval significantly different from traditional information retrieval: the object of Internet information retrieval is massive data, and the content of that information is all-encompassing and varied in form. To provide users with structured, intuitive data, the collected web pages must go through a series of processing steps such as denoising, filtering, purification, and structured extraction of topic information.
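A rough sketch of what such processing might look like on a single page is given here. It uses only the Python standard library, treats script and style blocks as noise, and pulls out a title and a price. The `price` field and its pattern are purely illustrative assumptions, not fields named in the source.

```python
import re
from html.parser import HTMLParser
from typing import Dict, List, Optional


class TextExtractor(HTMLParser):
    """Collects the page title and visible text, skipping script/style noise."""

    SKIP = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self._in_title = False
        self.title = ""
        self.chunks: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._skip_depth:
            return  # denoising: ignore text inside script/style
        text = data.strip()
        if not text:
            return  # filtering: drop whitespace-only fragments
        if self._in_title:
            self.title += text
        else:
            self.chunks.append(text)


def extract_record(html: str) -> Dict[str, Optional[str]]:
    """Structured extraction: returns title, cleaned body, and a sample field."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    body = " ".join(parser.chunks)
    # Hypothetical topic field: a number following the word "price".
    match = re.search(r"price[:\s]+(\d+(?:\.\d+)?)", body, re.IGNORECASE)
    return {
        "title": parser.title or None,
        "body": body,
        "price": match.group(1) if match else None,
    }
```

A real system would need site-specific extraction rules; the single regular expression here only stands in for that step.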

[0053] At present, mainstream search engines are relatively weak at structured data extraction. General search engines such as Baidu and Google only strip the tags from the collecte...



Abstract

The invention discloses a real-time search method comprising the following steps: S1, setting the interest point data specified by the system; S2, capturing data associated with the interest point data from a target website into the system; S3, traversing the target website according to a preset data acquisition cycle; S4, determining whether the target website contains updated items, where updated items include newly added web pages and changed web pages; if there are none, returning to step S2, otherwise proceeding to step S5; and S5, capturing the associated data of the updated items into the system and updating the target website so as to achieve synchronous acquisition. The invention also discloses a real-time search device and a real-time search system. With the disclosed method, device and system, up-to-the-minute information can be searched in real time at high speed and with low resource usage.
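Steps S1 to S5 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the target website is simulated as a dictionary from URL to page text, change detection uses a content hash (the source does not say how updates are detected), and the class and method names are hypothetical. When nothing has changed, the sketch simply waits for the next cycle.

```python
import hashlib
from typing import Dict, List

Site = Dict[str, str]  # hypothetical stand-in for a target website: url -> text


def fingerprint(text: str) -> str:
    """Content hash used here to detect changed pages (an assumption)."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


class RealTimeCollector:
    def __init__(self, interest_terms: List[str]) -> None:
        # S1: interest point data specified by the system.
        self.interest_terms = [t.lower() for t in interest_terms]
        self.seen: Dict[str, str] = {}   # url -> fingerprint at last capture
        self.store: Dict[str, str] = {}  # url -> captured associated data

    def _relevant(self, text: str) -> bool:
        lowered = text.lower()
        return any(term in lowered for term in self.interest_terms)

    def _capture(self, url: str, text: str) -> None:
        self.seen[url] = fingerprint(text)
        if self._relevant(text):
            self.store[url] = text
        else:
            self.store.pop(url, None)  # page no longer matches the interest data

    def initial_capture(self, site: Site) -> None:
        # S2: capture data associated with the interest points.
        for url, text in site.items():
            self._capture(url, text)

    def find_updates(self, site: Site) -> List[str]:
        # S3 + S4: traverse the site and list new or changed pages.
        return [
            url for url, text in site.items()
            if self.seen.get(url) != fingerprint(text)
        ]

    def poll_once(self, site: Site) -> List[str]:
        # S5: re-capture only the updated pages; returns their URLs.
        updated = self.find_updates(site)
        for url in updated:
            self._capture(url, site[url])
        return updated
```

A scheduler would call `poll_once` once per data acquisition cycle. Because only new or changed pages are captured again, the amount of crawling per cycle stays proportional to the number of updates rather than to the size of the site.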

Description

Technical field

[0001] The invention relates to the field of network search, and in particular to a real-time search method, device and system.

Background

[0002] Information is an important factor closely tied to people's work and life, at every scale from the whole world down to each enterprise, business, family and individual. Although search engine technology has grown more and more advanced in recent years, searching for information on the Internet still poses a major problem, whether or not the search succeeds. Anyone who has used a search engine has had this experience: sometimes you cannot find the results you want, and at other times you get millions of results you do not need. In fact, the second case is the more troublesome and difficult to deal with. Finding the information you really need among millions of search results is like looking for a needle in a haystack.

[0003] Suppose the Internet is a giant library,...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 刘晓刚
Owner: SHENZHEN AIGU TECH