
Multi-level database building method and device for retrieval database and storage medium

A database construction technique applied in the field of big data search. It addresses the problem that large retrieval databases are updated with low frequency and low efficiency, and aims for high update efficiency and update frequency while ensuring retrieval efficiency and stability.

Pending Publication Date: 2021-09-24
北京百舸飞驰科技有限公司

AI Technical Summary

Problems solved by technology

[0003] The general issue to consider when building existing retrieval databases is this: large-capacity retrieval databases generally offer higher retrieval efficiency and better retrieval stability, but because of their large capacity, they are updated less frequently and less efficiently.



Examples


Embodiment 1

[0066] As shown in Figure 1, a multi-level database building method for a retrieval database includes real-time database building, which saves the data information changed between a first time node in the past and the current time node. The first time node is updated periodically so that the time interval between the first time node and the current time node remains unchanged.
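The fixed-interval window described in [0066] can be illustrated with a minimal sketch. The class name `RealTimeWindow`, the `window_seconds` parameter, and the use of plain numeric timestamps are assumptions made for illustration; the source does not specify how the first time node is stored or refreshed.

```python
from collections import deque


class RealTimeWindow:
    """Hypothetical store that keeps only changes inside a fixed window before 'now'."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.changes = deque()  # (timestamp, doc_id, payload), oldest first

    def first_time_node(self, now):
        # The window start moves with the clock, so its distance to 'now' stays constant.
        return now - self.window_seconds

    def add(self, timestamp, doc_id, payload):
        # Assumes changes arrive in timestamp order.
        self.changes.append((timestamp, doc_id, payload))

    def advance(self, now):
        # Periodic update: drop changes older than the new first time node.
        start = self.first_time_node(now)
        while self.changes and self.changes[0][0] < start:
            self.changes.popleft()
        return start


window = RealTimeWindow(window_seconds=3600)
window.add(100, "doc-1", "price changed")
window.add(4000, "doc-2", "title changed")
start = window.advance(now=4200)
remaining = [doc_id for _, doc_id, _ in window.changes]
```

Calling `advance` on a schedule plays the role of the periodic update: the change at timestamp 100 falls before the refreshed first time node and is dropped, while the recent change stays in the real-time store.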

[0067] The real-time database building described in this embodiment includes:

[0068] Write the data information of business changes in the retrieval database into the publish-subscribe system in real time;

[0069] Subscribe to the publish-subscribe system to obtain real-time changed data information;

[0070] Generate forward index data according to the real-time changed data information, and push it into the real-time database fragments.
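The three steps in [0068] to [0070] can be sketched end to end as follows. This is an in-memory illustration only: a standard-library `queue.Queue` stands in for the publish-subscribe system, "fragments" is read as shards, and `NUM_SHARDS`, the change record layout, and the CRC-based shard assignment are all hypothetical choices not taken from the source.

```python
import queue
import zlib
from collections import Counter

NUM_SHARDS = 4  # hypothetical number of real-time database shards


def publish(bus, change):
    # Step [0068]: write a business change into the publish-subscribe system.
    bus.put(change)


def build_forward_entry(change):
    # Step [0070], first half: forward index entry = doc ID -> keyword counts.
    return change["doc_id"], Counter(change["text"].lower().split())


def shard_for(doc_id):
    # Stable hash so the same document always lands in the same shard (assumption).
    return zlib.crc32(doc_id.encode("utf-8")) % NUM_SHARDS


def consume(bus, shards):
    # Step [0069]: subscribe and drain the changes that are currently available.
    while True:
        try:
            change = bus.get_nowait()
        except queue.Empty:
            break
        doc_id, terms = build_forward_entry(change)
        # Step [0070], second half: push the entry into a real-time shard.
        shards[shard_for(doc_id)][doc_id] = terms


bus = queue.Queue()
shards = [dict() for _ in range(NUM_SHARDS)]
publish(bus, {"doc_id": "doc-1", "text": "fast search fast index"})
publish(bus, {"doc_id": "doc-2", "text": "real time update"})
consume(bus, shards)
indexed = {doc_id: terms for shard in shards for doc_id, terms in shard.items()}
```

A production system would replace the queue with a real message broker and run the consumer continuously; the sketch only shows how the data flows from a change event to a sharded forward index.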

[0071] The real-time database building in this embodiment is used to save the data information changed from the first time node in the past to the current...

Embodiment 2

[0188] As shown in Figure 3, this embodiment provides a data update method for a retrieval database. The retrieval database includes a base database and an incremental database, and the intermediate files of the base database and the intermediate files of the incremental database produced during the indexing process are stored, respectively;

[0189] Merge the data of the incremental database into the base database, match the intermediate files of the incremental database with the intermediate files of the base database, and balance the index data of the base database.
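One possible reading of the merge-and-balance step in [0189] is sketched below. The source does not define the intermediate file format or what "balance" means precisely, so this example assumes that each intermediate file can be modeled as a mapping from document ID to index data, that incremental entries override base entries, and that balancing means spreading documents evenly across base shards. The function names are hypothetical.

```python
def merge_incremental(base, incremental):
    # Incremental entries replace or extend base entries with the same doc ID.
    merged = dict(base)
    merged.update(incremental)
    return merged


def rebalance(index, num_shards):
    # Assumed notion of balancing: round-robin over sorted doc IDs
    # so that shard sizes differ by at most one document.
    shards = [dict() for _ in range(num_shards)]
    for position, doc_id in enumerate(sorted(index)):
        shards[position % num_shards][doc_id] = index[doc_id]
    return shards


base = {"doc-1": {"search": 1}, "doc-2": {"index": 2}, "doc-3": {"shard": 1}}
incremental = {"doc-2": {"index": 3}, "doc-4": {"merge": 1}}
merged = merge_incremental(base, incremental)
base_shards = rebalance(merged, num_shards=2)
sizes = [len(shard) for shard in base_shards]
```

Here `doc-2` takes its newer value from the incremental data and `doc-4` is added, after which the four documents are split evenly between two shards.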

[0190] In a search engine, crawled data is indexed for easy retrieval. The index includes a forward index and an inverted index. In a forward index, the document ID is the key, and the table records the number of occurrences of each keyword in that document. When searching, the information of each word in each document in the table is sc...
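The two index types named in [0190] can be illustrated with a small sketch. The forward index follows the description above (document ID as key, keyword counts as value); the inverted index is shown in its usual form, keyword as key and the documents containing it as value. Whitespace tokenization and the function names are assumptions for illustration.

```python
from collections import Counter, defaultdict


def build_forward_index(documents):
    # doc ID -> number of occurrences of each keyword in that document
    return {doc_id: Counter(text.lower().split()) for doc_id, text in documents.items()}


def build_inverted_index(forward_index):
    # keyword -> {doc ID: occurrences}, derived from the forward index
    inverted = defaultdict(dict)
    for doc_id, counts in forward_index.items():
        for term, count in counts.items():
            inverted[term][doc_id] = count
    return dict(inverted)


documents = {"doc-1": "search engine index", "doc-2": "index index update"}
forward = build_forward_index(documents)
inverted = build_inverted_index(forward)
hits = sorted(inverted["index"])
```

Looking up a keyword in the forward index requires visiting every document, whereas the inverted index answers the same query with a single key lookup.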


Abstract

The invention relates to the technical field of big data search, and discloses a multi-level database building method and device for a retrieval database, and a storage medium. The multi-level database building method includes real-time database building, which stores the data information changed between a first time node in the past and the current time node. The first time node is updated periodically so that the time interval between the first time node and the current time node remains unchanged. The real-time database building comprises the following steps: writing the data information of business changes in the retrieval database into a publish-subscribe system in real time; subscribing to the publish-subscribe system to obtain the data information changed in real time; and generating forward index data from that data information and pushing the forward index data into the real-time database fragments. The method realizes a multi-level database building scheme that includes at least the real-time database. It ensures retrieval efficiency and stability as well as retrieval timeliness, meets the user's retrieval requirements, and improves the user's retrieval experience.

Description

Technical field

[0001] The invention relates to the technical field of big data search, in particular to a multi-level database building method, device and storage medium for a retrieval database.

Background technique

[0002] In the era of big data, search engine services based on retrieval databases are widely used. With the increase of data, there are more complex requirements for the construction of retrieval databases.

[0003] The general issues that need to be considered in the construction of existing retrieval databases are: large-capacity retrieval databases generally have higher retrieval processing efficiency and better retrieval stability, but due to the large capacity of the retrieval database, the frequency and efficiency of retrieval database updates are relatively low. In particular, for more and more retrieval databases that can store hundreds of millions of data, when building a retrieval database, not only the efficiency and stability of retrieval must b...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/21, G06F16/22, G06F16/23, G06F16/31
CPC: G06F16/21, G06F16/23, G06F16/2228, G06F16/319
Inventor 董金奎王岩程童匡柘溪黄鹤南王敏颜聪刘向阳
Owner 北京百舸飞驰科技有限公司