Unlock instant, AI-driven research and patent intelligence for your innovation.

Data search method and device

A data search and data technology, applied in the field of data processing, can solve the problems of low efficiency of Boolean operation search, large space and time overhead, etc., to achieve the effect of reducing time and space overhead and improving search efficiency

Active Publication Date: 2019-04-09
RUN TECH CO LTD BEIJING
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the embodiment of the present invention provides a data search method and device to solve the problem of the excessive space and time overhead required by the index and the search efficiency brought by the Boolean operation of the regular expression in the existing data search algorithm. low problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data search method and device
  • Data search method and device
  • Data search method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] figure 2 It is a schematic flowchart of a data search method provided in Embodiment 1 of the present invention. The method is suitable for searching, classifying and sorting massive data by various search engines such as industries and commerce. The method is executed by a device, a terminal or a server with a search engine, and can be implemented in the form of software and / or hardware. Such as figure 2 As shown, the data search method includes:

[0027] S210. Convert the search rules from Boolean rules to disjunctive normal form rules.

[0028] For massive data in the Internet, social networks, etc., there are massive search rules corresponding to it. This correspondence between data and search rules enables users to search and filter massive data. The search rules issued in this correspondence can be called Boolean rules.

[0029] A Boolean rule is specifically one or multiple objects connected by logical operation symbols ("logic and", "logic or" and "logic no...

Embodiment 2

[0042] image 3 It is a schematic flow chart of a data search method provided by Embodiment 2 of the present invention. This embodiment further describes the data search method in detail on the basis of the foregoing embodiments. Such as image 3 As shown, the data search method includes:

[0043] S301. Convert the search rules from Boolean rules to disjunctive normal form rules.

[0044] S302. Decompose the disjunctive normal form rules into multiple simple conjunction rules.

[0045] S303. Allocate a corresponding increment identifier for each of the simple conjunction rules in the disjunctive normal form rules.

[0046] Each Boolean rule issued corresponds to a unique identifier, which is the basis for the index when the search rule is searched. Therefore, if Boolean rules are to be finally divided into simple conjunction rules, an incremental identifier can be assigned to each simple conjunction rule for the convenience of finding. And in order to be applicable to all...

Embodiment 3

[0063] On the basis of the above-mentioned embodiments, this embodiment further defines a data search method to make the method easier to implement, and uses simple examples to illustrate.

[0064] For example, among the distributed massive search rules, a certain Boolean rule is A=1&(B=2||C=3), which is transformed into the disjunctive normal form (A=1&B=2)||(A= 1&C=3), after conversion, it can be decomposed into two simple conjunction rules, (A=1&B=2) and (A=1&C=3), respectively assigning the increment flag 1 and the increment flag to the above two rules 2. Then generate rule 1: (A=1&B=2) and rule 2: (A=1&C=3) respectively. Correspondingly, a comparison index as shown in Table 1 can be established.

[0065] Table 1

[0066] A=1

1,2

B=2

1

C=3

2

[0067] As shown in Table 1, both rule 1 and rule 2 include the field A=1, then the increments corresponding to A=1 in the comparison index are identified as 1 and 2; similarly, the increments corr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a data search method and device. The method comprises the following steps: converting a search rule into a disjunctive normal form rule from a Boolean rule; decomposing the disjunctive normal form rule into a plurality of simple conjunctive form rules; and comparing data in a database with each simple conjunctive form rule in the disjunctive normal form rule, and obtaining data which hits all objects contained in any one simple conjunctive form rule, and taking the data as search data corresponding to the search rule. The technical scheme of the invention solves the problems of overhigh expenditure of time and space required by indexing and low search efficiency brought by the Boolean operation of a regular expression in a traditional data search algorithm.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a data search method and device. Background technique [0002] With the rapid development of the Internet, social networks, e-commerce, etc., the amount of new data added every day is also rapidly increasing. For massive data, we need to filter and filter massive data through high-speed real-time comparison according to preset rules. [0003] In the traditional algorithm implementation, the name of the field or condition of the rule is used for hash storage. The mapping value not only needs to store the value of the field or condition, but also needs to store the expression ID (Identification, identification number) and the field or condition in position in the expression. When filtering and searching data, it is necessary to search each field or condition in the massive data in the hash table, and then obtain all expression IDs and positions according to the value of the field o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2453G06F16/2455
CPCG06F16/2453G06F16/24564
Inventor 周寅生
Owner RUN TECH CO LTD BEIJING