Mass data real-time screening analysis method and system, and storage medium
A technology of massive data and analysis methods, applied in the field of data processing, can solve problems such as ineffective caching of intermediate calculation results, increase server operating costs, and large computing pressure on servers, save data search time, improve computing speed, and expand The effect of the amount of cached data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] Taking a real-time screening analysis process for massive data as an example, the technical solution of the present invention is described in detail below, as figure 1 As shown, the analysis process includes the following steps:
[0039] S01: Build a basic condition database, compress the basic condition database and store it in the cloud space, and locally cache the data list corresponding to each data in the basic condition database;
[0040] Summarize all the basic conditions and the existing basic condition sets to obtain the basic condition database, and record the basic condition database as a compressed file in BitSet format, and store it in the general cloud space. This operation is generally performed offline. It should be noted that: the BitSet format is a compressed storage set format, which can only store index information of data sets that meet certain conditions rather than the data itself, and can quickly complete operations such as intersection, union, a...
Embodiment 2
[0059] Below in conjunction with the massive data real-time screening and analysis system of the present invention, the system is applied to realize the above-mentioned massive data real-time screening and analysis method, and its workflow is:
[0060] First, build the basic condition database according to the basic conditions and the existing basic condition sets, record the basic condition database as a compressed BitSet format file through the basic condition compression and cache module, and store it in the general cloud space / server, and record each item in the basic condition database as a compressed BitSet format file. The data list corresponding to the data is cached locally;
[0061] The data extraction module extracts the available data in the basic condition database and the local cache data according to the filtering conditions;
[0062] The data screening module filters the available data on the operation side, and obtains the intermediate operation results and sc...
Embodiment 3
[0065] A storage medium storing a computer program, when the computer program is executed by a processor, implements the steps of the above analysis method.
[0066] The above three embodiments respectively introduce the method, system and storage medium for real-time screening of massive data in detail. Those skilled in the art should understand that the embodiments of the present invention can be provided as methods, apparatuses or computer program products. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


