Massive data real-time screening and analysis method, system and storage medium
A massive data real-time screening and analysis technology, applied in the field of data processing. It addresses problems such as ineffective caching of intermediate calculation results, increased server operating costs, and heavy server computing pressure, with the effect of reducing computing pressure, improving computing speed, and expanding the amount of cached data.
Embodiment 1
[0038] Taking a real-time screening and analysis process for massive data as an example, the technical solution of the present invention is described in detail below. As shown in Figure 1, the analysis process includes the following steps:
[0039] S01: Build a basic condition database, compress it and store it in the cloud space, and locally cache the data list corresponding to each data item in the basic condition database;
[0040] All the basic conditions and the existing basic condition sets are summarized to obtain the basic condition database, which is recorded as a compressed file in BitSet format and stored in the general cloud space. This operation is generally performed offline. It should be noted that the BitSet format is a compressed set storage format: it stores only the index information of the data that meet a given condition, rather than the data itself, and can quickly complete operations such as intersection, union, a...
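The index-only storage described in [0040] can be illustrated with a minimal sketch. It assumes each basic condition is represented as a bitmap in which bit i is set when record i satisfies the condition; a plain Python integer stands in for the BitSet format here, and the condition names and record indices are hypothetical, not taken from the patent.

```python
def build_bitset(matching_indices):
    """Return an int whose bit i is set iff record i satisfies the condition."""
    bits = 0
    for i in matching_indices:
        bits |= 1 << i
    return bits


def to_indices(bits):
    """Decode a bitmap back into a sorted list of record indices."""
    indices = []
    position = 0
    while bits:
        if bits & 1:
            indices.append(position)
        bits >>= 1
        position += 1
    return indices


# Hypothetical basic conditions over records 0..7 (illustrative only).
age_over_30 = build_bitset([0, 2, 3, 5, 7])
lives_in_city = build_bitset([2, 3, 4, 7])

# Set operations reduce to single bitwise operations on the bitmaps.
both = age_over_30 & lives_in_city      # intersection
either = age_over_30 | lives_in_city    # union
```

Because only indices are stored, the record data itself is never touched while conditions are combined; the indices are decoded with `to_indices` only when the final result is needed.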
Embodiment 2
[0059] The following describes the workflow of the massive data real-time screening and analysis system of the present invention when it is applied to implement the above massive data real-time screening and analysis method:
[0060] First, the basic condition database is built according to the basic conditions and the existing basic condition sets. The basic condition compression cache module records the basic condition database as a compressed BitSet format file and stores it in the general cloud space / server, and the data list corresponding to each data item in the basic condition database is cached locally;
[0061] The data extraction module extracts the available data from the basic condition database and the locally cached data according to the filter conditions;
[0062] The data screening module screens the available data at the operation end to obtain intermediate calculation results and screening results; specifically, the data scre...
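The workflow in [0060] to [0062] can be sketched roughly as follows. This is an assumption-laden illustration, not the patented implementation: a dictionary stands in for the cloud space, `zlib` stands in for the unspecified compression, and the class `ScreeningSession`, the helper `pack`, and the cache keying scheme are all hypothetical names introduced for this example.

```python
import zlib


def pack(bits):
    """Compress a bitmap (a non-negative int) into bytes for storage."""
    length = (bits.bit_length() + 7) // 8 or 1
    return zlib.compress(bits.to_bytes(length, "little"))


class ScreeningSession:
    """Screens records at the operation end and caches intermediate results."""

    def __init__(self, cloud_store):
        self.cloud_store = cloud_store  # condition name -> compressed bitmap
        self.cache = {}                 # frozenset of names -> bitmap
        self.loads = 0                  # number of fetches from the store

    def _load(self, name):
        """Fetch and decompress one basic condition bitmap."""
        self.loads += 1
        raw = zlib.decompress(self.cloud_store[name])
        return int.from_bytes(raw, "little")

    def screen(self, conditions):
        """Intersect the named conditions, reusing cached results."""
        key = frozenset(conditions)
        if key in self.cache:
            return self.cache[key]
        result = None
        for name in sorted(key):
            single = frozenset([name])
            if single not in self.cache:
                self.cache[single] = self._load(name)
            bits = self.cache[single]
            result = bits if result is None else result & bits
        self.cache[key] = result
        return result


# Hypothetical store with two basic conditions.
store = {"a": pack(0b1011), "b": pack(0b0110)}
session = ScreeningSession(store)
first = session.screen(["a", "b"])
second = session.screen(["b", "a"])  # served from the local cache
```

In this sketch a repeated filter, in any order, is answered from the local cache without another fetch or decompression, which is one way the server-side computing pressure named in the summary could be reduced.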
Embodiment 3
[0065] A storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the above analysis method are realized.
[0066] The above-mentioned three embodiments respectively introduce the massive data real-time screening method, system and storage medium in detail. Those skilled in the art should understand that the embodiments of the present invention can be provided as methods, devices or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.


