Index statistics method, system and device and storage medium
A statistical method and statistical system technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of not supporting time window update one by one, large amount of calculation, and high delay
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0097] Such as figure 1 As shown, this embodiment provides an index statistics method based on the sliding window of the big data processing framework, wherein the big data processing framework is Flink, and this method uses the State object in the mapWithState operator provided by Flink to store the following keys The parameters are the time linked list, the minimum value of the time linked list, the maximum value of the time linked list, and the statistical value of the indicators; among them, the time linked list is a two-way linked list. The direction of positioning to the location of out-of-order data and expired data, the complexity of inserting and deleting operations in the time-linked list is O(1), which can improve processing speed and efficiency.
[0098] The index statistical method provided in this embodiment includes the following steps:
[0099] Step 101, Flink uses multiple receivers to receive external data in parallel; wherein, each piece of data includes an...
Embodiment 2
[0119] Such as figure 2 As shown, the index statistics system based on the sliding window of the big data processing framework in this embodiment includes a parameter storage module 1 , a data processing module 2 and a real-time processing module 3 .
[0120] The big data processing framework is Flink, which includes multiple data receivers.
[0121] The parameter storage module 1 uses the State object in the mapWithState operator to store several parameters. The parameters include the time linked list, the minimum value of the time linked list, the maximum value of the time linked list, and index statistics. Among them, the time-linked list is a doubly-linked list, which stores data in chronological order, and can choose the fastest direction to locate out-of-order data and expired data each time it is searched. The complexity of inserting and deleting operations in the time-linked list is O (1), it can increase the processing speed and improve the efficiency.
[0122] The...
Embodiment 3
[0152] On the basis of Embodiment 2, each piece of data in this embodiment includes several fields of indicators to be counted. In addition, if Figure 7 As shown, different from the real-time processing module 3 in Embodiment 2, the real-time processing module 3 in this embodiment includes a third data generating module 304 , a fourth data generating module 305 and a second current data generating module 306 .
[0153] The third data generating module 304 is configured to use the latest received piece of data as the third data.
[0154] The fourth data generation module 305 is configured to use a KeyBy operator to distribute the third data to different nodes according to the dimension of the indicator field to be counted, and the data distributed to the nodes is the fourth data.
[0155] The second current data generating module 306 is configured to use the fourth data as the current data.
[0156] Common statistical indicators are based on a certain business dimension for ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com