Page access upstream and downstream flow calculation method
A calculation method and page access technology, applied in the field of big data analysis, can solve problems such as the inability to reflect the gradual decrease of traffic, repeated statistics, and the inability to meet the analysis needs of traffic conditions, and achieve the effect of speeding up query matching efficiency and facilitating implementation.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0031] Example 1, such as figure 1 As shown in , a method for calculating upstream and downstream traffic of page visits includes the following steps:
[0032] Obtain user access logs from the Nginx reverse proxy server, transmit and upload them to HDFS storage, use the Hive offline processing engine to analyze the access logs, and extract the logs including: access time, user ID, session ID, current page link and source Page links are stored in the Hive data table, and the storage format uses Parquet or ORC to improve storage query efficiency.
[0033] Next, when processing the access log, the Hive offline processing engine is also used for processing, and the access path rule is implemented by encapsulating the logic of the UDAF function, combined with the HQL grouping based on the user ID, and the access path of the same user is input to the UDAF function for processing. After processing, the output is serialized binary data of a special data structure, which is stored as ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

