Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for distributed log visitor volume counting

A distributed, distributed real-time technology, applied in multi-program devices, computing, resource allocation, etc., can solve problems such as bandwidth fluctuation, distortion of statistical results, and inability to observe statistical data in real time, so as to reduce load and reduce execution operations. , The effect of improving the response speed of data processing

Active Publication Date: 2018-08-03
MICRO DREAM TECHTRONIC NETWORK TECH CHINACO
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. The load of each machine is huge and log loss often occurs, because the bandwidth of a single machine may fluctuate, so it may not be enough to receive all access logs, resulting in distortion of statistical results;
[0005] 2. Frequent access to the file system will slow down the system response;
[0006] 3. If any machine goes down, it will affect all statistics of the day;
[0007] 4. After the statistics are finished every day, the statistical data of the previous day will be obtained the next day, and the statistical data cannot be observed in real time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for distributed log visitor volume counting
  • Method and device for distributed log visitor volume counting

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0024] Such as figure 1 As shown, it is a flow chart of a method for distributed statistical log access volume in an embodiment of the present invention, including:

[0025] 101. In the server cluster of the distributed real-time computing system, multiple log data are obtained in real time;

[0026] 102. Divide the acquired plurality of log data into a first predetermined number of log data streams;

[0027] 103. Create the first predetermined number of wor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the invention provide a method and a device for distributed log visitor volume counting. The method comprises: in a server cluster of a distributed real-time calculating system, obtaining a plurality of pieces of log data in real time; dividing the obtained plurality of pieces of log data to log data streams in a first preset number; establishing working units in the first preset number, and correspondingly distributing each log data stream to each working unit in a one-to-one manner; in each working unit, counting each piece of log data in the distributed log data streams in the current working unit, and using the obtained log statistical data to update a database in real time. Through the method and the device, load of each machine is greatly reduced. Since execution operation on log files is greatly reduced, data processing response speed of the whole cluster is greatly improved. The method and the device realize to rapidly and accurately count user effective visit times in real time every day.

Description

technical field [0001] The invention relates to the technical field of computer data processing, in particular to a method and device for distributed statistics of log visits. Background technique [0002] In a large website, each time the server accepts a user request, an access log is generated. The log fields of the access log usually include access IP (Internet Protocol, a protocol for interconnection between networks), access time, access path, access duration, processing The machine of the log data and the status code of this HTTP (HyperText Transfer Protocol, Hypertext Transfer Protocol) request. If it is necessary to count the daily effective visits and total visits of the website, but because the number of website visit logs is very large and may reach hundreds of billions of records per day, it is not feasible to use conventional statistical methods to perform statistical operations on such logs Yes, because the processing speed of a single machine is far behind t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30G06F11/34G06F9/50
CPCG06F9/5027G06F11/3065G06F11/3093G06F11/3452
Inventor 王嘉伟
Owner MICRO DREAM TECHTRONIC NETWORK TECH CHINACO