Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for analyzing log

An analysis method and log technology, applied in the field of Internet communication, can solve problems such as inability to analyze log files in real time and unbalanced use of system resources, and achieve the effects of improving timeliness, improving timeliness, and rationally and balanced use.

Inactive Publication Date: 2013-06-26
ALIBABA GRP HLDG LTD
View PDF3 Cites 76 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The present application provides a log analysis method and device to at least solve the problems in the prior art that log files cannot be analyzed in real time and the use of system resources is unbalanced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for analyzing log
  • Method and device for analyzing log
  • Method and device for analyzing log

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0053] Based on the above-mentioned preferred embodiments, the present application provides a preferred log analysis device, so as to achieve the technical effect of improving the timeliness of log analysis and using system resources in a reasonable and balanced manner. Preferably, the log analysis device in this embodiment can be set in figure 1 In the log analysis server cluster 106 in. In order to achieve the above purpose, specifically, as figure 2 As shown, the above-mentioned log analysis device includes: a collection unit 202, which is used to collect log files generated by a cluster of website log servers; an analysis unit 204, which communicates with the collection unit 202, and is used to perform session-by-session analysis on the collected log files at predetermined intervals Distributed clickstream log analysis based on units of , wherein the interval period makes the resources of the system for analyzing log files evenly used in a day.

[0054]In the above-ment...

Embodiment 2

[0120] exist Figure 1-10 Based on the above, the present application provides an optimal log analysis method, so as to improve the timeliness of log analysis and use system resources in a reasonable and balanced manner. In order to achieve the above purpose, specifically, as Figure 11 As shown, the above log analysis methods include:

[0121] S1102: Collect log files generated by the website log server cluster;

[0122] S1104: Perform a session-based distributed clickstream log analysis on the collected log files at a predetermined interval period, wherein the interval period enables the resources of the system used to analyze the log files to be used evenly in a day.

[0123] In the preferred embodiment above, the collected log files are analyzed in session units at predetermined intervals, and while the timeliness of log file analysis is improved, the system resources for analyzing log files are averaged every day in a day. It can be used in a predetermined period, whic...

Embodiment 3

[0189] On the basis of the above preferred embodiments, the present application provides a preferred log analysis method, so as to improve the timeliness of log analysis and use system resources in a reasonable and balanced manner. In order to achieve the above purpose, specifically, as Figure 12 As shown, the above log analysis methods include:

[0190] S1: Download the original log file by the hour and upload it to the distributed computing file system;

[0191] Preferably, downloading log files on an hourly basis is equivalent to setting the predetermined period to 1 hour, but it is not limited thereto. According to different needs, it can also be 30 minutes, 2 hours, etc., so as to improve the timeliness of log analysis. At the same time, when log files Or when an error occurs in the analysis result, the data can be reprocessed or re-analyzed for the predetermined cycle in which the error occurs, so as to reduce the workload. In addition, the distributed framework has u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and device for analyzing a log. The method includes: collecting log documents generated by a website log server cluster; and conducting click stream log analysis on the collected log documents based on distribution with conversion as a unit according to a preset interval period, and the interval period enables system resources for analyzing the log documents to be used evenly in a day. By means of the method and device, correct real-time analysis of the click stream log based on the distribution with the conversion as the unit is achieved, and a problem in the prior art that the system resources can not be used in real time and evenly is solved, and therefore the flexibility and timeliness of website log analysis are improved.

Description

technical field [0001] The present application relates to the field of Internet communication, in particular, to a log analysis method and device. Background technique [0002] With the development of Internet information services, many enterprises, companies, government agencies and schools already have or are building their own websites. For the management of the website, we are required not only to pay attention to the daily throughput of the server, but also to further understand the visits of each webpage of the website, to improve the content and quality of the webpage according to the click frequency of each webpage, and to improve the readability of the content. Therefore, Website administrators need to know the analysis results of log files in a timely manner. [0003] At present, the existing click stream log analysis is to collect, organize, analyze, and count the web server logs of the website, mine the commercial value hidden in it, and convert the data describ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04L12/24
Inventor 乔平许玉勤
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products