Internet information statistical method and Internet information statistical system

An Internet information statistical method and system, applied in the technical field of computer networks, which address the problems that existing tools provide only rough ranking information and cannot provide more options and interfaces.

Inactive Publication Date: 2013-07-10
亿赞普(北京)科技有限公司

AI Technical Summary

Problems solved by technology

[0007] However, Alexa can provide only rough ranking information and cannot provide more options and interfaces.

Examples

Embodiment 1

[0041] Figure 1 is the flow chart of the Internet information statistics method described in this embodiment. As shown in Figure 1, the method includes:

[0042] S101. Divide the network access data into multiple business theme data sets.

[0043] In this step, the network access data is divided into multiple business theme data sets through MapReduce according to business theme. The network access data includes the network-wide traffic data (IMOS log data) used for data analysis, and this massive data is stored in the large-scale distributed storage system ODS. High-speed partitioning of massive data is exactly what the MapReduce data processing mechanism is good at: through distributed parallel computing, it can divide a large amount of data into different data sets in a very short period of time. Therefore, the invention adopts the MapReduce mechanism to implement the divi...
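The division in step S101 can be illustrated with a minimal single-process sketch of the map and reduce phases. This is not the patent's implementation: the record fields (`user`, `host`, `bytes`), the `THEME_BY_HOST` lookup table, and all function names are hypothetical, and a real deployment would run the same two phases as a distributed job rather than in memory.

```python
from collections import defaultdict

# Hypothetical host-to-theme lookup; the patent does not specify how a
# record is assigned to a business theme.
THEME_BY_HOST = {
    "shop.example.com": "e-commerce",
    "news.example.com": "news",
}


def map_record(record):
    """Map phase: emit a (theme, record) pair for one access-log record."""
    theme = THEME_BY_HOST.get(record["host"], "other")
    return theme, record


def reduce_by_theme(pairs):
    """Reduce phase: collect all records that share the same theme key."""
    datasets = defaultdict(list)
    for theme, record in pairs:
        datasets[theme].append(record)
    return dict(datasets)


def split_by_theme(records):
    """Divide access records into one data set per business theme."""
    return reduce_by_theme(map(map_record, records))


# Illustrative sample data (assumed, not from the patent).
access_log = [
    {"user": "u1", "host": "shop.example.com", "bytes": 1200},
    {"user": "u2", "host": "news.example.com", "bytes": 300},
    {"user": "u1", "host": "news.example.com", "bytes": 450},
    {"user": "u3", "host": "video.example.com", "bytes": 9000},
]
theme_datasets = split_by_theme(access_log)
```

Keying the map output by theme is what lets a MapReduce framework route every record of one theme to the same reducer, so each reducer writes out exactly one theme data set.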

Embodiment 2

[0054] Based on the same inventive concept, the present invention also provides an Internet information statistics system. Figure 2 is a block diagram of the structure of the Internet information statistics system described in this embodiment. As shown in Figure 2, the system includes: a data splitting unit 201, a data summarizing unit 202, and a data query unit 203.

[0055] The data splitting unit 201 divides the network access data into multiple business theme data sets through MapReduce according to business theme. The network access data includes the network-wide traffic data (IMOS log data) used for data analysis, and this massive data is stored in the large-scale distributed storage system ODS. High-speed partitioning of massive data is exactly what the MapReduce data processing mechanism is good at. This data processing mechanism can divide a large amount of data into different data sets through distributed parallel computing in a very...

Embodiment 3

[0063] The present invention also provides an Internet information statistics system implemented on the distributed data processing framework Hadoop. As shown in Figure 3, the system mainly includes an upper-level business system 301, a service layer 302, a data mart (DM) 303, a data warehouse (DW) 304, and a distributed storage system (ODS) 305. The data mart DM is implemented on HBase, the data warehouse DW is implemented on Hive, and the distributed storage system ODS is implemented on HDFS.

[0064] The data processing flow is as follows. First, the network access data IMOS is imported from outside into the storage system ODS, and then the data is extracted from the ODS into the data warehouse DW through ETL. ETL stands for Extraction-Transformation-Loading, that is, data extraction, transformation and loading. Tools that can implement ETL include: OWB (Oracle Warehouse Builder), ODI (Oracle Data Integrator), Informatic Po...
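The ODS-to-DW step can be pictured with a small in-memory sketch of the three ETL stages. Everything below is an assumption made for illustration: the tab-separated line layout, the field names, and the use of Python lists as stand-ins for HDFS files and a Hive table. Only the layer-to-technology mapping in `LAYERS` comes from the embodiment.

```python
# Layer-to-technology mapping as stated in the embodiment.
LAYERS = {"ODS": "HDFS", "DW": "Hive", "DM": "HBase"}

# Hypothetical raw log lines standing in for files in the ODS; the
# tab-separated layout (day, user, host, bytes) is assumed.
ods_raw_lines = [
    "2013-01-05\tu1\tshop.example.com\t1200",
    "2013-01-05\tu2\tnews.example.com\t300",
    "malformed line",
]


def extract(lines):
    """Extraction: read raw lines and split them into fields."""
    for line in lines:
        yield line.split("\t")


def transform(rows):
    """Transformation: drop malformed rows and convert field types."""
    for fields in rows:
        if len(fields) != 4:
            continue  # skip rows that do not match the assumed layout
        day, user, host, size = fields
        yield {"day": day, "user": user, "host": host, "bytes": int(size)}


def load(rows, table):
    """Loading: append cleaned rows to the target table (a list here)."""
    table.extend(rows)
    return table


# A plain list stands in for a DW table.
dw_access_table = load(transform(extract(ods_raw_lines)), [])
```

Writing the stages as generators keeps each row flowing through extract, transform, and load one at a time, which mirrors how an ETL job streams data instead of holding the whole input in memory.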

Abstract

The invention discloses an Internet information statistical method and an Internet information statistical system. The method includes the following steps. First, the users' network access data is divided into multiple business theme data sets through MapReduce according to business theme. Second, the data in each business theme data set is aggregated according to different indicators, and the resulting statistics are stored. Third, when a request to query statistical information is received, the statistics corresponding to the business theme named in the request are retrieved and returned. With this method and system, users can gain in-depth knowledge of the access volume, number of visits, visiting users, searched keywords, and the traffic of each searched keyword for a given industry, a given website, or a set of competing websites. Rich statistical data can be presented accurately, at different granularities and at high speed, to different business systems and users, so that complicated internal relations among network access data can be found and displayed, and detailed and subjective data support can be provided for a decision-making department.
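The second and third steps of the abstract (aggregate each theme data set by several indicators, store the result, then answer queries by theme) can be sketched as follows. The three indicators, the record fields, and the dictionary used as the statistics store are all assumptions chosen for illustration; the abstract does not fix a concrete schema or storage API.

```python
# Hypothetical theme data sets, as the splitting step might produce them.
theme_datasets = {
    "news": [
        {"user": "u2", "keyword": "weather", "bytes": 300},
        {"user": "u1", "keyword": "sports", "bytes": 450},
        {"user": "u2", "keyword": "sports", "bytes": 250},
    ],
    "e-commerce": [
        {"user": "u1", "keyword": "shoes", "bytes": 1200},
    ],
}


def summarize(datasets):
    """Compute per-theme statistics for a few illustrative indicators."""
    stats = {}
    for theme, records in datasets.items():
        stats[theme] = {
            "visits": len(records),
            "unique_users": len({r["user"] for r in records}),
            "traffic_bytes": sum(r["bytes"] for r in records),
        }
    return stats


def query(stats_store, theme):
    """Return the stored statistics for one theme, or None if unknown."""
    return stats_store.get(theme)


# A plain dict stands in for the statistics store.
stats_store = summarize(theme_datasets)
news_stats = query(stats_store, "news")
```

Because the statistics are computed once and stored per theme, a query only performs a lookup, which is what allows results to be returned quickly regardless of the size of the raw access data.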

Description

Technical field

[0001] The invention relates to the technical field of computer networks, in particular to an Internet information statistics method and system.

Background

[0002] Alexa is a leading Internet company that provides website traffic information for free. Founded in 1996, Alexa has been committed to developing tools for web crawling and website traffic calculation. Alexa rank is an indicator often cited to evaluate the traffic of a website.

[0003] There are two main types of Alexa world website rankings: comprehensive ranking and category ranking.

[0004] Comprehensive ranking, also called absolute ranking, is the ranking of a specific website among all websites. Alexa publishes a new comprehensive website ranking every three months. This ranking is based on the cumulative geometric mean of the number of users reached (Users Reach) and page views (Page Views) over three months.

[0005] Category ranking, one is to classify ...
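To make the comprehensive ranking of paragraph [0004] concrete, the sketch below scores each site by the geometric mean of its three-month reach and page views and sorts by that score. This is only an illustration of a geometric-mean ranking under assumed inputs; the site names and numbers are invented, and the exact formula Alexa used is not given in the text.

```python
import math

# Hypothetical three-month totals per site: (users reached, page views).
site_totals = {
    "a.example": (400, 900),
    "b.example": (100, 100),
    "c.example": (2500, 100),
}


def geometric_mean_score(reach, page_views):
    """Geometric mean of two non-negative quantities."""
    return math.sqrt(reach * page_views)


def rank_sites(totals):
    """Return site names ordered from highest to lowest score."""
    return sorted(
        totals,
        key=lambda site: geometric_mean_score(*totals[site]),
        reverse=True,
    )


ranked = rank_sites(site_totals)
```

A geometric mean rewards sites that do well on both measures: `c.example` has by far the largest reach here but still ranks below `a.example`, whose reach and page views are more balanced.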

Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L29/08, G06F17/30, H04L29/12
Inventor: 余效伟罗峰黄苏支李娜
Owner: 亿赞普(北京)科技有限公司