Cross-time multi-dimensional abnormal data monitoring method and system

A cross-time, multi-dimensional abnormal data monitoring technology, applied in the computer field. It addresses the low efficiency of existing approaches, the damage that cheating clicks cause to advertisers and advertising platforms, and the inability to adapt to business expansion, with the aim of improving the accuracy of advertisement delivery.

Inactive Publication Date: 2017-02-22
杭州启冠网络股份有限公司

AI Technical Summary

Problems solved by technology

[0002] Various cheating behaviors tend to occur after online advertisements are placed, including advertisement clicks from illegitimate traffic generated by programs or scripts that maliciously simulate user clicks. These behaviors damage the interests of advertisers and advertising platforms. In the prior art, abnormal advertisement click behavior is modeled mainly with supervised learning: features are extracted for each record from the raw logs, the records are manually labeled, and the model is regenerated. This approach is inefficient and cannot adapt to business expansion. When the volume of access data reaches the TB or even PB level, it cannot meet business needs in terms of timeliness and processing performance.



Embodiment Construction

[0028] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0029] Figure 1 is a flowchart of an embodiment of the cross-time multi-dimensional abnormal data monitoring method of the present invention. As shown in Figure 1, the method includes:

[0030] 10. Collect Web logs and OLAP log queries to obtain website information,...


Abstract

The invention relates to a cross-time multi-dimensional abnormal data monitoring method and system. The method includes: collecting Web logs and OLAP log queries to obtain website information, advertisement delivery information and user information; cleaning the resulting heterogeneous data with Python in a Spark environment, generating a table for each dimension in preparation for modeling analysis; establishing a normal advertisement click behavior model with a GBDT algorithm; performing modeling analysis on abnormal data samples with a support vector machine to establish an abnormal advertisement click behavior model; and generating a cheating click monitoring model and deploying it as an online anti-cheating model. By monitoring cheating Internet advertisement traffic with machine learning and cloud computing technologies, the method and system address the fusion analysis of heterogeneous data and the classification accuracy of small samples. Subsequent Internet advertisement click data can be recorded and analyzed across the whole machine learning chain, the anti-cheating model can be improved, and advertisement delivery can be made more accurate.
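The cleaning step and the per-dimension tables can be illustrated with a minimal sketch. The abstract specifies Python in a Spark environment. The example below uses only the Python standard library so that it stays self-contained, and it is not the patent's implementation. The record fields (`user_id`, `site`, `ad_id`, `ts`), the sample data and the cleaning rules (drop incomplete records and exact duplicates) are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical click records merged from Web logs and OLAP query results.
RAW_RECORDS = [
    {"user_id": "u1", "site": "news.example", "ad_id": "a1", "ts": "2016-08-01T10:00:00"},
    {"user_id": "u1", "site": "news.example", "ad_id": "a1", "ts": "2016-08-01T10:00:01"},
    {"user_id": "u2", "site": "shop.example", "ad_id": "a2", "ts": "2016-08-01T11:30:00"},
    {"user_id": "", "site": "shop.example", "ad_id": "a2", "ts": "2016-08-01T11:31:00"},    # missing user
    {"user_id": "u3", "site": "news.example", "ad_id": None, "ts": "2016-08-01T12:00:00"},  # missing ad
    {"user_id": "u1", "site": "news.example", "ad_id": "a1", "ts": "2016-08-01T10:00:01"},  # exact duplicate
]

# Assumed dimensions: user, website and delivered advertisement.
DIMENSIONS = ("user_id", "site", "ad_id")


def clean(records):
    """Drop records with a missing dimension or timestamp, and exact duplicates."""
    seen = set()
    cleaned = []
    for rec in records:
        if not rec.get("ts") or not all(rec.get(d) for d in DIMENSIONS):
            continue
        key = tuple(rec[d] for d in DIMENSIONS) + (rec["ts"],)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(rec)
    return cleaned


def tables_by_dimension(records):
    """Build one table per dimension, mapping each value to its click count."""
    tables = {d: Counter() for d in DIMENSIONS}
    for rec in records:
        for d in DIMENSIONS:
            tables[d][rec[d]] += 1
    return tables


cleaned = clean(RAW_RECORDS)
tables = tables_by_dimension(cleaned)
for dim in DIMENSIONS:
    print(dim, dict(tables[dim]))
```

In a Spark deployment the same filtering and grouping would typically be expressed as DataFrame operations. The per-dimension tables would then feed the GBDT and support vector machine modeling steps.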

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a method and system for monitoring abnormal data across time and in multiple dimensions.

Background

[0002] Various cheating behaviors tend to occur after online advertisements are placed, including advertisement clicks from illegitimate traffic generated by programs or scripts that maliciously simulate user clicks. These behaviors damage the interests of advertisers and advertising platforms. In the prior art, abnormal advertisement click behavior is modeled mainly with supervised learning: features are extracted for each record from the raw logs, the records are manually labeled, and the model is regenerated. This approach is inefficient and cannot adapt to business expansion. When the volume of access data reaches the TB or even PB level, it cannot meet business needs in terms of timeliness and processing performa...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06Q30/02
CPC: G06Q30/0245
Inventor: 史建民, 龚安邦
Owner: 杭州启冠网络股份有限公司