Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Detection system and detection method of advertisement clicking anomaly based on Spark Streaming

A technology of anomaly detection and advertisement clicks, applied in relational databases, structured data retrieval, marketing, etc., it can solve the problem that a fast decision-making scheme cannot provide a theoretical basis quickly, and the processing technology cannot solve online problems in real time, and data security And the problem of weak performance in large batch data processing

Active Publication Date: 2017-05-10
CHONGQING UNIV OF POSTS & TELECOMM
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current processing technology is generally based on offline batch processing. Such processing technology cannot solve online problems in real time, and cannot quickly provide theoretical basis for some fast decision-making solutions.
For real-time systems such as Storm, although it has the ability to process data in real time, it is weaker than Spark Streaming in terms of data security and large-scale data processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Detection system and detection method of advertisement clicking anomaly based on Spark Streaming
  • Detection system and detection method of advertisement clicking anomaly based on Spark Streaming
  • Detection system and detection method of advertisement clicking anomaly based on Spark Streaming

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The technical solutions in the embodiments of the present invention will be described clearly and in detail below in conjunction with the drawings in the embodiments of the present invention. The described embodiments are only a part of the embodiments of the present invention.

[0047] The technical scheme of the present invention is as follows:

[0048] Such as figure 1 As shown, an advertisement click anomaly detection system based on Spark Streaming is characterized by comprising a data collection unit 1, a data cleaning unit 2, a distributed data message system 3, a first abnormal data detection unit 4, and a suspicious data extraction unit 5. , Normal data and abnormal data classifier 6 and classified data database unit; among them

[0049] The data collection unit 1 is used to collect log information of users clicking on advertisements;

[0050] The data cleaning unit 2 cleans and standardizes the logs collected by the data collection unit 1, and finally sends the stand...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a detection system and a detection method of advertisement clicking anomaly based on Spark Streaming, and relates to the field of computer technique application. Logs are collected when a user clicks the advertisements on a webpage, data collected in real time are cleaned, data field format is standardized, and the standardized data is transferred to the Kafka data information system by Flume, data are classified through a KNN neighborhood algorithm of Spark Streaming, and the three classes of abnormal data, suspicious data, and normal data can be obtained. The abnormal data and the normal data are stored in a database, the suspicious data are sent to the Kafka data information system, and naive Bayes classifiers are trained through the abnormal data, the classification information of the suspicious data can be obtained using the classifier, and data are saved in the database. Advertiser expenses are justly collected by the amount of normal data, in the meantime, the popularities of each advertisement are obtained by analyses, the directions for industrial developments are provided for the advertisers, and the information such as user distributions in the country is provided.

Description

Technical field [0001] The invention relates to the application field of computer technology, in particular to a detection system and a detection method based on Spark Streaming advertisement click abnormality. Background technique [0002] With the explosive growth of data, the era of big data has come. Safe, fast, real-time, and efficient data processing can not only allow enterprises to avoid risks in advance, but also provide timely data and information for enterprise development, product production and development. Effective basis. [0003] However, due to the open nature of the network, it also brings inauthentic information, malicious access, and malicious attacks while making it convenient for the public. This is a problem faced by all open websites. How to prevent these problems, how to extract real and effective data, and reduce the malicious load of the server is the research focus of each open website. Among them, malicious clicks on advertisements are a typical probl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06Q30/02
CPCG06Q30/0242G06F16/285
Inventor 刘群谭敢锋戴大祥
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products