Method and system for identifying whether webpage includes malicious content or not

A technology for identifying malicious content in web pages, applied in the field of network security. It addresses the low detection rate and lag of existing methods for detecting malicious websites such as phishing websites, and the economic losses that users suffer as a result.

Active Publication Date: 2016-09-21
量子创新(北京)信息技术有限公司

AI Technical Summary

Problems solved by technology

Attacks by malicious websites are increasing year by year. Such sites use a range of technical means to disguise their identities and win users' trust in order to seek illegal gain, and users suffer heavy economic losses as a result.
[0003] Existing techniques for guarding against malicious websites mainly submit a suspicious webpage URL to a blacklist database for lookup. However, because phishing websites are continually updated, this method has a low detection rate for malicious websites such as phishing websites and lags considerably behind new threats.
Other approaches either scan the content of the webpage for malicious keywords, or extract basic image features of the webpage and compute the similarity between the suspicious webpage and the genuine one to judge whether the suspicious page is an imitation. Each of these methods has its own limitations, resulting in a high misjudgment rate.



Detailed Description of Embodiments

[0027] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0028] Figure 1 shows a flow chart of a method 100 for identifying whether a webpage contains malicious content according to an embodiment of the present invention.

[0029] According to an embodiment of the present invention, in order to identify malicious webpages more efficiently, a preprocessing operation is performed on the input webpages to be identified; that is, a blacklist and a whitelist are used to filter the...
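The list-based preprocessing mentioned in [0029] can be illustrated with a minimal sketch. Because the paragraph is truncated in this excerpt, the details here are assumptions rather than the patent's method: the lists are keyed by hostname, the entries are placeholders, and `prefilter` is a hypothetical helper name. The point is only that pages matching either list are settled immediately, so the costlier feature-based identification runs on the remainder.

```python
from urllib.parse import urlsplit

# Hypothetical list contents; the excerpt does not say what the lists hold.
WHITELIST = {"example.com", "bank.example"}
BLACKLIST = {"phish.example"}


def prefilter(url):
    """Return 'malicious', 'benign', or 'unknown' for a URL.

    Assumption: both lists are matched against the URL's hostname.
    """
    host = (urlsplit(url).hostname or "").lower()
    if host in BLACKLIST:
        return "malicious"
    if host in WHITELIST:
        return "benign"
    # Neither list matched: pass the page on to feature-based identification.
    return "unknown"
```

Only URLs that come back as `"unknown"` would need the later identification steps, which is where the efficiency gain would come from under these assumptions.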


Abstract

The invention discloses methods for identifying whether a webpage contains malicious content. One identification method comprises the following steps: parsing the URL (Uniform Resource Locator) of the webpage to be identified and extracting URL features from it to generate a first feature set; generating a first feature vector from the first feature set; and processing the first feature vector with a first feature model to output a first result indicating whether the webpage to be identified contains malicious content. The invention also discloses three further identification methods, as well as corresponding systems for identifying whether a webpage contains malicious content.
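The three steps in the abstract (URL features, feature vector, model output) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the excerpt names neither the specific URL features nor the form of the "first feature model", so the feature names, the fixed ordering in `FEATURE_ORDER`, the logistic scoring, and the numbers in `WEIGHTS` and `BIAS` are all hypothetical placeholders rather than trained values.

```python
import ipaddress
import math
from urllib.parse import urlsplit

# Hypothetical feature names and order; the excerpt does not list the features.
FEATURE_ORDER = ["length", "host_dots", "has_at", "host_is_ip", "hyphens", "is_https"]
# Placeholder weights for illustration only; a real model would be trained.
WEIGHTS = [0.02, 0.4, 2.0, 2.5, 0.5, -1.0]
BIAS = -3.0


def extract_url_features(url):
    """Step 1: parse the URL and build the first feature set."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    try:
        ipaddress.ip_address(host)
        host_is_ip = 1.0
    except ValueError:
        host_is_ip = 0.0
    return {
        "length": float(len(url)),
        "host_dots": float(host.count(".")),
        "has_at": 1.0 if "@" in url else 0.0,
        "host_is_ip": host_is_ip,
        "hyphens": float(host.count("-")),
        "is_https": 1.0 if parts.scheme == "https" else 0.0,
    }


def to_vector(features):
    """Step 2: turn the feature set into a fixed-order feature vector."""
    return [features[name] for name in FEATURE_ORDER]


def is_malicious(vector, threshold=0.5):
    """Step 3: apply the (assumed) linear model and threshold its score."""
    z = BIAS + sum(w * x for w, x in zip(WEIGHTS, vector))
    score = 1.0 / (1.0 + math.exp(-z))
    return score >= threshold
```

A call such as `is_malicious(to_vector(extract_url_features(url)))` chains the three steps. Keeping the feature order in one list means the vector layout and the model weights cannot drift apart silently.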

Description

Technical field

[0001] The invention relates to the technical field of network security, and in particular to a method and a system for identifying whether a web page contains malicious content.

Background

[0002] With the development of the Internet, web-based applications are becoming increasingly popular. People can check bank accounts and shop online through browsers, and the web gives them a convenient and fast way to interact. The accompanying problem is that attacks by malicious websites are increasing year by year. Such sites use a range of technical means to disguise their identities and win users' trust in order to seek illegal gain, and users suffer heavy economic losses as a result. How to identify malicious content in web pages and guard against malicious websites has therefore become a meaningful research topic in the field of network security.

[0003] The existing technologies for preventing malicious web...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/56
Inventor: 李唱康靖陈虎
Owner: 量子创新(北京)信息技术有限公司