Method, system, device and medium for identifying picture-oriented fraudulent webpages
A recognition method and webpage technology, applied in computer security devices, character and pattern recognition, network data retrieval, etc. It addresses problems such as slow speed, inability to obtain valid information, and poor detection effect, and achieves fast detection and calculation speed with high precision and recall.
Examples
Embodiment 1
[0046] The method of the present invention for identifying picture-oriented fraudulent webpages comprises the following steps:
[0047] S100. Collect fraudulent webpages that consist mainly of pictures to construct webpage samples;
[0048] S200. For each fraudulent webpage, extract the tag tree information with a webpage tag tree extraction tool, encode the tag tree as characters, construct a tag tree sequence from the characters corresponding to the tags, and use this sequence as a fraudulent tag tree sequence;
[0049] For each fraudulent tag tree sequence, initialize the corresponding malicious value based on sample statistics; the malicious value is the malicious value of the malicious keyword;
[0050] S300. Construct a feature library from the fraudulent tag tree sequences together with the update time and malicious value corresponding to each sequence. The update time of the above f...
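Steps S200 and S300 can be illustrated with a minimal sketch. It is not the patented implementation: the tag-to-character table `TAG_CODES`, the functions `encode_tag_tree` and `build_feature_library`, and the `FeatureEntry` record are hypothetical names introduced here, and the initial malicious value is assumed to be the share of samples that produce a given sequence, since the source only says it is initialized from sample statistics. Python's standard `html.parser` stands in for the unspecified tag tree extraction tool; it reports opening tags in document order, which corresponds to a pre-order walk of the tag tree.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timezone
from html.parser import HTMLParser

# Hypothetical tag-to-character table; the source does not give the real encoding.
TAG_CODES = {"html": "H", "head": "E", "body": "B", "div": "D",
             "img": "I", "a": "A", "p": "P", "script": "S"}
UNKNOWN_CODE = "?"  # placeholder for tags missing from the table


class TagSequenceParser(HTMLParser):
    """Collects one character per opening tag, in document order."""

    def __init__(self):
        super().__init__()
        self.codes = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this hook.
        self.codes.append(TAG_CODES.get(tag, UNKNOWN_CODE))


def encode_tag_tree(html: str) -> str:
    """Return the tag tree sequence of a page as a string of characters."""
    parser = TagSequenceParser()
    parser.feed(html)
    parser.close()
    return "".join(parser.codes)


@dataclass
class FeatureEntry:
    malicious_value: float  # assumed here: relative frequency among samples
    update_time: datetime   # when the entry was created or last refreshed


def build_feature_library(samples, now=None):
    """Map each fraudulent tag tree sequence to its malicious value and update time."""
    now = now or datetime.now(timezone.utc)
    counts = Counter(encode_tag_tree(html) for html in samples)
    total = sum(counts.values())
    return {seq: FeatureEntry(n / total, now) for seq, n in counts.items()}


if __name__ == "__main__":
    page = ("<html><body><div><img src='a.png'>"
            "<a href='#'><img src='b.png'/></a></div></body></html>")
    print(encode_tag_tree(page))
    print(build_feature_library([page, page]))
```

Keying the library by the sequence string keeps lookups for a webpage under test to a single dictionary access; a real system would likely also need approximate matching between sequences, which this sketch does not attempt.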
Embodiment 2
[0062] The system of the present invention for identifying picture-oriented fraudulent webpages includes a collection module, a label extraction module, a fraudulent label tree module, a malicious value initialization module, a feature database initialization module, a preliminary judgment module for a webpage to be tested, a suspected fraudulent webpage judging module, a fraudulent webpage judging module and a feature library cleaning module. The system can execute the method disclosed in Embodiment 1.
[0063] The collection module is used to collect fraudulent webpages that consist mainly of pictures to construct webpage samples.
[0064] The label extraction module is used to extract the label tree information with a webpage label tree extraction tool, encode the label tree as characters, and construct the label tree sequence from the characters corresponding to the labels; or, it is...
Embodiment 3
[0078] An apparatus according to the present invention includes at least one memory and at least one processor. The at least one memory is used to store a machine-readable program, and the at least one processor is used to call the machine-readable program to execute the method disclosed in Embodiment 1.
