The invention relates to a financial
fishing webpage detection method based on
Web page characteristics. The method is based on pre-established financial first Title keyword
library, second Title keyword
library, sensitive keyword
library and webpage Logo icon
characteristic point rule library, and comprises the steps of obtaining an
HTML of a to-be-detected webpage by employing a crawler, extracting text information of a Title
label, calculating a matching degree with the first and second Title keyword libraries, and if the matching degree is greater than a threshold value, judging as a
fishing webpage, or otherwise, going to the next step of detecting; extracting the text information of a special
label of the to-be-detected webpage, making a statistics on a matching number with the sensitive keyword library, calculating a sensitive characteristic value, and if the characteristic value is greater than the threshold value, judging as the
fishing webpage, or otherwise, going to the next step of detecting; and carrying out fixed point interception on the to-be-detected webpage, obtaining a Logo icon of the to-be-detected webpage, extracting characteristic points of the Logo icon, comparing with the icon
characteristic point rule library, calculating a similar degree according to the matching number of characteristic points, and if the similar degree is greater than the threshold value, judging as the fishing webpage, or otherwise, as a normal webpage. According to the financial fishing webpage detection method based on the
Web page characteristics related by the invention, whether the to-be-detected
Web page is the financial fishing webpage can be judged accurately and quickly.