The invention discloses a
patent infringement clue
web crawler method for network commodities, which comprises the following steps: constructing a
patent infringement clue template, automatically selecting keywords, pictures and technical features according to high-risk infringement products, related information of user complaints or related expert experience, and putting the keywords and the pictures into a
queue to be captured; extracting the keywords and the pictures to be captured from the
queue to be captured, putting the keywords and the pictures to be captured into a
search engine, downloading the searched corresponding URL webpage, and storing the searched URL webpage into a downloaded URL webpage
library; in addition, putting the web pages into a captured
queue; analyzing URL webpages in the captured queue, analyzing other URL webpages contained in the captured URL webpages, and putting the URLs into a to-be-captured URL queue, so as to enter second capturing, and repeating the steps; and analyzing the downloaded data in the finally captured URL to obtain information of related products, and finally pushing the information to a page. According to the method, the accuracy of analysis and judgment of network
patent infringement and counterfeit clues can be effectively improved.