A method and system for intelligent identification of webpage types based on deep learning
A deep learning and intelligent recognition technology, applied in character and pattern recognition, network data retrieval, network data query, etc., can solve the problems of obvious human factors, trouble, and low classification accuracy in the determination of features, so as to solve the classification defects. The effect of low rate, improved accuracy, and improved efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0054] figure 1 It is a kind of web page type intelligent identification method based on deep learning of the present invention, comprising the following steps:
[0055] S1. Input the webpage to be classified and identified;
[0056] S2. The deep learning classification model classifies and recognizes the input webpage, and obtains category information of the webpage to be classified and recognized.
[0057] figure 2 For the specific training process of the deep learning classification model:
[0058] S2.1. Obtaining a web page data set marked with categories;
[0059]Targeted collection of web pages, and mark the web page category, through the crawler targeted collection of web pages, the provincial / city portals are classified into one category, the ministry websites are grouped into one category, and the vertical system websites are grouped into one category. Take several webpages, a total of 100,000 webpages as training webpages, and mark these training webpages with w...
Embodiment 2
[0078] A web page type intelligent identification system based on deep learning of the present invention, the system includes the following modules:
[0079] A web page type intelligent identification system based on deep learning, the system includes the following modules:
[0080] Input module: input the webpage to be classified and identified;
[0081] Type identification module: the deep learning classification model classifies and identifies the input webpage, and obtains the category information of the webpage to be classified and identified;
[0082] The deep learning classification model is further composed of the following modules:
[0083] Data acquisition module: acquire web page data sets marked with categories;
[0084] Screening module: filter training webpage set and test webpage set;
[0085] Preprocessing module: perform preprocessing operations on web pages;
[0086] Model calculation module: deep learning classification model calculatio...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


