Webpage classification method, terminal equipment and storage medium
A webpage classification technology, applied in neural learning methods, website content management, network data retrieval, etc. It addresses the problems that existing approaches are not widely applicable to webpage data, have a limited scope of application, and generalize poorly. It mitigates the sparse webpage feature problem and achieves broad applicability.
Examples
Embodiment 1
[0030] This embodiment of the present invention provides a webpage classification method. As shown in Figure 1, the method includes the following steps:
[0031] S1: Collect web pages of multiple types, construct graph structures based on at least two types of features in each web page, label the type of each web page, and then form a training set from all the type-labeled graph structures.
[0032] Constructing the graph structure involves constructing nodes and constructing edges. The nodes in this embodiment include picture nodes corresponding to the picture type, text nodes corresponding to the text type, and webpage nodes corresponding to the webpage structure type. As shown in Figure 2, nodes beginning with "O" represent different webpage nodes, nodes beginning with "W" represent different text nodes, and nodes beginning with "P" represent different picture nodes.
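The node naming and the labeled training set from step S1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The `PageGraph` class, the `build_graph` helper, and the example labels "news" and "shopping" are hypothetical names introduced only for illustration. The edge rule used here (one webpage node linked to every text and picture node) is an assumption, since the edge construction is not detailed in this excerpt.

```python
from dataclasses import dataclass, field

# Prefixes follow the naming described for Figure 2:
# "O" = webpage node, "W" = text node, "P" = picture node.
PREFIX = {"webpage": "O", "text": "W", "picture": "P"}


@dataclass
class PageGraph:
    """Hypothetical container for one type-labeled graph structure."""
    label: str                                  # webpage type label from step S1
    nodes: dict = field(default_factory=dict)   # node id -> node kind
    edges: set = field(default_factory=set)     # undirected edges as sorted id pairs
    _counts: dict = field(default_factory=dict, repr=False)

    def add_node(self, kind: str) -> str:
        """Create a node of the given kind and return its prefixed id."""
        n = self._counts.get(kind, 0) + 1
        self._counts[kind] = n
        node_id = f"{PREFIX[kind]}{n}"
        self.nodes[node_id] = kind
        return node_id

    def add_edge(self, a: str, b: str) -> None:
        """Add an undirected edge between two existing nodes."""
        if a not in self.nodes or b not in self.nodes:
            raise KeyError("both endpoints must be existing nodes")
        self.edges.add(tuple(sorted((a, b))))


def build_graph(label: str, n_texts: int, n_pictures: int) -> PageGraph:
    """ASSUMPTION: a single webpage node is linked to every text and picture node."""
    g = PageGraph(label=label)
    page = g.add_node("webpage")
    for _ in range(n_texts):
        g.add_edge(page, g.add_node("text"))
    for _ in range(n_pictures):
        g.add_edge(page, g.add_node("picture"))
    return g


# Training set: all graph structures together with their type labels.
training_set = [build_graph("news", 2, 1), build_graph("shopping", 1, 2)]
```

Keeping the node kind in the id prefix keeps the three feature types distinguishable inside a single graph, so a downstream model could treat them differently if needed.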
[0033] 1. Picture node
[0034] In this embodiment, the picture nodes use th...
Embodiment 2
[0076] The present invention also provides a webpage classification terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the steps of the method of Embodiment 1 above are implemented.
[0077] Further, as an executable solution, the webpage classification terminal device may be a computing device such as a desktop computer, a notebook, a palmtop computer, or a cloud server. The webpage classification terminal device may include, but is not limited to, a processor and a memory. Those skilled in the art will understand that the composition described above is only an example of the webpage classification terminal device and does not constitute a limitation on it, and that the device may include more or fewer components than the above, ...