A webpage classification method, terminal equipment and storage medium
A web page classification technology, applied in fields such as neural learning methods, website content management, and web data retrieval. It addresses the shortcomings of existing approaches, which are not widely applicable to web page data, have a limited application scope, and have a high classification error rate. Its effects are to solve the problem of sparse web page features, to offer a wide application range, and to achieve a good recognition effect.
Examples
Embodiment 1
[0030] The embodiment of the present invention provides a webpage classification method. As shown in Figure 1, the method includes the following steps:
[0031] S1: Collect web pages of multiple types, construct graph structures based on at least two types of features in each web page, and label the graph structures with the types of the web pages. All type-labeled graph structures then form the training set.
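Step S1 can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the `LabeledGraph` class, the `build_training_set` function, the page dictionary layout, and the stub graph builder are all hypothetical names introduced here. It also assumes one graph per collected page and assumes that pages with fewer than two feature types are skipped; the text states neither.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class LabeledGraph:
    """A graph structure together with the type label of its web page."""
    nodes: dict   # node id -> node payload
    edges: set    # pairs of node ids
    label: str    # web page type


def build_training_set(pages: list, build_graph: Callable) -> list:
    """Turn collected pages into type-labeled graph structures."""
    training_set = []
    for page in pages:
        # Feature types that actually have content on this page.
        present = [kind for kind, items in page["features"].items() if items]
        if len(present) < 2:
            continue  # assumption: at least two feature types are required
        nodes, edges = build_graph(page["features"])
        training_set.append(LabeledGraph(nodes, edges, page["label"]))
    return training_set


def stub_builder(features: dict):
    """Placeholder builder: one node per feature item, no edges."""
    nodes = {
        f"{kind}:{i}": item
        for kind, items in features.items()
        for i, item in enumerate(items)
    }
    return nodes, set()


pages = [
    {"features": {"text": ["login"], "picture": ["logo.png"]}, "label": "type_a"},
    {"features": {"text": ["hello"], "picture": []}, "label": "type_b"},
]
training_set = build_training_set(pages, stub_builder)
```

The graph builder is passed in as a parameter so that the sketch does not commit to any particular node or edge construction rule.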
[0032] Constructing the graph structure involves constructing nodes and constructing edges. The nodes in this embodiment include picture nodes corresponding to the picture type, text nodes corresponding to the text type, and webpage nodes corresponding to the webpage structure type. As shown in Figure 2, nodes beginning with "O" represent different webpage nodes, nodes beginning with "W" represent different text nodes, and nodes beginning with "P" represent different picture nodes.
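The node naming convention can be illustrated with a small data structure. This is a minimal sketch under stated assumptions: the `PageGraph` class and its methods are hypothetical, the sequential numbering (O1, W1, P1) is an assumption, and the edges added in the usage lines are placeholders, since the edge construction rule is not given in this excerpt.

```python
from itertools import count

# Prefixes taken from the description of Figure 2.
PREFIX = {"webpage": "O", "text": "W", "picture": "P"}


class PageGraph:
    """Graph with three node kinds, named by the O / W / P prefix."""

    def __init__(self):
        self.nodes = {}    # node id -> payload (opaque here)
        self.edges = set() # undirected edges stored as frozensets (assumption)
        self._counters = {kind: count(1) for kind in PREFIX}

    def add_node(self, kind: str, payload) -> str:
        """Create a node of the given kind and return its id, e.g. 'W1'."""
        node_id = f"{PREFIX[kind]}{next(self._counters[kind])}"
        self.nodes[node_id] = payload
        return node_id

    def add_edge(self, a: str, b: str) -> None:
        """Connect two existing nodes."""
        if a not in self.nodes or b not in self.nodes:
            raise KeyError("both endpoints must be existing nodes")
        self.edges.add(frozenset((a, b)))

    def nodes_of(self, kind: str) -> list:
        """Return the ids of all nodes of one kind, in insertion order."""
        return [n for n in self.nodes if n.startswith(PREFIX[kind])]


g = PageGraph()
o = g.add_node("webpage", "page structure placeholder")
w = g.add_node("text", "text feature placeholder")
p = g.add_node("picture", "picture feature placeholder")
# Hypothetical edges, for illustration only.
g.add_edge(o, w)
g.add_edge(o, p)
```

Keeping the kind in the id prefix means the node type can be recovered from the id alone, which matches how Figure 2 is described.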
[0033] 1. Picture node
[0034] In this embodiment, the picture nodes use ...
Embodiment 2
[0076] The present invention also provides a webpage classification terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the steps of the method of Embodiment 1 above are implemented.
[0077] Further, as an implementable solution, the webpage classification terminal device may be a computing device such as a desktop computer, notebook, palmtop computer, or cloud server. The webpage classification terminal device may include, but is not limited to, a processor and a memory. Those skilled in the art will understand that the composition described above is only an example of the webpage classification terminal device and does not limit it; the device may include more or fewer components than those described, ...