A web page feature extraction method and device
A web page feature and extraction method technology, applied in the Internet field, can solve problems such as structural dependence, web page structure dependence, unreasonable calculation of feature word weight, etc., to achieve the effect of optimizing quality and ensuring correctness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention.
[0044] The web page feature extraction method provided by the present invention adopts position weight in extracting web page features, and integrates the influence of the two elements of position weight and frequency of occurrence on web page feature vector extraction. On the basis of extracting high-frequency words that are distinguishable from other web pages on the entire network, the target web page is divided into multiple document parts according to the basic position structure of web page information, and different weight ratio values are assigned to each document part, and according to the web page The number of occurrences of the feature word with the highest number of times is used as the weight value of the basic position, and the product of the two is used to det...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com