Method and device for analyzing mobile user internet behavior based on URL analysis model
A mobile user behavior analysis technology, applied in the fields of network data retrieval, network data indexing, and special data processing applications. It addresses problems such as cumbersome implementation, high crawler performance requirements, and heavy system workload, and achieves the effect of reducing workload.
Examples
Embodiment 1
[0051] Figure 1 is a flow chart of the method for analyzing the online behavior of mobile users based on the URL analysis model provided by the present invention. The method includes:
[0052] Step S1, downloading the webpage.
[0053] Specifically, the crawler communicates with the web server over the HTTP protocol and downloads the web page through a socket, while being prevented from accessing a large number of pages under the same host in a short period of time.
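Step S1 can be illustrated with a minimal sketch. It assumes a fixed minimum interval between requests to the same host as the way of limiting access; the source does not state how the limit is enforced. The names `HostThrottle`, `build_request`, `download` and the 2-second interval are hypothetical and chosen only for illustration.

```python
import socket
import time
from urllib.parse import urlsplit


class HostThrottle:
    """Tracks, per host, the earliest time the next request may be sent."""

    def __init__(self, min_interval=2.0):
        # Assumed politeness delay in seconds; not specified by the source.
        self.min_interval = min_interval
        self._next_allowed = {}

    def reserve(self, host, now):
        """Book the next slot for `host` and return how long to wait first."""
        start = max(now, self._next_allowed.get(host, now))
        self._next_allowed[host] = start + self.min_interval
        return start - now


def build_request(url):
    """Build a plain HTTP/1.1 GET request for `url` (http only in this sketch)."""
    parts = urlsplit(url)
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    request = (
        f"GET {path} HTTP/1.1\r\n"
        f"Host: {parts.hostname}\r\n"
        "Connection: close\r\n\r\n"
    )
    return parts.hostname, parts.port or 80, request.encode("ascii")


def download(url, throttle, timeout=10.0):
    """Wait for the host's slot, then fetch the raw HTTP response over a socket."""
    host, port, request = build_request(url)
    time.sleep(throttle.reserve(host, time.monotonic()))
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(request)
        chunks = []
        while True:
            data = conn.recv(4096)
            if not data:  # server closed the connection
                break
            chunks.append(data)
    return b"".join(chunks)
```

Keeping the throttle separate from the socket code means requests to different hosts are not delayed by each other, while repeated requests to one host are spaced out.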
[0054] Step S2, performing preprocessing and information extraction on the downloaded webpage.
[0055] Specifically, the downloaded webpage is preprocessed as follows. Encoding conversion: the content of the webpage is converted from other encodings into GBK, and traditional Chinese characters are converted into simplified Chinese characters at the same time. CSS processing: from Extract relevant CSS, JS, Ti...
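The encoding conversion described above can be sketched as follows. This is only an illustration: the four-character traditional-to-simplified table `T2S` is a hypothetical stand-in for the full conversion table a real system would need, and the function name `to_gbk_simplified` is an assumption, not taken from the patent. The sketch also assumes the source encoding of the page is already known.

```python
# Tiny illustrative traditional -> simplified table (hypothetical; a real
# system would need a complete conversion table).
T2S = str.maketrans({"網": "网", "頁": "页", "資": "资", "訊": "讯"})


def to_gbk_simplified(raw: bytes, source_encoding: str) -> bytes:
    """Decode page bytes, map traditional characters to simplified, re-encode as GBK."""
    text = raw.decode(source_encoding, errors="replace")
    text = text.translate(T2S)
    # Characters that GBK cannot represent are replaced rather than raising.
    return text.encode("gbk", errors="replace")
```

For example, calling `to_gbk_simplified("網頁".encode("utf-8"), "utf-8")` returns GBK bytes that decode to the simplified form of the same word.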
Embodiment 2
[0063] Figure 2 is a functional block diagram of the mobile user online behavior analysis device based on the URL analysis model provided by the present invention. The device includes: a download module 10, a webpage analysis module 20, a URL and topic correlation judgment module 30, a sorting module 40, and a matching module 50. The download module 10 is used to download webpages. The webpage analysis module 20 is configured to preprocess downloaded webpages and extract information from them. The URL and topic correlation judgment module 30 is used to judge the topic relevance of all extracted valid links. The sorting module 40 is used to sort the topic-related URLs by their PageRank values and, at the same time, to create a mapping table of URLs and their corresponding topics. The matching module 50 is used to match the URL generated by the user...
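The roles of the sorting module 40 and the matching module 50 can be sketched as below. The PageRank values are taken as given inputs rather than computed, and because the description of the matching module is cut off in the text, the exact-lookup rule in `match_topic` is an assumption made only for illustration. All names (`ScoredUrl`, `rank_and_map`, `match_topic`) are hypothetical.

```python
from typing import NamedTuple


class ScoredUrl(NamedTuple):
    url: str
    topic: str
    pagerank: float  # assumed to be computed elsewhere


def rank_and_map(scored):
    """Sort topic-related URLs by PageRank (highest first) and build a URL -> topic table."""
    ranked = sorted(scored, key=lambda s: s.pagerank, reverse=True)
    mapping = {s.url: s.topic for s in ranked}
    return ranked, mapping


def match_topic(user_url, mapping):
    """Assumed matching rule: exact lookup of the user's URL in the mapping table."""
    return mapping.get(user_url)
```

A URL that is absent from the mapping table yields `None` here; a real device might instead fall back to host-level or prefix matching.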