Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for analyzing mobile user internet behavior based on URL analysis model

A mobile user and behavior analysis technology, applied in the direction of network data retrieval, network data indexing, special data processing applications, etc., can solve the problems of cumbersome implementation, high crawler performance requirements, heavy system workload, etc., and achieve the goal of reducing workload Effect

Inactive Publication Date: 2016-09-21
GUANGDONG KINGPOINT DATA SCI & TECH CO LTD
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the processing method of mobile Internet users' surfing behavior, the URL generated by the user's surfing behavior can be incrementally crawled, and the crawled web pages are analyzed and then matched with the operator's business. The performance requirements are very high, the implementation is cumbersome, and the workload in the later stage of the system is heavy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for analyzing mobile user internet behavior based on URL analysis model
  • Method and device for analyzing mobile user internet behavior based on URL analysis model
  • Method and device for analyzing mobile user internet behavior based on URL analysis model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0051] Such as figure 1 As shown, it is a flow chart of a method for analyzing the online behavior of mobile users based on the URL analysis model provided by the present invention, and the method includes:

[0052] Step S1, downloading the webpage.

[0053] Specifically, the HTTP protocol is used to communicate with the web server, and the web page is downloaded by using the socket method in the case of preventing the crawler from accessing a large number of pages under the same host in a short period of time.

[0054] Step S2, performing preprocessing and information extraction on the downloaded webpage.

[0055] Specifically, the downloaded webpage is preprocessed, specifically including: encoding conversion: performing encoding conversion on the content of the webpage, converting other types of encoding types into GBK types, and converting traditional Chinese characters into simplified Chinese characters at the same time; CSS processing: from Extract relevant CSS, JS, Ti...

Embodiment 2

[0063] Such as figure 2 As shown, it is a functional block diagram of a mobile user online behavior analysis device based on the URL analysis model provided by the present invention. A mobile user online behavior analysis device based on a URL analysis model, the device includes: a download module 10 , a web page analysis module 20 , a URL and topic correlation judgment module 30 , a sorting module 40 and a matching module 50 . Wherein, the download module 10 is used for downloading the webpage. The webpage analysis module 20 is configured to preprocess and extract information from downloaded webpages. The URL and topic correlation judging module 30 is used to judge the topic correlation of all the extracted effective links. The sorting module 40 is used to sort the URLs related to the topics according to their PageRank values, and at the same time create a mapping table of corresponding URLs and topics. The matching module 50 is used to match the URL generated by the user...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a device for analyzing mobile user internet behaviors based on a URL analysis model. The device comprises a download module, a webpage analysis module, a URL and topic relevance determination module, an ordering module, and a matching module. Compared with the prior art, the method and a device for analyzing mobile user internet behaviors based on a URL analysis model have beneficial effects in that user internet behavior analysis based on URL analysis is realized, and through using a topical crawler, a mapping table is formed, and the URLs generated by user internet behaviors are used to match with the mapping table, and the URLs are classified in corresponding classifications. Thus, work of the crawler is brought forward before development, and later-phase workload of a system is reduced. In addition, aimed at a defect of topic drift caused by just using a PageRank algorithm by a common topical crawler, before URL ordering, through determining topic relevance, topic offset degree can be reduced on the basis of not substantially increasing complexity of the algorithm.

Description

technical field [0001] The invention relates to the technical field of subject crawlers, in particular to a method and device for analyzing mobile user online behavior based on a URL analysis model. Background technique [0002] With the advent of Internet 2.0, mobile terminals have become a part of our lives, which has accumulated a huge number of users' online behaviors for operators. Effective use of these online behaviors to push services of interest to users can improve the user experience and enhance the competitiveness of operators at the same time. In the processing method of mobile Internet users' surfing behavior, the URL generated by the user's surfing behavior can be incrementally crawled, and the crawled web pages are analyzed and then matched with the operator's business. The performance requirements are very high, the implementation is cumbersome, and the workload in the later stage of the system is heavy. [0003] In view of the above-mentioned defects, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/9566G06F16/951
Inventor 窦钰景简宋全李青海邹立斌
Owner GUANGDONG KINGPOINT DATA SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products