Method and device for classifying URLs

A technique relating to classification results and webpage types, applied in the fields of big data and the Internet. It addresses the problem of low URL classification efficiency and achieves high classification efficiency.

Active Publication Date: 2019-07-09
CHINA TELECOM CORP LTD
Cites: 7; Cited by: 0

AI Technical Summary

Problems solved by technology

[0006] One technical problem to be solved by the embodiments of the present invention is the low efficiency of URL classification.

Embodiment Construction

[0026] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0027] The relative arrangements of components and steps, numerical expressions, and numerical values set forth in these embodiments do not limit the scope of the present invention unless specifically stated otherwise.

[0028] It should also be understood that, for convenience of description, the parts shown in the drawings are not drawn to scale.

[0029] Techniques, methods and devices...

Abstract

The invention discloses a method and a device for classifying URLs (Uniform Resource Locators), and relates to the fields of big data and Internet technology. The method comprises the following steps: obtaining, for each user who accesses a URL, the user's feature information and the frequency with which that user accesses the URL, wherein the user feature information comprises user tags determined from the user's historical online behavior and a weight for each user tag; determining URL feature information from the obtained user feature information and access frequencies, wherein the URL feature information comprises the webpage types of the URL and a weight for each webpage type; and classifying the URL according to the URL feature information. The method can improve URL classification efficiency.
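The steps in the abstract can be illustrated with a minimal sketch. The abstract does not give the aggregation formula, so everything specific here is an assumption: each user tag is taken to name a webpage type, a type's weight is the access-frequency-weighted sum of the matching tag weights normalized to sum to 1, and the URL is assigned the type with the largest weight. The function names `url_feature_info` and `classify_url` and the sample data are hypothetical.

```python
from collections import defaultdict


def url_feature_info(user_tags, access_counts):
    """Aggregate user tag weights into per-webpage-type weights for one URL.

    user_tags: {user_id: {tag: weight}}, tags derived from historical behavior.
    access_counts: {user_id: number of times that user accessed the URL}.
    Assumption (not from the patent text): each tag names a webpage type, and
    a type's weight is the frequency-weighted sum of tag weights, normalized.
    """
    scores = defaultdict(float)
    for user_id, count in access_counts.items():
        for tag, weight in user_tags.get(user_id, {}).items():
            scores[tag] += count * weight
    total = sum(scores.values())
    if total == 0:
        return {}
    return {tag: score / total for tag, score in scores.items()}


def classify_url(feature_info):
    """Return the webpage type with the largest weight, or None if empty."""
    if not feature_info:
        return None
    return max(feature_info, key=feature_info.get)


# Hypothetical users, tags, and access counts for a single URL.
user_tags = {
    "u1": {"sports": 0.8, "news": 0.2},
    "u2": {"sports": 0.5, "shopping": 0.5},
    "u3": {"news": 1.0},
}
access_counts = {"u1": 10, "u2": 4, "u3": 1}

features = url_feature_info(user_tags, access_counts)
label = classify_url(features)
```

In this sketch the frequent visitor `u1` dominates the result, which reflects the idea that heavier users of a URL say more about its type. Other weightings, such as capping per-user counts, would be equally consistent with the abstract.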

Description

Technical field

[0001] The invention relates to the technical fields of big data and the Internet, and in particular to a method and a device for classifying URLs (Uniform Resource Locators).

Background

[0002] At present, the analysis of users' online behavior based on DPI (Deep Packet Inspection) data is mainly achieved by matching the URLs visited by users against a URL address library and then tagging the users.

[0003] The URL address library is generally constructed by classifying URLs using web page content extraction and identification technology. However, the inventors of the present invention found that classifying URLs in this way has the following disadvantages:

[0004] First, because personalized algorithms must be designed for different websites, classifying URLs involves a heavy workload and low efficiency;

[0005] Second, after different websites are revised, U...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/955
CPC: G06F16/9566
Inventor: 赵钧, 石屹嵘, 黄磊, 邱晨旭
Owner: CHINA TELECOM CORP LTD