URL storage matching method and device

A URL storage and matching method and device, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the problems that existing URL storage and matching methods cannot achieve exact matching, occupy a large amount of space, and have low partial-matching accuracy, and it avoids taking up huge storage space.

Inactive Publication Date: 2015-04-22
CHINA UNITED NETWORK COMM GRP CO LTD
5 Cites, 20 Cited by

AI Technical Summary

Problems solved by technology

[0026] The present invention provides a method and device for URL storage and matching, which are used to solve the problems that the exist



Examples


Embodiment Construction

[0046] As shown in Figure 3, the preferred embodiment of the present invention provides a method for storing and matching URLs, comprising the following steps: preprocessing each original URL in the Uniform Resource Locator (URL) library to obtain a URL that retains the domain name; converting the URL that retains the domain name into uppercase letters and reversing the domain name to obtain the URL to be stored; creating a dictionary tree for the URLs to be stored; and querying the URL to be matched against the created dictionary tree.

[0047] Herein, based on the content of the URL library shown in Table 1 and the URL extracted from mobile phone wireless Internet access records shown in Table 2, the URL storage and matching method provided by the preferred embodiment of the present invention is described.

[0048] Specifically, each URL in the URL library (i.e., Table 1) is first preprocessed. Preprocessing is mainly divided into processes such as removing the http protocol identifi...
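A minimal sketch of the one preprocessing step visible in this excerpt, removing the http protocol identifier, combined with keeping only the domain name as stated in [0046]. The remaining preprocessing steps are cut off in this excerpt and are not reproduced here. Handling of `https://`, and the stripping of ports, paths, queries, and fragments, are assumptions added for illustration; the function names are hypothetical.

```python
def strip_protocol(url: str) -> str:
    """Remove a leading protocol identifier, if present.

    The source names only the http identifier; https is an assumption.
    """
    for prefix in ("http://", "https://"):
        if url.lower().startswith(prefix):
            return url[len(prefix):]
    return url


def keep_domain(url: str) -> str:
    """Reduce a URL to its domain name (assumed meaning of 'retains the domain name')."""
    rest = strip_protocol(url.strip())
    # Assumption: everything from the first path, query, or fragment
    # separator onward is discarded.
    for sep in ("/", "?", "#"):
        rest = rest.split(sep, 1)[0]
    # Assumption: an explicit port is not part of the stored domain.
    return rest.split(":", 1)[0]


print(keep_domain("http://map.baidu.com/search?q=1"))
```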



Abstract

The invention discloses a URL storage and matching method and device. The method includes the following steps: each original URL in a URL library is preprocessed to obtain a URL that retains the domain name; the URLs that retain the domain name are converted into uppercase letters and their domain names are reversed to obtain the URLs to be stored; a dictionary tree is created for the URLs to be stored; and URLs to be matched are queried against the created dictionary tree. The URL storage and matching method and device solve the problems that existing URL storage and matching methods cannot achieve exact matching, occupy a large amount of space, and have low partial-matching accuracy.

Description

Technical field

[0001] The invention relates to the field of mass data storage and query, in particular to a method and device for storing and matching web addresses.

Background art

[0002] At present, with the continued growth of network communication, mobile terminals accessing the Internet wirelessly generate hundreds of millions of records every day, occupying terabytes of storage space; a month of such data reaches trillions of records and petabytes. A variety of useful information can be mined from this massive database, such as a monthly ranking of the top 1000 websites by clicks. Since the URL information contained in an Internet access record is a detailed Uniform Resource Locator (URL) link, classifying and counting different URLs of the same website raises the problem of how to convert and match a URL to the website name. For example: two URLs: www.baidu.com and map.baidu.com are...


Application Information

IPC(8): G06F17/30
CPC: G06F16/9566
Inventor: 尹为强, 罗云彬, 赵锡成, 王伟华
Owner: CHINA UNITED NETWORK COMM GRP CO LTD