URL processing method and device

A URL processing method and device, applied to search systems, URL-normalization search engine devices, and electronic equipment. It addresses the inability of existing URL normalization to meet the accuracy and customization requirements of specific business scenarios.

Status: Pending
Publication Date: 2020-07-03
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0006] This application provides a URL processing method to solve the problem that existing URL normalization results cannot meet the accuracy and customization requirements of specific business scenarios.



Embodiment Construction

[0085] In the following description, numerous specific details are set forth to provide a thorough understanding of the application. However, the application can be implemented in many ways other than those described here, and those skilled in the art can make similar generalizations without departing from its spirit. The application is therefore not limited to the specific implementations disclosed below.

[0086] In order to reduce the waste of search time, memory consumption, storage space, and other computing resources caused by repeated searches, downloads, or indexing of the same network resource, and improve the efficiency of search engines, URLs need to be normalized. The existing URL normalization processing methods are mainly simple regular expression matching methods and whitelist matching methods.

[0087] The simple regular expression matching method traverses and matches the path and request fields of URLs ...


Abstract

The invention discloses a URL processing method and device. The method comprises: obtaining a plurality of URLs corresponding to a target host domain name; obtaining, for the plurality of URLs, the number of pieces of path sub-information that are located at the same position but differ in content; and normalizing the path information in the plurality of URLs according to that number, to obtain a normalized URL that corresponds to the target host domain name and is used for crawling network content. With this method, URL normalization can take into account the business scenario of the network resource that the URL points to, or the business requirements within that scenario. A normalization method adapted to that scenario or those requirements can therefore be formulated, so that the normalization results meet the accuracy and customization requirements of the specific business scenario.
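The counting step in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the `threshold` rule (a path position is treated as variable once it holds at least `threshold` distinct values), and the `*` placeholder are all assumptions made for illustration, and every input URL is assumed to belong to the same target host.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def split_path(url):
    """Return the non-empty path segments of a URL, in order."""
    return [seg for seg in urlsplit(url).path.split("/") if seg]


def count_distinct_segments(urls):
    """Count, for each path position, how many distinct segment values occur."""
    seen = defaultdict(set)
    for url in urls:
        for pos, seg in enumerate(split_path(url)):
            seen[pos].add(seg)
    return {pos: len(values) for pos, values in seen.items()}


def normalize_urls(urls, threshold=3, placeholder="*"):
    """Replace high-variety path positions with a placeholder.

    threshold and placeholder are hypothetical parameters, not taken
    from the patent text.
    """
    counts = count_distinct_segments(urls)
    normalized = set()
    for url in urls:
        parts = urlsplit(url)
        segments = [
            placeholder if counts[pos] >= threshold else seg
            for pos, seg in enumerate(split_path(url))
        ]
        # Only the path is rewritten; scheme, host, and query are kept as-is.
        normalized.add(parts._replace(path="/" + "/".join(segments)).geturl())
    return sorted(normalized)


if __name__ == "__main__":
    sample = [
        "http://example.com/item/1",
        "http://example.com/item/2",
        "http://example.com/item/3",
        "http://example.com/about/team",
    ]
    for url in normalize_urls(sample):
        print(url)
```

In this sketch the threshold is the knob that a business scenario would tune: a low value merges URLs aggressively, while a high value keeps more of them distinct.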

Description

Technical field

[0001] The present application relates to the technical field of network data communication, and in particular to a URL (Uniform Resource Locator) processing method. The present application also relates to a URL processing device and an electronic device; to a further URL processing method, URL processing device, and electronic device; and to a search system and a URL normalization search engine device.

Background

[0002] A search engine is a system that uses specific computer programs to collect, organize, and process network resources from the Internet according to predetermined strategies, provides users with search services for those resources, and displays the relevant resources to users. Locating network resources through a URL is the basic way to obtain them. When searc...


Application Information

IPC(8): G06F16/955
CPC: G06F16/9566
Inventors: 沈馨悦, 刘翔宇
Owner: ALIBABA GRP HLDG LTD