Dynamic URL filtering method and device
A filtering method and filtering device technology, applied in special data processing applications, using information identifiers to retrieve web data, instruments, etc., can solve the problems of long process, slow speed, resource consumption, etc., achieve fast processing speed, reduce storage, The effect of saving processing time and computing resources
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0039] In the first embodiment of the present invention, a dynamic URL filtering method, such as figure 1 shown, including the following specific steps:
[0040] Step S101, create an information dictionary based on the URL annotation set, and the content of the information dictionary includes two types: character string features and statistical features.
[0041]Specifically, the statistical features and the character string features are derived from all URLs in the URL annotation set, and the statistical features include at least the normalized value of one of the following items: the number of occurrences of the set punctuation marks, the path depth , the number of digits in the domain name and / or path, the length of the longest character string in the domain name and / or path, the length of the suffix, and the conversion frequency between numbers and characters. For example: the method of determining the normalized value of the number of occurrences of the set punctuation m...
no. 3 example
[0061] The third embodiment of the present invention, this embodiment is based on the above-mentioned embodiments, taking the dynamic and static classification of URL collections by using the linear logistic regression classification algorithm as an example, combined with the attached Figure 3-7 An application example of the present invention is introduced.
[0062] Different from the traditional method of classifying static / dynamic URLs with MD5 values, the application example of the present invention classifies URLs based on a linear logistic regression classification algorithm and a new feature set. The flow of the whole classification process is as follows image 3 shown.
[0063] In the application example of the present invention, the linear logistic regression classification algorithm is applied to solve the dynamic URL filtering problem. In addition, although the present invention follows the idea of classification by logistic regression, the feature extraction st...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com
