Certificate and domain name resolution-based gambling domain name identification method

An identification method and certificate analysis technology, applied in the computer field, can solve the problem of low identification accuracy of gambling domain names, and achieve the effect of simple, fast, accurate and fast identification and high classification accuracy.

Pending Publication Date: 2022-04-19
HARBIN INST OF TECH AT WEIHAI
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to solve the technical problem of low recognition accuracy of existing gambling domain names without parsing the web page text, the present invention provides a high recognition accuracy, time-saving and quick gambling domain name recognition method based on certificate and domain name analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Certificate and domain name resolution-based gambling domain name identification method
  • Certificate and domain name resolution-based gambling domain name identification method
  • Certificate and domain name resolution-based gambling domain name identification method

Examples

Experimental program
Comparison scheme
Effect test

experiment example

[0072] 1. Experimental environment

[0073] All experiments were carried out on a Huawei computer with Windows 10 operating system, which is equipped with i7 processor, 16G memory and 512G solid-state hard drive.

[0074] 2. Experimental data acquisition

[0075] Obtain 10,000 gambling domain names with digital certificates through step (1) based on the Chinese classification model fine-tuned by Bert, and select and exclude the 200,000 domain names used in the process of building domain name whitelist substring collections from the Alex Top 1 million as the top ranked ones 10000 domain names as benign domain names. Obtain the gambling domain name and benign domain name according to step (2) digital certificate analysis method to obtain the feature vector of length 50 obtained by digital certificate analysis, and obtain the gambling domain name by the method of obtaining the text feature vector of the domain name through the N-gram method according to step (3) The feature vec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a gambling domain name recognition method based on certificates and domain name resolution, which solves the technical problem that the existing gambling domain name recognition accuracy is low under the condition that a webpage text is not resolved, and comprises the following steps of: establishing a Chinese classification model constructed based on Bert fine tuning; respectively carrying out digital certificate analysis on the Chinese gambling domain name and the benign domain name; obtaining a text feature vector of the domain name through an N-gram method; and training and testing the digital certificate analysis feature vector and the domain name text feature vector of the Chinese gambling domain name and the benign domain name through RNN (Recurrent Neural Network), Decision Tree, ExtraTree, RandomForest, KNN (Kalman Neural Network) and SVM (Support Vector Machine) learning algorithms, and constructing a Chinese gambling domain name mining model. The method can be widely applied to identification of Chinese gambling domain names.

Description

technical field [0001] The invention relates to the field of computers, in particular to a method for identifying gambling domain names based on certificates and domain name resolution. Background technique [0002] With the rapid development of computer technology, the Internet has entered thousands of households, but while the Internet brings information and convenience to people, it also brings negative information. All kinds of bad content promoting pornography, violence, and gambling are flooding the Internet, which not only seriously pollutes the minds of minors, but also destroys the social atmosphere. Digital certificates implement public key management in the public key infrastructure, and can effectively avoid man-in-the-middle attacks in the network communication process. Many Chinese gambling websites will apply for gambling digital certificates that can be mistaken for benign certificates by browsers, thereby increasing the number of users. Trust in Chinese gam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N20/00
CPCG06F16/35G06N20/00
Inventor 张兆心孙国营程亚楠许海燕常利婷李冷文婷
Owner HARBIN INST OF TECH AT WEIHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products