Method and device for classifying character strings

A character string classification technique that addresses the low efficiency of manual classification by classifying strings automatically, improving both efficiency and accuracy.

Active Publication Date: 2020-02-28
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0004] Existing methods for classifying character strings rely on manual work, which is very inefficient.



Embodiment

[0066] Example description

[0067] The implementation of the method of the present invention is further described below through an embodiment. Figure 1 is a flow chart of a method for classifying character strings according to an embodiment of the present invention. The method includes:

[0068] S101: Obtain character strings to be classified.

[0069] Specifically, any character string input into the computing device may be acquired, and the acquired character string is treated as the character string to be classified.

[0070] S102: Extract multiple classification features from the character string to be classified.

[0071] Specifically, the classification features include: longest adjacent vowel distance, string information entropy, or string length.

[0072] Specifically, the longest adjacent vowel distance is the largest distance between any two adjacent vowel characters in a character string, and in this embodiment, "-" and t...
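
The three features named in S102 can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation. It assumes that vowels are the ASCII letters a, e, i, o, u in either case, that distance is the index gap between consecutive vowels, and that information entropy means Shannon entropy over the character frequencies. The embodiment's special handling of "-" is truncated in the text above and is not reproduced here. All function names are hypothetical.

```python
import math
from collections import Counter

# Assumption: only ASCII vowels count; the source text does not list them.
VOWELS = set("aeiouAEIOU")


def string_length(s: str) -> int:
    """Number of characters in the string."""
    return len(s)


def information_entropy(s: str) -> float:
    """Shannon entropy, in bits per character, of the character frequencies."""
    if not s:
        return 0.0
    n = len(s)
    return -sum((c / n) * math.log2(c / n) for c in Counter(s).values())


def longest_adjacent_vowel_distance(s: str) -> int:
    """Largest index gap between consecutive vowels; 0 if fewer than two vowels."""
    positions = [i for i, ch in enumerate(s) if ch in VOWELS]
    return max((b - a for a, b in zip(positions, positions[1:])), default=0)


def extract_features(s: str) -> dict:
    """Collect the three features into one record for later normalization."""
    return {
        "length": string_length(s),
        "entropy": information_entropy(s),
        "vowel_distance": longest_adjacent_vowel_distance(s),
    }
```

Calling `extract_features("aaaxbhzqegs-2")` on one of the random strings quoted in the background section gives a length of 13 and, under the assumptions above, a longest adjacent vowel distance of 6 (from the last "a" to the "e").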


Abstract

The invention discloses a method and device for classifying character strings, and belongs to the technical field of computer communication. The method includes: acquiring the character strings to be classified; extracting multiple classification features from the character strings to be classified; normalizing the classification features to obtain multiple normalized classification features; and, using a classification model obtained through offline training, classifying the character strings according to the normalized classification features to obtain a classification result. The device comprises an acquisition module, a first extraction module, a normalization module and a classification module. Because the offline-trained model classifies the strings from their normalized features, classification is performed automatically without manual work, and efficiency is therefore high.
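
The four steps in the abstract (acquire, extract, normalize, classify with an offline-trained model) might be wired together as in the sketch below. The source names neither the normalization scheme nor the model type, so min-max scaling and a nearest-centroid classifier are stand-ins chosen only for brevity. The label names, the function names, and the three "meaningful" training strings are made up for illustration; the three "random" strings are the examples quoted in the background section.

```python
import math
from collections import Counter


def features(s):
    """Two of the features named in the source: string length and entropy."""
    n = len(s)
    entropy = -sum((c / n) * math.log2(c / n) for c in Counter(s).values()) if n else 0.0
    return [float(n), entropy]


def normalize(row, lo, hi):
    """Min-max scale each feature with the bounds seen during training."""
    return [(x - l) / (h - l) if h > l else 0.0 for x, l, h in zip(row, lo, hi)]


def train_offline(samples):
    """Fit scaling bounds and one centroid per label from (string, label) pairs."""
    rows = [features(s) for s, _ in samples]
    cols = list(zip(*rows))
    lo, hi = [min(c) for c in cols], [max(c) for c in cols]
    sums, counts = {}, {}
    for (_, label), row in zip(samples, rows):
        vec = normalize(row, lo, hi)
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, x in enumerate(vec):
            acc[i] += x
        counts[label] = counts.get(label, 0) + 1
    centroids = {lab: [x / counts[lab] for x in acc] for lab, acc in sums.items()}
    return {"lo": lo, "hi": hi, "centroids": centroids}


def classify(model, s):
    """Return the label whose centroid is closest to the normalized features."""
    vec = normalize(features(s), model["lo"], model["hi"])
    return min(model["centroids"], key=lambda lab: math.dist(vec, model["centroids"][lab]))


# Toy training set. The "meaningful" strings are invented for this sketch.
TRAINING = [
    ("aaaxbhzqegs-2", "random"),
    ("4s7pTDAOV-L#", "random"),
    ("!oC|w4&s", "random"),
    ("alice", "meaningful"),
    ("hello", "meaningful"),
    ("shop", "meaningful"),
]

model = train_offline(TRAINING)
print(classify(model, "x9$Qz!7kP"))
print(classify(model, "door"))
```

With this toy data the two calls come out as "random" and "meaningful" respectively. A training set this small mostly separates strings by length, so the sketch shows only how the stages connect, not a usable detector.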

Description

Technical field

[0001] The invention relates to the technical field of computer communication, and in particular to a method and device for classifying character strings.

Background

[0002] With the development of computer communication technology, terminal devices such as computers, tablets, and mobile phones have gradually become indispensable tools for people's life and work. At the same time, the requirements for the operating capabilities of computing devices such as terminal devices and service devices are getting higher and higher. In many scenarios (for example, a registration machine maliciously registering a large number of invalid accounts, or an attack machine maliciously forging a large number of invalid domain name requests), the computing device receives a large number of random strings (such as "aaaxbhzqegs-2", "4s7pTDAOV-L#", and "!oC|w4&s"). These random strings have no meaning, but the computing device does not know it when it fir...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F16/906
Inventor: 李家宏
Owner: ALIBABA GRP HLDG LTD