Method, device, equipment and readable storage medium for identifying abnormal character strings

A technology of abnormal characters and strings, which is applied in the field of data processing, can solve the problems of difficult recognition of strings and low recognition of deformed characters, and achieve the effects of improving recognition rate, reducing labor costs and improving efficiency

Active Publication Date: 2020-05-08
RAJAX NETWORK &TECHNOLOGY (SHANGHAI) CO LTD
View PDF15 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method has a low degree of recognition of deformed characters, and it is difficult to recognize character strings that users deliberately input through deformed characters.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, equipment and readable storage medium for identifying abnormal character strings
  • Method, device, equipment and readable storage medium for identifying abnormal character strings
  • Method, device, equipment and readable storage medium for identifying abnormal character strings

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031]As mentioned above, the current Internet business data is huge, if only relying on manual identification, not only the cost is high, but also the processing speed is slow. However, the method of matching abnormal characters through regular expressions has a low degree of recognition of deformed characters, and cannot accurately identify all abnormal characters. For example, a user registers a new user with another mobile phone number on an application service platform to enjoy discounts, and then informs the service party on the service platform of the real mobile phone number in the form of a combination of typos, letters, and disordered symbols in the remarks; another example , Advertise your own store in product reviews, leave your personal contact information with a combination of typos, letters, and disordered symbols. Therefore, neither manual recognition nor regular expression matching recognition can meet the data processing requirements of the massive services o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an abnormal character string identification method and device, equipment and a readable storage medium. The method comprises the following steps: acquiring an original character string, and respectively converting the original character string into a corresponding picture and phonetic symbol string; respectively inputting the original character string, the picture and the phonetic symbol string into a first deep learning model, a second deep learning model and a third deep learning model to obtain a corresponding first deep learning feature vector, a corresponding second deep learning feature vector and a corresponding third deep learning feature vector; determining a standardized character string corresponding to the original character string based on the first deep learning feature vector, the second deep learning feature vector and the third deep learning feature vector; and matching the standardized character string with a character string in a preset abnormal database, identifying an abnormal character string in the standardized character string, and outputting an identification result. According to the scheme, the abnormal character string is automatically recognized, the recognition efficiency is improved, and the precision and accuracy are improved.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of data processing, and in particular, to a method, device, device, and readable storage medium for identifying abnormal character strings. Background technique [0002] Nowadays, people cannot live without the Internet. Users will generate text content in shopping, chatting, studying, working and other scenarios. Often users will input abnormal content subjectively or unintentionally during the writing process. In order to reduce the dissemination of these abnormal contents, it is necessary to identify the content input by the user. At present, two methods are generally used: 1. manual identification; 2. regular expression matching identification. [0003] However, with the rapid development of science and technology, the frequency of users using the Internet has increased sharply, and it takes more manpower and time to identify abnormal content. If only relying on manual identific...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/903G06F16/906
CPCG06F16/90344G06F16/906
Inventor 陆青姜敏华
Owner RAJAX NETWORK &TECHNOLOGY (SHANGHAI) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products