Method for identifying irregular spam short message on the basis of Chinese word segmentation

A Chinese word segmentation and spam short message technology, which is applied in telephone communication, branch equipment, special data processing applications, etc., can solve the problems of reducing the accuracy of spam short message recognition, improve the precision rate, improve the recall rate, and avoid missed judgments. Effect

Inactive Publication Date: 2014-06-18
SHANGHAI LIANGJIANG COMM SYST
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although this method identifies irregular spam text messages that match keywords to a certain extent, it also causes some normal text messages that do not contain "receipts" to be identified as spam text messages, reducing the accuracy of spam text message identification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying irregular spam short message on the basis of Chinese word segmentation
  • Method for identifying irregular spam short message on the basis of Chinese word segmentation
  • Method for identifying irregular spam short message on the basis of Chinese word segmentation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The present invention will be further described below in conjunction with accompanying drawing.

[0025] In order to facilitate those skilled in the art to understand and realize the present invention, the following short message is taken as an example to describe the embodiment of the present invention:

[0026]

[0027]

[0028] As above, in order to avoid keyword identification, spam text messages are arranged in an irregular manner. When keywords such as "invoice" or "provide" are usually set, the normal arrangement of short messages can match the keywords, but the irregular arrangement cannot match the keywords according to the normal arrangement.

[0029] see figure 1 , the present invention's method for identifying irregular spam text messages based on Chinese word segmentation comprises the following steps:

[0030] Step S1, receiving a text message and reading the content of the text message; taking the above text message as an example:

[0031]

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying an irregular spam short message on the basis of Chinese word segmentation. Chinese word segmentation is performed on the same short message in a way of normal transverse reading firstly according to contents of the short message, and then weight is calculated according to number of words of a word segmentation result. Then according to the characteristic that the irregular spam short message is necessarily constrained in number of characters in each row, the range of the contents of the short message is judged, vertical arrangement of the characters in the range of the contents of the irregularly arranged short message is converted into transverse arrangement, then Chinese word segmentation is performed, and weight is calculated according to the number of words of the total word segmentation result. Then according to comparison of two times of weight, whether the short message is the normally arranged short message or the irregularly arranged short message is judged. Then according to the type of arrangement, content analysis matching keywords are adopted to identify whether the short message is the spam short message so that leakage judgment of the spam short message is avoided, all-checking rate and accurate-checking rate of the spam short message are enhanced.

Description

technical field [0001] The invention relates to a method for identifying junk short messages, in particular to a method for identifying irregular junk short messages based on Chinese word segmentation. Background technique [0002] At present, short message service is a basic service of mobile communication network. While providing convenient message communication service for users, it has also become a channel for sending reactionary, pornographic and fraudulent short messages. In the field of spam SMS management, there is a patent with application number: 200710036831.4 "A SMS Purification System Based on Signaling Processing Technology". The message detection processing device MPM is composed of a service management center CSM. MPM analyzes and processes the passing SMS messages, realizes gating and intercepting processing of SMS messages according to business rules and black and white lists, and transmits relevant messages to CSM, which performs frequency statistics, bu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W4/14H04M1/725G06F17/27
Inventor 肖克华
Owner SHANGHAI LIANGJIANG COMM SYST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products