
Method and system for text classification of short messages

A text classification technology for short messages, applied in text database clustering/classification, unstructured text data retrieval, and special data processing applications. It addresses the failure of existing methods to meet real-time requirements, and improves response speed as well as the speed and accuracy of spam message filtering.

Active Publication Date: 2017-09-08
CHINA UNITED NETWORK COMM GRP CO LTD
Cites: 1 | Cited by: 0

AI Technical Summary

Problems solved by technology

[0008] Because of these two characteristics, recognition and classification methods based on the content of text messages cannot meet the classification requirements of short messages, which demand high real-time performance.



Examples


Detailed Description of the Embodiments

[0039] To make the purpose, technical solution, and advantages of the present invention clearer, the embodiments of the present invention are described in detail below with reference to the accompanying drawings. It should be noted that, where no conflict arises, the embodiments in the present application and the features in the embodiments may be combined with each other arbitrarily.

[0040] The steps shown in the flowcharts of the figures may be performed in a computer system, such as a set of computer-executable instructions. Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0041] Stream computing is mainly used in functions such as real-time data processing and statistical learning. With the explosion of Internet big data, stream computing also adopts more advanced distributed computing methods to improve processing speed, ...


Abstract

The invention discloses a short message text classification method and system. The method comprises the following steps: dividing the short message classification process into different tasks in advance, and allocating the different tasks to different ports of a working node of a stream computing system; extracting the keywords of the short message text to be classified, determining the class library that needs to be updated, updating that class library, and computing the feature vector of the short message text to be classified; and obtaining, from the computed feature vector, the similarity between the short message text to be classified and the feature vectors of the members of the different class libraries, and determining the class of the short message text according to the obtained similarity. Under this scheme, after feature preprocessing of the short message text, the class library update and the feature vector computation are performed in parallel through different task ports of the working node of the stream computing system, which greatly increases the response speed of short message text processing and increases the speed and accuracy of spam message filtering.
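The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how keywords are extracted, how the class library to update is chosen, what the update consists of, or which similarity measure is used. Here keywords are lowercase word tokens, the library to update is the one sharing the most keywords with the message, the update adds keyword counts to that library's vocabulary, and similarity is cosine similarity over term-count vectors. A two-worker thread pool stands in for the separate task ports of a stream computing working node. All function names and the library layout are hypothetical.

```python
import math
import re
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def extract_keywords(text):
    # Assumption: keywords are simply lowercase alphanumeric tokens.
    return re.findall(r"[a-z0-9]+", text.lower())


def choose_library_to_update(keywords, libraries):
    # Assumption: update the library whose vocabulary overlaps most with the message.
    unique = set(keywords)
    return max(libraries, key=lambda name: len(unique & set(libraries[name]["vocab"])))


def update_library(library, keywords):
    # Hypothetical update: accumulate keyword counts in the library vocabulary.
    library["vocab"].update(keywords)


def feature_vector(keywords):
    # Term-count vector of the message.
    return Counter(keywords)


def cosine(a, b):
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify(text, libraries):
    keywords = extract_keywords(text)
    target = choose_library_to_update(keywords, libraries)
    # Run the library update and the feature vector computation in parallel,
    # mimicking two task ports on one working node.
    with ThreadPoolExecutor(max_workers=2) as pool:
        update_job = pool.submit(update_library, libraries[target], keywords)
        vector_job = pool.submit(feature_vector, keywords)
        update_job.result()
        vector = vector_job.result()
    # Score each class by its most similar member vector.
    scores = {
        name: max((cosine(vector, member) for member in lib["members"]), default=0.0)
        for name, lib in libraries.items()
    }
    return max(scores, key=scores.get), scores
```

Each class library in this sketch is a dictionary with a `vocab` counter and a list of `members` feature vectors; `classify(text, libraries)` returns the best-scoring class name together with the per-class similarity scores.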

Description

Technical field

[0001] The invention relates to short message text processing technology, and in particular to a short message text classification method and system.

Background

[0002] In the era of Internet big data, real-time processing and analysis of user behavior has become more important. Taking SMS text processing as an example, the flood of spam messages, such as fraudulent SMS, advertising promotion, and reactionary SMS, has caused great harm to users. Operators therefore need to filter spam messages by identifying the content of SMS messages. Because short messages are time-sensitive, their processing and distribution must be completed within a short period of time, so a short message processing system must have high real-time performance.

[0003] At present, there are mainly two text classification methods for short messages: one is a classification method based on "keywords" plus matching rules, and the other is...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/35
Inventor: 李浩, 罗云彬, 王志军, 王伟华
Owner: CHINA UNITED NETWORK COMM GRP CO LTD