Information processing method and device

An information processing method and preprocessing technology, applied in the fields of electrical digital data processing, special data processing applications, and instruments, that addresses the high misjudgment rate, high missed judgment rate, low filtering ratio, and increased labor costs of existing approaches. It reduces both the misjudgment rate and the missed judgment rate and improves accuracy.

Inactive Publication Date: 2016-07-20
CHINA MOBILE COMM GRP CO LTD

AI Technical Summary

Problems solved by technology

However, when machines use these methods to identify spam text messages automatically, they suffer from a high misjudgment rate, a high missed judgment rate, and a low filtering ratio. As a result, these methods can only serve as auxiliary solutions and cannot completely replace manual review.
In other words, these methods can only be used in the discovery stage to flag suspected spam messages, which are then reported for manual review, and this increases labor costs.

Examples

Embodiment 1

[0083] The information processing method provided by this embodiment, as shown in Figure 1, includes the following steps:

[0084] Step 101: Generate the fingerprint of the short message to be calibrated according to the text content of the short message to be calibrated;

[0085] Here, before generating the fingerprint of the short message to be calibrated, the method may also include:

[0086] Performing preprocessing and denoising on the text content;

[0087] Correspondingly, the fingerprint of the short message to be calibrated is generated according to the text content after preprocessing and denoising.

[0088] The preprocessing and denoising of the text content may specifically include:

[0089] Preprocessing and denoising of the SMS text content, including conventional operations such as word segmentation, stop-word removal, removal of special character strings and special symbols, traditional and simplifi...
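The preprocessing and fingerprinting described in Step 101 could be sketched roughly as follows. This is only an illustration under stated assumptions: the excerpt does not specify the fingerprint algorithm, so a SHA-1 hash over the cleaned tokens stands in for it; whitespace splitting stands in for real word segmentation; and `STOP_WORDS`, `preprocess`, and `fingerprint` are hypothetical names that do not come from the patent.

```python
import hashlib
import re
import unicodedata
from typing import List

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "the", "to", "of"}


def preprocess(text: str) -> List[str]:
    """Denoise SMS text: normalize, strip symbols, tokenize, drop stop words."""
    # Fold full-width/half-width variants and case differences.
    text = unicodedata.normalize("NFKC", text).lower()
    # Replace special symbols (anything that is not a word char or space).
    text = re.sub(r"[^\w\s]", " ", text)
    # Assumption: whitespace splitting stands in for word segmentation.
    tokens = text.split()
    return [t for t in tokens if t not in STOP_WORDS]


def fingerprint(text: str) -> str:
    """Assumed fingerprint: SHA-1 hex digest of the cleaned token sequence."""
    cleaned = " ".join(preprocess(text))
    return hashlib.sha1(cleaned.encode("utf-8")).hexdigest()
```

With this sketch, two messages that differ only in case, punctuation, or stop words, such as `"WIN a FREE prize!!!"` and `"win free prize"`, yield the same fingerprint, which is the point of denoising before fingerprint generation.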

Embodiment 2

[0122] In this embodiment, on the basis of the first embodiment, the process of marking normal short messages and spam short messages is described in detail.

[0123] Figure 2 is a schematic diagram of the overall framework and the main workflow of collaborative filtering based on the black-and-white dual-fingerprint library in this embodiment. As shown in Figure 2, the overall framework is divided into the following two parts:

[0124] The first part establishes a dual-fingerprint library of spam text messages and normal text messages according to the users' calibration results, and automatically checks for and identifies conflicts in the fingerprint library. It is realized by steps 201 to 204 of the main process;

[0125] The second part uses the black-and-white dual-fingerprint library to collaboratively compare and automatically calibrate each new short message. It is realized by steps 205 to 208 of the main process.

[0126] The main ...
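The two-part framework can be sketched roughly as below. This is a minimal illustration, not the patent's procedure: the details of steps 201 to 208 are not visible in this excerpt, so exact-match set membership is assumed for comparison, a conflict is assumed to mean a fingerprint present in both libraries, and `DualFingerprintLibrary`, `Label`, and the method names are hypothetical.

```python
from enum import Enum
from typing import Set


class Label(Enum):
    SPAM = "spam"
    NORMAL = "normal"
    CONFLICT = "conflict"   # assumed outcome: fingerprint is in both libraries
    UNKNOWN = "unknown"     # assumed outcome: fingerprint is in neither library


class DualFingerprintLibrary:
    """Black (spam) and white (normal) fingerprint libraries kept side by side."""

    def __init__(self) -> None:
        self.black: Set[str] = set()
        self.white: Set[str] = set()

    def add_user_calibration(self, fp: str, is_spam: bool) -> None:
        # Part 1: build the libraries from users' calibration results.
        (self.black if is_spam else self.white).add(fp)

    def conflicts(self) -> Set[str]:
        # Part 1: fingerprints that users calibrated both ways.
        return self.black & self.white

    def mark(self, fp: str) -> Label:
        # Part 2: compare a new message against both libraries together.
        in_black, in_white = fp in self.black, fp in self.white
        if in_black and in_white:
            return Label.CONFLICT
        if in_black:
            return Label.SPAM
        if in_white:
            return Label.NORMAL
        return Label.UNKNOWN
```

Consulting both libraries before deciding, rather than checking a blacklist alone, is what lets the sketch surface conflicting calibrations instead of silently labeling them as spam.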

Embodiment 3

[0194] In order to implement the methods of Embodiments 1 and 2, this embodiment provides an information processing device. As shown in Figure 6, the device includes a fingerprint generating unit 61, a comparison unit 62, and a marking unit 63, wherein:

[0195] The fingerprint generating unit 61 is configured to generate the fingerprint of the short message to be marked according to the text content of the short message to be marked;

[0196] The comparison unit 62 is configured to compare the fingerprint of the short message to be marked with the fingerprints in the black fingerprint library of spam text messages and in the white fingerprint library of normal text messages;

[0197] The marking unit 63 is configured to act according to the comparison result, provided by the comparison unit 62, against the fingerprints in the black fingerprint library of spam text messages and the comparison result against the fingerprints in the white fingerprint library of normal shor...
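The three-unit structure of the device might be modeled as in the sketch below. It is an assumption-laden illustration: the class names mirror units 61 to 63 but are hypothetical, the fingerprint is assumed to be a SHA-1 hash of whitespace-normalized lowercase text, comparison is assumed to be exact match, and the marking rule (including the `"undecided"` outcome) is an assumption because the source paragraph is truncated.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Set, Tuple


class FingerprintGeneratingUnit:
    """Unit 61: derives a fingerprint from message text (assumed SHA-1)."""

    def generate(self, text: str) -> str:
        normalized = " ".join(text.lower().split())
        return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


@dataclass
class ComparisonUnit:
    """Unit 62: checks a fingerprint against both libraries."""

    black: Set[str] = field(default_factory=set)
    white: Set[str] = field(default_factory=set)

    def compare(self, fp: str) -> Tuple[bool, bool]:
        return fp in self.black, fp in self.white


class MarkingUnit:
    """Unit 63: turns the two comparison results into a label (assumed rule)."""

    def mark(self, in_black: bool, in_white: bool) -> str:
        if in_black and not in_white:
            return "spam"
        if in_white and not in_black:
            return "normal"
        return "undecided"


@dataclass
class InformationProcessingDevice:
    generator: FingerprintGeneratingUnit
    comparer: ComparisonUnit
    marker: MarkingUnit

    def process(self, text: str) -> str:
        fp = self.generator.generate(text)
        return self.marker.mark(*self.comparer.compare(fp))
```

A device would be assembled by passing one instance of each unit to `InformationProcessingDevice` and calling `process` on each incoming message.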

Abstract

The invention discloses an information processing method. The method comprises the following steps: generating a fingerprint of a to-be-calibrated short message according to text content of the to-be-calibrated short message; simultaneously comparing the fingerprint of the to-be-calibrated short message with fingerprints in a junk short message black fingerprint library and a normal short message white fingerprint library; calibrating the to-be-calibrated short message as the junk short message or the normal short message according to the comparison result of the fingerprint of the to-be-calibrated short message and the fingerprint in the junk short message black fingerprint library and the comparison result of the fingerprint of the to-be-calibrated short message and the fingerprint in the normal short message white fingerprint library. The invention further discloses an information processing device.

Description

Technical field

[0001] The invention relates to the field of data services in wireless communication, and in particular to an information processing method and device.

Background technique

[0002] With the growth in the number of telecom users and the rapid development of Internet instant messaging and social networking applications, all kinds of information generated in the form of short texts are rapidly accumulating and spreading. Spam and other harmful information involving illegal content, fraud, pornography, advertisements, harassment, and the like has become a headache for users and operators.

[0003] At present, the identification and filtering of spam SMS is strongly influenced by the identification and filtering techniques developed for spam, and mainly includes: the black-and-white list method, the user behavior rule method, the SMS body keyword rule method, and the SMS text content mining and modeling method. However, these methods have problems such as high misjudgment rate, high missed judgment rate, and low...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06K9/62
Inventor: 邓超, 张峰, 粟栗, 冉鹏
Owner: CHINA MOBILE COMM GRP CO LTD