Method and device for detecting abnormal messages based on account number attributes

A detection method and message technology, applied in the field of computer networks, can solve problems such as inability to accurately match words, inability to correctly segment words, misjudgment, etc., and achieve the effect of improving flexibility

Active Publication Date: 2014-02-12
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (1) It is difficult to accurately obtain the spam sample library:
[0007] Generally, the spam sample library can only be discovered manually or by some other behavior detection algorithms. The discovery time is often delayed by several hours, and there are cases of misjudgment.
This has a great impact on the integrity and accuracy of the sample, which directly leads to a great deviation between the spam probability of each word and the real value
[0008] (2) Existing spam messages or advertisements do circumvention processing for word segmentation, resulting in the inability to correctly segment words:
As a result, the message becomes isolated words after word segmentation, which cannot be accurately matched with the words in the sample library

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for detecting abnormal messages based on account number attributes
  • Method and device for detecting abnormal messages based on account number attributes
  • Method and device for detecting abnormal messages based on account number attributes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] As mentioned in the background technology, since there is no clear implementation plan in the prior art for the instant update and maintenance of the spam sample database and the word segmentation interference for artificial settings, the existing abnormal message detection technology cannot cover most micro Therefore, it is impossible to realize instant and effective detection of abnormal messages.

[0038] In order to solve the above problems, the present invention provides a method for detecting abnormal messages, in which it is no longer necessary to establish and maintain normal samples or spam sample databases in advance, but directly determine the account number and spam sending normal messages according to the attribute characteristics of the published accounts The attribute abnormal probability of the account; at the same time, it no longer performs specific word segmentation for new incoming messages, but directly divides the message text, and calculates the ra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Disclosed is a method for detecting an abnormal message, comprising: dividing a text of a detected message into a plurality of text segments; obtaining one or more account attributes of each text segment, and determining a publication proportion parameter corresponding to the account attributes of each text segment; determining a first factor corresponding to the account attributes of each text segment according to the publication proportion parameter; determining a second factor of the detected message according to the first factor corresponding to the account attributes of each text segment; and determining according to the second factor of the detected message whether the detected message is an abnormal message. Through the combination of publication account attributes of messages with undifferentiated text segmentation and the use of Bayesian algorithm, batches of junk messages of a microblog account are effectively limited, and the flexibility of junk message processing is improved.

Description

technical field [0001] The invention relates to the field of computer networks, in particular to a method and system for detecting abnormal messages based on account attributes. Background technique [0002] Network Instant Messenger (IM, Instant Massager) tool has been developed to today, has been accepted by most of the network users, and has become one of the indispensable software tools for network users, not only used in leisure and entertainment at ordinary times, but also in the user's It is also widely used at work. In the IM software, the main realization is the one-on-one friend chat alone and the one-on-N group or discussion group message chat mode. With the continuous development of Internet applications, microblogging applications similar to Twitter (twitter) are also developing and growing. [0003] Weibo is the abbreviation of micro blog, which has high efficiency of information transmission and low threshold. Through Weibo, users can disseminate and transm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L12/26H04L12/58
CPCH04L67/306H04L51/212H04L51/52H04L51/42
Inventor 钟清华王金华
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products