Mail type judging method, device and system, and behavior model building device

A mail type judging technology, applied in the Internet field, that addresses the low accuracy, slow training, and slow mail judgment of content-based filtering and achieves fast judgment.

Inactive Publication Date: 2007-11-28
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0007] The content of the email body is large and varied, which leads to slow training and an incomplete training set, and in turn may lead to low filtering accuracy. Because the content and format of the email body are uncertain, email judgment may also be slow. Further, non-Chinese emails and some other emails are represented as a zero vector, and such an email is treated as a normal email; when spam is also represented as a zero vector it cannot be filtered, which further reduces the filtering accuracy.



Examples


Embodiment 1

[0040] As shown in Figure 1, Embodiment 1 of the behavior model building device provided by the present invention includes:

[0041] A mail header reading unit 101, configured to read the mail headers of mails whose classification is known;

[0042] The mail header is introduced first. The mail header consists of signaling exchanged between mail servers according to the Simple Mail Transfer Protocol (SMTP) during mail delivery. Generally, this content is invisible to both the mail writer and the mail receiver. Because it is transmitted according to SMTP, the content of the mail header is formatted to ensure normal delivery of the mail, and some of its fields are preset as the protocol requires. A mail whose classification is known is one whose type is known, that is, whether the mail is a normal mail or a spam mai...
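As a minimal sketch of what reading a mail header might look like, the following Python snippet uses the standard-library `email` package to separate the header fields of a raw message from its body. The sample message, its addresses, and the helper name `read_mail_header` are illustrative assumptions and are not taken from the patent.

```python
from email import message_from_string

# Hypothetical raw message; hosts and addresses are made up for illustration.
RAW_MAIL = (
    "Received: from mail.example.org by mx.example.com with SMTP\n"
    "From: Alice <alice@example.org>\n"
    "To: bob@example.com\n"
    "Subject: Meeting notes\n"
    "\n"
    "Body text goes here.\n"
)

def read_mail_header(raw):
    """Return the header fields of a raw message as (name, value) pairs."""
    msg = message_from_string(raw)
    # items() yields only header fields, in order; the body is not included.
    return msg.items()

header = read_mail_header(RAW_MAIL)
field_names = [name for name, _ in header]
```

Here `field_names` lists the header fields in the order they appear, which is the kind of formatted input a header-based model could work from.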

Embodiment 3

[0082] As shown in Figure 3, the third embodiment of the method for judging the mail type provided by the present invention includes:

[0083] Step 301, read the mail header and mail body of a mail whose classification is unknown;

[0084] Step 302, extracting field 1 meeting preset condition 1 from the email header, and extracting field 2 meeting preset condition 2 from the email body;

[0085] The mail body is processed in a similar way to the mail header, except that the fields selected from the mail body follow the prior art, that is, the corresponding keywords are selected from the mail body;

[0086] Step 303, vectorize the combination of field 1 and its expression form to obtain a feature vector 1 with a preset number 1, and vectorize the combination of field 2 and its expression form to obtain a feature vector 2 with a preset number 2;

[0087] The expression forms of a keyword include: the keyword is present, the keyword is absent, the number of times this key...
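The vectorization in Step 303 can be sketched as follows, assuming a fixed, preset keyword list whose length plays the role of the preset number. The keyword list, the tokenizing regular expression, and the function name `vectorize_keywords` are hypothetical; the sketch covers only the presence/absence and occurrence-count expression forms named above.

```python
import re

# Hypothetical preset keyword list; its length fixes the vector length.
BODY_KEYWORDS = ["free", "winner", "invoice"]

def vectorize_keywords(text, keywords, use_counts=False):
    """Encode each keyword as presence (1/0) or as its occurrence count."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    vector = []
    for kw in keywords:
        count = words.count(kw)
        vector.append(count if use_counts else int(count > 0))
    return vector

body = "You are a WINNER! Claim your free prize, free of charge."
presence_vector = vectorize_keywords(body, BODY_KEYWORDS)
count_vector = vectorize_keywords(body, BODY_KEYWORDS, use_counts=True)
```

A text containing none of the preset keywords yields an all-zero vector, which is the situation paragraph [0007] describes for body-only filtering.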


Abstract

The invention discloses a mail type judging method, device and system, and a model building device, in the field of Internet technology. The mail type judging method comprises the following steps: reading the mail header of a mail whose classification is unknown; extracting field 1 satisfying preset condition 1 from the mail header; vectorizing the combination of field 1 and its expression form to obtain feature vector 1 with a preset number 1; taking feature vector 1 as input and the model data saved in a pre-built model as the basis, calculating with a preset prediction algorithm to obtain a calculation result; and judging the type of the mail whose classification is unknown according to the calculation result. The invention also provides a device and a system corresponding to the method, which improve the speed of mail type judgment.
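The final judging step can be illustrated with a small sketch. The text here does not name the prediction algorithm, so the example substitutes a nearest-prototype rule purely as a stand-in; the model data, the labels, and the function names are all hypothetical.

```python
# Hypothetical model data: one prototype vector per mail type.
MODEL_DATA = {
    "normal": [0.0, 1.0, 0.0],
    "spam": [1.0, 0.0, 1.0],
}

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def judge_mail_type(feature_vector, model_data):
    """Return the label whose prototype is closest to the feature vector."""
    return min(
        model_data,
        key=lambda label: squared_distance(feature_vector, model_data[label]),
    )

result = judge_mail_type([1, 0, 0], MODEL_DATA)
```

Any classifier that maps a fixed-length feature vector to a label could take the place of `judge_mail_type`; the point is only that the feature vector is the input and the stored model data is the basis for the calculation.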

Description

Technical field

[0001] The invention relates to Internet technology, in particular to a mail type judging method, device and system, and a behavior model building device.

Background technique

[0002] E-mail, as the largest application of the Internet, has always been favored by Internet users. However, in recent years, the spam problem has become increasingly serious. The basic feature of spam is that it is "uninvited", and most spam has commercial or other publicity purposes. At the same time, whether a mail is spam depends heavily on its receiver, and different users may judge the same mail differently. With the advancement of technology, spam filtering is changing from filtering based solely on static rules and statistical classification to behavior-based filtering.

[0003] The existing mainstream spam filtering methods are all based on email content. One spam filtering method is based on Learning Vector Quantization (...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L12/58, G06F17/30, G06Q10/00
CPC: G06Q10/107, H04L12/585, H04L51/212
Inventor: 刘竟, 刘峤, 秦志光, 郑志彬
Owner: HUAWEI TECH CO LTD