Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Real-time rubbish mail filtering method and system used for transmission influx stage

A spam filtering and spam technology, applied in the field of real-time spam filtering, can solve problems such as slow speed and easy failure, and achieve the effect of maintaining dynamic stability, good memory overhead, and satisfying real-time mail filtering.

Inactive Publication Date: 2008-03-12
BEIJING VENUS INFORMATION TECH
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage of content filtering technology is that it is slow and requires users to continuously update the rule base or training set, etc. With the continuous improvement of spammers' technical level, this method is prone to failure

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time rubbish mail filtering method and system used for transmission influx stage
  • Real-time rubbish mail filtering method and system used for transmission influx stage
  • Real-time rubbish mail filtering method and system used for transmission influx stage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] FIG. 2 is a schematic diagram of the spam filtering system of the present invention. As shown in Figure 2, the system of the present invention includes a fingerprint collection device module, which uses an efficient hash algorithm to convert the original mail into a feature pattern (fingerprint collection), and uniquely characterizes the original mail with this;

[0044] A classifier module that classifies mail traffic by comparing similarities between patterns;

[0045] A statistical analysis device module, which automatically discovers spam through rule matching and statistical analysis of class description information;

[0046] A management device module, which establishes and maintains the description information of each mail class and the corresponding fingerprint collection in the pattern library and the mail library, and adopts the pattern elimination algorithm based on the class average distance to maintain the dynamic stability of the pattern library and the ma...

Embodiment 2

[0091] Embodiment 2: a kind of real-time spam filtering method that can be used in transmission collection stage, comprises the following steps;

[0092] (1) The filtering system firstly reassembles the email conversation, records such description information as the number of the email and the sending and receiving address; and extracts the text of the email. In order to reduce the memory overhead and ensure the usability of the system, an efficient feature pattern generation algorithm is used to convert the email body into a collection of several fingerprints (this set is called feature pattern in the system) to represent the original email.

[0093] (2) If the characteristic pattern of the current mail does not match the characteristic pattern in the memory, add a mail category in the memory, and save the mail description information and characteristic pattern in the memory; if it matches, only modify the description of the mail category in the memory Information (such as th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a method and a system for filtering a real-time junk mail. A mail text and mail description information are extracted first, an algorithm is generated by adopting an efficient characteristic pattern to switch the mail text to be a combination of a plurality of fingerprints, to represent an original mail. If the memory has no mail with the similar content with the current mail, a mail class is created in the memory, and the information about fingerprint and description of the current mail is preserved; only the description information in the mail class is updated, if the mail number contained in the mail class exceeds the predefined threshold value of the junk mail, the mail class is taken as a junk mail class, and the mails distributed to the class at later times are determined as junk mails by a filtering system. The present invention can effectively identify the junk mail with a small amount of fingerprints, the maintenance cost is low, and the user privacy is not violated; the present invention has perfect performance in the execution efficiency and in the memory expenses, the present invention is used for filtering the junk mails under a great flow capacity environment, the problem that the junk mains occupy the network band width is abated, thus the utilization ratio of the network resources is improved, and simultaneously the filtration pressure of the client side is reduced.

Description

technical field [0001] The invention relates to the field of e-mail sending and receiving, in particular to a real-time spam filtering method and system which can be used in the transmission and collection stage. Background technique [0002] E-mail system is a successful example of Internet application, which has brought profound changes to people's work and life. The easy-to-use, open and trust-based design concept makes it one of the most important communication methods today, and has become an indispensable and important part of people's work and life to some extent. The e-mail system transmits information through the SMTP (Simple Mail Transfer Protocol) protocol, and the letters transmitted in it are generally called e-mail (E-mail). [0003] According to the SMTP protocol, the mail transmission process can be divided into three stages: sending, transmission and collection, and receiving, as shown in Figure 1. [0004] Sending stage 1: The sending mail client may be a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04L12/58H04L29/06
CPCH04L51/212
Inventor 张尼姜誉方滨兴郭莉严立马骏
Owner BEIJING VENUS INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products