Massive email analyzing method and system based on relational graph

An email analysis method, applied in the field of network information security, that addresses the inability to process massive email data quickly and effectively, and achieves accurate real-time analysis, improved processing speed, and high scalability.

Inactive Publication Date: 2013-05-15
INST OF INFORMATION ENG CAS
3 Cites, 21 Cited by

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a mass email analysis method and system based on a relationship graph, which ...


Examples

Embodiment 1

[0068] Association analysis step: introduce the IP address geographic information database and the email user identity information database, associate both databases with the email table, perform association analysis based on the generated relationship graph, and then display the association analysis process and results on the graph. Embodiment 1 is based on the above method and consists of three implementation parts:

[0069] 1. Parallel parsing and processing of massive emails

[0070] Figure 2 shows a flowchart of the parallel parsing and processing of massive emails. The specific implementation steps are as follows:

[0071] 1) Obtain configuration information, such as parsing tasks and source data access paths, from the massive email database configuration tables; use the configuration table to build a mutual exclusion mechanism; and parse the email source data in parallel, including detecting whether t...
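The idea in step 1), claiming parsing tasks through a shared table so that parallel workers never parse the same source twice, might be sketched as follows. This is only an illustration under stated assumptions: the in-memory `config_table` dictionary, the `SOURCES` sample messages, and the function names are hypothetical stand-ins, not the patent's actual configuration tables or implementation. Header and body extraction uses Python's standard `email` package.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from email import message_from_string
from email.policy import default

# Hypothetical raw sources keyed by task id (stand-in for source data paths).
SOURCES = {
    "task-1": "From: alice@example.com\nTo: bob@example.com\nSubject: Hi\n\nHello Bob.\n",
    "task-2": "From: bob@example.com\nTo: carol@example.com\nSubject: Re: Hi\n\nHello Carol.\n",
}

# Hypothetical stand-in for the configuration table: task id -> state.
config_table = {task_id: "pending" for task_id in SOURCES}
table_lock = threading.Lock()


def claim_task(task_id):
    """Atomically move a task from 'pending' to 'running'; False if already taken."""
    with table_lock:
        if config_table[task_id] != "pending":
            return False
        config_table[task_id] = "running"
        return True


def parse_email(raw):
    """Extract header fields and the plain-text body from one raw message."""
    msg = message_from_string(raw, policy=default)
    body = msg.get_body(preferencelist=("plain",))
    return {
        "from": str(msg["From"]),
        "to": str(msg["To"]),
        "subject": str(msg["Subject"]),
        "text": body.get_content().strip() if body is not None else "",
    }


def worker(task_id):
    """Parse one task if this worker wins the claim; otherwise skip it."""
    if not claim_task(task_id):
        return None
    record = parse_email(SOURCES[task_id])
    with table_lock:
        config_table[task_id] = "done"
    return record


def run(task_ids, max_workers=4):
    """Run the workers in parallel and collect the parsed records."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(worker, task_ids))
    return [r for r in results if r is not None]


# "task-1" is submitted twice on purpose; only one worker may claim it.
email_table = run(["task-1", "task-2", "task-1"])
```

In a real deployment the claim would more plausibly be an atomic update on a database row rather than a lock around a dictionary; the lock is used here only to keep the sketch self-contained.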

Embodiment 2

[0098] Embodiment 2: Based on the system principle of Embodiment 1, a software system, the "mass mail intelligent analysis and management system", was designed, deployed, and implemented. The system uses four high-performance servers: two host the parallel parsing module, one hosts the attachment storage detection module, and the last hosts the relationship graph generation module and the association analysis module. In actual operation, the system parses more than 1 million emails per day; supports attachment storage and fast retrieval for more than half a year of data, with room to expand; and supports association analysis over hundreds of millions of emails, with a single page able to display and edit graphs of more than 100 nodes.
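As a rough illustration of what the association analysis module does (joining the email table with an IP address geographic information database and an email user identity information database), a minimal sketch follows. The table layouts, field names, sample addresses, and region labels are all assumptions made for the example; the source does not specify them.

```python
import ipaddress

# Hypothetical email table rows (field names are assumptions).
email_table = [
    {"sender": "alice@example.com", "recipient": "bob@example.com", "source_ip": "192.0.2.10"},
    {"sender": "spam@example.net", "recipient": "bob@example.com", "source_ip": "198.51.100.7"},
]

# Hypothetical IP geographic information database: network -> region label.
ip_geo_db = [
    (ipaddress.ip_network("192.0.2.0/24"), "Region A"),
    (ipaddress.ip_network("198.51.100.0/24"), "Region B"),
]

# Hypothetical user identity information database: address -> identity.
identity_db = {
    "alice@example.com": "Alice (staff)",
    "bob@example.com": "Bob (staff)",
}


def locate(ip):
    """Return the region whose network contains the address, or 'unknown'."""
    addr = ipaddress.ip_address(ip)
    for network, region in ip_geo_db:
        if addr in network:
            return region
    return "unknown"


def associate(emails):
    """Annotate each email row with geographic and identity information."""
    annotated_rows = []
    for row in emails:
        annotated_rows.append({
            **row,
            "source_region": locate(row["source_ip"]),
            "sender_identity": identity_db.get(row["sender"], "unknown"),
            "recipient_identity": identity_db.get(row["recipient"], "unknown"),
        })
    return annotated_rows


annotated = associate(email_table)
for row in annotated:
    print(row["sender"], "->", row["recipient"], "|", row["source_region"], "|", row["sender_identity"])
```

The annotated rows carry the attributes that would then be attached to nodes and edges when the relationship graph is displayed. At the data volumes the text mentions, the linear scan in `locate` would need to be replaced by an indexed lookup.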


Abstract

The invention relates to a massive email analysis method and system based on a relational graph. The method comprises the following steps: parsing email source data in parallel, extracting the header information and text information of each email, and storing them in an email table; storing the summary information of the parsed attachments in an email attachment table with a set structure, and performing detection; constructing an email relational chart from the parsed email data, and generating a single-point or multi-point relational graph from the email relational chart according to user needs; and introducing an Internet Protocol (IP) address geographic information database and an email user identity information database, performing association analysis on the email table, and displaying the association information on the generated relational graph. The system correspondingly comprises a parallel parsing module, an attachment storage detection module, a relational graph generation module, and an association analysis module. The method and system can effectively address the analysis and processing of massive email data and the tracing and locating of spam in email networks.
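The graph generation step can be illustrated with a short sketch. It assumes, as one plausible reading, that a "single-point" graph is the neighbourhood of one email address and a "multi-point" graph is the combined neighbourhood of several addresses; the source does not define these terms further, and the function names and sample data below are hypothetical.

```python
from collections import defaultdict


def build_relation_graph(emails):
    """Build an undirected weighted graph; weight = emails exchanged by a pair."""
    graph = defaultdict(lambda: defaultdict(int))
    for sender, recipient in emails:
        graph[sender][recipient] += 1
        graph[recipient][sender] += 1
    return graph


def subgraph_around(graph, seeds, hops=1):
    """Return the nodes and edges reachable within `hops` of any seed address."""
    seen = set(seeds)
    frontier = set(seeds)
    edges = set()
    for _ in range(hops):
        next_frontier = set()
        for node in frontier:
            for neighbour in graph.get(node, {}):
                # Store each undirected edge once, with endpoints sorted.
                edges.add(tuple(sorted((node, neighbour))))
                if neighbour not in seen:
                    seen.add(neighbour)
                    next_frontier.add(neighbour)
        frontier = next_frontier
    return seen, edges


# Hypothetical (sender, recipient) pairs taken from a parsed email table.
emails = [
    ("a@example.com", "b@example.com"),
    ("a@example.com", "c@example.com"),
    ("b@example.com", "c@example.com"),
    ("d@example.com", "e@example.com"),
    ("b@example.com", "a@example.com"),
]
graph = build_relation_graph(emails)

# Single-point graph: one seed address.
single_nodes, single_edges = subgraph_around(graph, ["d@example.com"])

# Multi-point graph: several seed addresses combined.
multi_nodes, multi_edges = subgraph_around(graph, ["a@example.com", "d@example.com"])
```

Limiting the expansion by hop count is one way to keep a displayed graph to a manageable number of nodes; the choice of one hop here is arbitrary.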

Description

Technical field

[0001] The invention relates to the technical field of network information security, specifically to email detection and analysis technology, and in particular to a massive email analysis method and system based on a relational graph.

Background

[0002] E-mail (short for "electronic mail") is a communication tool for exchanging information through an electronic communication system. Today it is closely associated with the Internet and has become one of the most popular Internet application services. With the rapid development of the Internet and the continuous growth in the number of Internet users, the number of e-mail service providers and users keeps increasing, and e-mail functions are diversifying. Popular Internet application services such as instant messaging, social networking, and Weibo are closely tied to e-mail. For example, users can use e-mail to verify accounts or retrieve passwords of other application se...


Application Information

IPC(8): G06Q10/10, G06F17/30
Inventor: 李书豪, 云晓春, 张永峥, 郝志宇, 霍永亮
Owner: INST OF INFORMATION ENG CAS