Cloud cluster extraction method of network information

A technology of network information and extraction method, which is applied in the field of cloud clustering extraction of network information content, can solve the problems of inability to obtain illegal content sources, difficult data processing, etc., and achieve the effect of good feature extraction performance and high feature extraction speed.

Inactive Publication Date: 2013-03-27
BEIJING NORMAL UNIV ZHUHAI
View PDF1 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The research work in the field of network content monitoring in our country is in its infancy. Some common network content monitoring software currently appearing are mostly passive working modes, usually running on the gateway. When illegal words are found, the web pages containing the words will be blocked. This method generally controls the network card, grabs network data packets, and analyzes the content of t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cloud cluster extraction method of network information
  • Cloud cluster extraction method of network information
  • Cloud cluster extraction method of network information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Please refer to Figure 1 to Figure 5 , the present invention provides a cloud clustering extraction method for network information, combining cloud computing technology and artificial intelligence technology, actively monitors and warns network content, actively collects website content, obtains hot web page information in the website, and obtains hot web page information Contains the content and analyzes the content. By discarding irrelevant features and redundant features, the dimensionality is effectively reduced, the calculation time is reduced, and the system work efficiency is improved. The content of network information is varied, and it is very difficult to extract harmful information that endangers the country and the public society. The invention proposes to focus on cloud clustering method to extract the characteristics of harmful information, and then use GP (genetic programming) prediction algorithm to analyze harmful information, so as to improve the hit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a cloud cluster extraction method of network information. The cloud cluster extraction method comprises the following steps of: performing file writing, data storage and access to network information by a distributed file system; performing seamless combination on calculation models Map/Reduce of SOM (Self-Organizing Maps), a Kmeans clustering algorithm and cloud calculation to obtain a Map/Reduce SOM and Kmeans clustering algorithm based on the cloud calculation; performing control on the whole Map/Reduce by JobTracker, and distributing Map tasks or Reduce tasks by free TaskTracker; executing an instruction sent from the JobTracker and processing movement of data between Map and Reduce phases at the same time by the TaskTracker; periodically reporting finished work and state updating by each TaskTracker node; and if one TaskTracker node keeps silent for longer than a pre-set time interval, recording that the state of the node is dead and sending data distributed to the node to the other nodes by the JobTracker. The cloud cluster extraction method of the network information has good characteristic extracting performance and overcomes the disadvantage of too strong subjectivity in the existing network flow time sequence analyzing and predicating algorithm.

Description

technical field [0001] The invention relates to cloud computing and data mining technology, in particular to a cloud clustering extraction method for network information content. Background technique [0002] With the exponential growth of the number of websites and the number of webpages existing on the Internet, and the extensive development of e-government and e-commerce, these have greatly promoted the country's informatization construction and brought more and more benefits to people's study, work, and life. More and more convenience. However, at the same time, the Internet has also become a place for the dissemination of pornographic, cult, reactionary, Taiwan independence, and violent information. Therefore, how to prevent the dissemination and browsing of illegal information on the Internet, supervise and control the content of online information, protect the security of network information, effectively prevent the illegal dissemination of bad information in our cou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30H04L29/08
Inventor 吕威
Owner BEIJING NORMAL UNIV ZHUHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products