Chinese spam recognition method based on chaos particle swarm optimization CNN

A technology of chaotic particle swarms and spam, applied in neural learning methods, biological neural network models, office automation, etc., can solve problems such as difficulty in establishing classifier models and practical application problems, and improve the accuracy of classification and recognition. The effect of robustness and stability

Active Publication Date: 2018-10-12
CHONGQING UNIV OF POSTS & TELECOMM
View PDF11 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the large number and variety of Chinese spam, and the diversity of spam content forms, traditional machine learning methods are limited when processing a large number of data samples, and it is difficult to establish an efficient classifier model. At the same time, in practice Also hampered by application issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese spam recognition method based on chaos particle swarm optimization CNN
  • Chinese spam recognition method based on chaos particle swarm optimization CNN
  • Chinese spam recognition method based on chaos particle swarm optimization CNN

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.

[0038] The technical scheme that the present invention solves the problems of the technologies described above is:

[0039] As shown in the figure, the Chinese spam identification method based on the chaotic particle swarm optimization CNN network provided by the present embodiment includes the following steps:

[0040] Step 1: Preprocessing the collected Chinese spam corpus. The email corpus is mainly stored in the form of text. Since the text content contains a lot of noise, such as stop words, particles, symbols, and characters, which affect the representation of text vectors and the recognition of classification models, it is necessary to correlate the data set preprocessing to get clean email text. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention protects a Chinese spam recognition method based on a chaos particle swarm optimization CNN (convolutional neural network). According to the method, first, a word segmentation device isused to perform word segmentation, stop word removal and other preprocessing on a Chinese spam dataset; second, a Word2vec model is adopted to acquire word vectors, and text vectors of Chinese spam are obtained by solving the sum and the mean value of the word vectors; third, the chaos thought is introduced into a particle swarm optimization algorithm to train network parameters of the CNN; fourth, a Chinese spam classification model is established based on the chaos particle swarm optimization CNN; and last, a test set is adopted to realize spam classification through the established model, and the classification correct rate is calculated. Through the method, quick convergence can be realized through the model established according to optimization parameters of the chaos particle swarm optimization algorithm, good robustness and stability are achieved, and meanwhile the classification recognition rate of Chinese spam can be increased.

Description

technical field [0001] The invention belongs to the technical field of Chinese spam classification, in particular to a method for identifying Chinese spam based on a chaotic particle swarm optimized CNN network. Background technique [0002] The proliferation of Chinese spam has seriously endangered the development of my country's Internet information technology. Therefore, establishing an effective Chinese spam classification and identification model to distinguish spam from normal mail will help reduce the waste of network resources and storage space and maintain the Internet information environment. stable development. [0003] Commonly used spam classification methods include neural network (Neural network, NN), Bayesian theory (Bayes), decision tree and support vector machines (Support vector machines, SVM) and so on. Due to the large number and variety of Chinese spam, and the diversity of spam content forms, traditional machine learning methods are limited when proces...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30G06N3/00G06N3/04G06N3/08G06Q10/10
CPCG06N3/006G06N3/08G06Q10/107G06F40/279G06N3/045
Inventor 唐贤伦万亚利熊德意李佳歆林文星魏畅昌泉伍亚明
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products