Unlock instant, AI-driven research and patent intelligence for your innovation.

Secondary equipment fault short text data classification method based on convolutional neural network

A convolutional neural network and secondary device technology, applied in the field of Chinese natural language processing, can solve the problems of different subjectivity and experience, short text information, and there is no disclosure of methods for classifying short text information of secondary devices, etc. The effect of improving accuracy and generalization ability, reducing text vector dimension, and good feature screening ability

Pending Publication Date: 2020-10-13
ELECTRIC POWER SCI RES INST OF STATE GRID XINJIANG ELECTRIC POWER +5
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] During the operation of secondary equipment, a lot of short-text data of faults and defects has been accumulated. These data are often recorded manually by inspection personnel to complete the classification of defects. However, due to the difference in subjectivity and experience of inspection personnel, it is difficult to Accurate classification is achieved, and due to the large amount of fault data, a large amount of manpower is required, and the efficiency is difficult to guarantee
[0004] At present, for the classification of short texts, there have been researches abroad on emotional classification of hotel reviews through natural language processing. Text classification does not work, English natural language processing is difficult to use in Chinese, and due to different industries, there are many proper nouns in the field of secondary equipment text classification, lack of research on the improvement of classification models, mostly based on traditional machine learning fields
In addition, due to manual recording by recorders, there are many colloquial records and short text information. At present, there is no public method for classifying short text information of secondary equipment.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Secondary equipment fault short text data classification method based on convolutional neural network
  • Secondary equipment fault short text data classification method based on convolutional neural network
  • Secondary equipment fault short text data classification method based on convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to describe the technical solution disclosed in the present invention in detail, further elaboration will be made below in conjunction with the accompanying drawings and specific embodiments.

[0044] Such as figure 1 As shown, a kind of secondary equipment failure short text data classification method based on convolutional neural network provided by the present invention, the steps are as follows:

[0045] S1: Determine the data set;

[0046] Collect fault short text data generated during the operation of secondary equipment, divide it into "serious defect", "critical defect" and "general defect" according to the requirements of relevant guidelines, and divide the text data set into : training set, validation set, test set.

[0047] S2: text preprocessing;

[0048] Construct a dictionary of stop words, filter and remove the noise in the short text information of secondary equipment faults, and retain words with specific practical meanings such as nouns, ve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a secondary equipment fault short text data classification method based on a convolutional neural network, and the method comprises the steps: firstly collecting text information data of a secondary equipment fault, determining a data set, and obtaining a training sample set; preprocessing the secondary equipment fault short text data, and performing one-to-one mapping on the word data and the word vectors to obtain text vector data; training the convolutional neural network model by using the training sample to obtain a trained convolutional neural network model; and according to the verification set data, inputting test input data into the trained convolutional neural network model, and taking a model output value as a classification result of the to-be-classifiedsecondary equipment fault short text information. According to the secondary equipment fault short text data classification method based on the convolutional neural network, the prediction precisionis improved and the generalization ability of the model is enhanced by using the good feature screening analysis ability of the convolutional neural network model.

Description

technical field [0001] The invention belongs to Chinese natural language processing technology, and in particular relates to a method for classifying short text data of secondary equipment faults based on a convolutional neural network. Background technique [0002] During the construction and operation of the smart grid, electric power big data has exploded. According to the white paper on the development of China's electric power big data compiled by the China Electrical Engineering Society Informatization Committee in 2013, these data can be roughly divided into two categories. One is based on Output power, equipment and its environment temperature and humidity, light intensity of optical modules, etc. represent time-series structured data, and the other type is semi-structured and unstructured data represented by text, images, audio, etc., which are difficult to express using relational databases. structured data. The structured data mining work has been relatively matu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/242G06N3/04G06N3/08G06Q50/06
CPCG06F16/353G06F40/242G06N3/08G06Q50/06G06N3/045
Inventor 王开科南东亮孙永辉吴杰于同伟卜强生庞福滨杨毅杨飞钱海赵启张路王利超卢盛阳
Owner ELECTRIC POWER SCI RES INST OF STATE GRID XINJIANG ELECTRIC POWER