Abnormal event classification method and system based on text processing

A technology for abnormal events and text processing, applied in text database clustering/classification, data processing applications, electrical digital data processing, etc. rate effect

Pending Publication Date: 2021-04-09
JIANGSU DAWN INFORMATION TECH CO LTD
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Among them, Naive Bayes is a simple model that relies on the bag-of-words model. It can only count macroscopic semantic features, and cannot obtain word position information and the relationship between words. It is a low-level model with low accuracy. TextCNN, on the other hand, is a convolutional neural network that uses text as input. It has the following main defects: 1. It relies on word segmentation and trained word vectors. For abnormal event texts, training a set of general word vectors needs to rely on place names, The identification of entities such as proper nouns requires a lot of work; 2. The classification of an abnormal event can only be reflected by a few words in the text, and the neural network cannot focus on specific segments of the input text sequence, resulting in The discrimination accuracy is low; 3. The convolutional neural network uses a sliding window to scan the input text, so the connection between non-adjacent words cannot be obtained, and it is difficult for the model to obtain a better semantic representation ability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Abnormal event classification method and system based on text processing
  • Abnormal event classification method and system based on text processing
  • Abnormal event classification method and system based on text processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In this embodiment, a method for classifying abnormal events based on text processing is characterized in that it includes the following steps:

[0046] Step 1. Set the naive Bayesian model;

[0047] Step 2. Use BERT to classify abnormal event labels;

[0048] Step 3. Judging the abnormal event type according to the business logic model.

[0049] In a further embodiment, the Naive Bayesian model described in step 1 takes the independence between feature words as a premise assumption, learns the joint probability distribution from input to output, and then based on the probability distribution, finds the , so that the setting of the output with the largest posterior probability is first based on the original text data of abnormal events, using named entity recognition to replace elements that are not closely related to the characteristics of abnormal events with the same character representation, perform data preprocessing, and then convert the text For word segmentati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an abnormal event classification method and system based on text processing, and the method comprises the steps: carrying out the classification of a text through naive Bayes, and proving that the text is highly correlated with a label; performing abnormal event label classification by utilizing a BERT model; judging according to the combination of a BERT model and a service logic model, adjusting the output weight of the BERT according to an artificial rule, learning joint probability distribution from input to output by taking the independence among naive Bayesian model feature words as a premise hypothesis, and solving an output enabling the posterior probability to be maximum under an input condition according to the probability distribution; and in combination with naive Bayes, a BERT model and a service logic model, automatic classification and labeling of abnormal event data are realized, so that an abnormal event analysis service of the security industry is assisted. The BERT model is used for abnormal event classification and is combined with the naive Bayesian model, related business knowledge is fused for different types of label classification tasks, and different model adjustments are carried out.

Description

technical field [0001] The invention relates to a natural language processing technology, in particular to a method and system for classifying abnormal events based on text processing. Background technique [0002] Abnormal event classification refers to the label classification and marking of an abnormal event through natural language processing algorithms. The tags include alarm type, event type, case type, and address area type. The current text classification of abnormal events, the main algorithms are Naive Bayesian and TextCNN. [0003] Among them, Naive Bayes is a simple model that relies on the bag-of-words model. It can only count macroscopic semantic features, and cannot obtain word position information and the relationship between words. It is a low-level model with low accuracy. TextCNN, on the other hand, is a convolutional neural network that uses text as input. It has the following main defects: 1. It relies on word segmentation and trained word vectors. For ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/295G06Q50/26G06K9/62
CPCG06F16/35G06F40/295G06Q50/26G06F18/24155
Inventor 叶恺翔吕晓宝张谦孙亚文姚伯祥王元兵王海荣
Owner JIANGSU DAWN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products