Domain-adaptive chemical potential safety hazard short text classification method and system

A short text classification technology for potential safety hazards, applied to chemical safety hazard short text classification methods and systems. It addresses large differences in text length, sparse text features, and the difficulty of capturing domain-related feature information in short texts. It achieves a good classification effect and compensates for the deviation of domain information.

Pending Publication Date: 2021-07-20
QINGDAO UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0011] Short texts vary widely in length, lack context information, have sparse text features, and contain words whose semantics depend strongly on the domain. General short text classification techniques struggle to capture the domain-related feature information of short texts, which results in low classification accuracy.


Examples

Embodiment 1

[0040] This embodiment provides a domain-adaptive chemical safety hazard short text classification method.

[0041] As shown in Figure 1, a domain-adaptive chemical safety hazard short text classification method includes:

[0042] S101: Obtain several short texts to be classified in the field of chemical safety hazard investigation;

[0043] S102: Perform vector extraction for each short text to be classified to obtain an initial text vector corresponding to each short text to be classified;

[0044] S103: Input the initial text vectors corresponding to all the short texts to be classified into the trained short text classification model, and output the short text classification results.
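
Steps S101 to S103 can be read as a simple pipeline. The following is a minimal sketch of that data flow only, not the patented model. The names `obtain_texts`, `extract_vector`, `classify`, and `run_pipeline` are hypothetical, the hashed character-count vector is a toy stand-in for the BERT-based extraction described in the source, and the dot-product comparison against label centroids stands in for the trained short text classification model.

```python
import hashlib
import math

DIM = 16  # toy vector size; an assumption, not a value from the source


def obtain_texts(records):
    """S101: collect the non-empty short texts to be classified."""
    return [r.strip() for r in records if r and r.strip()]


def extract_vector(text, dim=DIM):
    """S102 (stand-in): hash each character into a fixed-size count vector.

    The source uses BERT here; hashing is only a dependency-free placeholder.
    """
    vec = [0.0] * dim
    for ch in text:
        bucket = int(hashlib.md5(ch.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]  # unit length, so dot product = cosine


def classify(vectors, centroids):
    """S103 (stand-in): pick the label whose centroid is most similar."""
    results = []
    for vec in vectors:
        best = max(
            centroids,
            key=lambda label: sum(a * b for a, b in zip(vec, centroids[label])),
        )
        results.append(best)
    return results


def run_pipeline(records, centroids):
    texts = obtain_texts(records)                  # S101
    vectors = [extract_vector(t) for t in texts]   # S102
    labels = classify(vectors, centroids)          # S103
    return list(zip(texts, labels))
```

In this sketch each stage only consumes the output of the previous one, so the placeholder encoder and classifier could be swapped for real models without changing `run_pipeline`.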

[0045] Further, S102, performing vector extraction on each short text to be classified to obtain the corresponding initial text vector, specifically includes:

[0046] Based on the BERT model, vector extraction is performed for each short text to...

Embodiment 2

[0143] This embodiment provides a domain-adaptive chemical safety hazard short text classification system.

[0144] A domain-adaptive chemical safety hazard short text classification system, including:

[0145] The obtaining module is configured to: obtain several short texts to be classified in the field of chemical safety hazard investigation;

[0146] The extraction module is configured to: perform vector extraction on each short text to be classified, and obtain an initial text vector corresponding to each short text to be classified;

[0147] The classification module is configured to: input the initial text vectors corresponding to all short texts to be classified into the trained short text classification model, and output the short text classification results.
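
The three modules can be sketched as small composable components. This is an illustrative arrangement under stated assumptions, not the system claimed in the source: the class names and the `run` / `classify_all` methods are hypothetical, and the encoder and model are passed in as plain callables so that any vector extractor or trained classifier could be plugged in.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Vector = List[float]


@dataclass
class ObtainingModule:
    # Hypothetical source callable returning the short texts to classify.
    source: Callable[[], Sequence[str]]

    def run(self) -> List[str]:
        return list(self.source())


@dataclass
class ExtractionModule:
    # Any text-to-vector function; the source describes a BERT-based one.
    encoder: Callable[[str], Vector]

    def run(self, texts: Sequence[str]) -> List[Vector]:
        return [self.encoder(t) for t in texts]


@dataclass
class ClassificationModule:
    # Any vector-to-label function standing in for the trained model.
    model: Callable[[Vector], str]

    def run(self, vectors: Sequence[Vector]) -> List[str]:
        return [self.model(v) for v in vectors]


@dataclass
class ClassificationSystem:
    obtaining: ObtainingModule
    extraction: ExtractionModule
    classification: ClassificationModule

    def classify_all(self) -> List[str]:
        texts = self.obtaining.run()
        vectors = self.extraction.run(texts)
        return self.classification.run(vectors)
```

A toy wiring, using text length as the only feature, would be `ClassificationSystem(ObtainingModule(lambda: ["leak"]), ExtractionModule(lambda t: [float(len(t))]), ClassificationModule(lambda v: "long" if v[0] > 10 else "short"))`.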

[0148] Note that the obtaining module, extraction module, and classification module above correspond to steps S101 to S103 in Embodiment 1, and the e...

Embodiment 3

[0152] This embodiment also provides an electronic device, including: one or more processors, one or more memories, and one or more computer programs. The processor is connected to the memory, and the one or more computer programs are stored in the memory. When the electronic device is running, the processor executes the one or more computer programs stored in the memory, so that the electronic device executes the method described in Embodiment 1 above.

[0153] It should be understood that in this embodiment the processor may be a central processing unit (CPU), or another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, o...

Abstract

The invention discloses a domain-adaptive chemical potential safety hazard short text classification method and system. The method comprises: acquiring a plurality of short texts to be classified in the field of chemical potential safety hazard investigation; performing vector extraction on each short text to obtain its corresponding initial text vector; and inputting the initial text vectors of all the short texts into a trained short text classification model, which outputs the short text classification result. By adopting GRU + HAN, the method learns a fused representation of information at the character, word, and sentence levels of domain-specific short texts. This addresses the domain information deviation of short texts relative to a general corpus and yields a better classification effect in the chemical potential safety hazard investigation task.
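
The abstract names GRU + HAN but this excerpt does not give the equations. As a rough illustration of the attention pooling commonly used in hierarchical attention networks (an assumption about the general technique, not the patent's exact formulation), the sketch below collapses a sequence of hidden states, such as GRU outputs, into one vector. The function names and the parameters `weight`, `bias`, and `context` are hypothetical and would normally be learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, weight, bias, context):
    """Pool a sequence of hidden states into a single vector.

    u_i     = tanh(W h_i + b)
    alpha_i = softmax_i(u_i . context)
    output  = sum_i alpha_i * h_i
    """
    scores = []
    for h in hidden_states:
        u = [
            math.tanh(sum(w * x for w, x in zip(row, h)) + b)
            for row, b in zip(weight, bias)
        ]
        scores.append(sum(ui * ci for ui, ci in zip(u, context)))
    alphas = softmax(scores)
    dim = len(hidden_states[0])
    pooled = [
        sum(a * h[d] for a, h in zip(alphas, hidden_states))
        for d in range(dim)
    ]
    return pooled, alphas
```

In a hierarchical setup the same pooling would be applied once per level, for example over words to form a sentence vector and then over sentences to form a text vector, each level with its own parameters.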

Description

Technical field

[0001] The invention relates to the technical field of short text classification, in particular to a domain-adaptive chemical safety hazard short text classification method and system.

Background

[0002] The statements in this section merely mention background technology related to the present invention and do not necessarily constitute prior art.

[0003] With the rapid development of deep learning technology, many researchers have tried to use deep learning to solve text classification problems, and many novel and fruitful classification methods have emerged, especially those based on CNNs (convolutional neural networks) and RNNs (recurrent neural networks). Text classification methods handle problems such as Internet news classification and sentiment analysis very well, but in the application of specific related fields, due to the different text characteristics of the field, there are practica...

Application Information

IPC(8): G06F16/35, G06F40/30, G06K9/62, G06N3/04, G06N3/08
CPC: G06F16/35, G06F40/30, G06N3/08, G06N3/048, G06N3/045, G06F18/2415
Inventor: 杜军威, 朱孟帅, 李浩杰, 胡强, 于旭, 江峰, 陈卓
Owner: QINGDAO UNIV OF SCI & TECH