Hierarchical label extraction method based on bidirectional Transformer model

A labeling and hierarchical technology, applied in the field of hierarchical label extraction based on the bidirectional Transformer model, can solve the problem of less end-to-end algorithm research and achieve the effect of three-dimensional labeling

Pending Publication Date: 2021-11-26
WUHAN YANGTZE COMM IND GRP
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for the multi-layer label classification problem, that is, the classified label itself contains one level, two levels, or even more levels. At present, most of them adopt the pipeline method, first classify the labels level by level, and then connect multiple steps in sequence. However, end-to-end algorithm research is less

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hierarchical label extraction method based on bidirectional Transformer model
  • Hierarchical label extraction method based on bidirectional Transformer model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0023] like figure 1 As shown, the present invention provides a technical solution: a method for extracting hierarchical labels based on a bidirectional Transformer model, comprising the following steps:

[0024] Obtain feature vectors through unsupervised pre-training text data;

[0025] Use the bidirectional Transformer model to further optimize the feature vector;

[0026] Combined with the multi-layer label knowledge base system, the nested Multi-Class c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hierarchical label extraction method based on a bidirectional Transformer model. The hierarchical label extraction method comprises the following steps: obtaining a feature vector through unsupervised pre-training text data; further adjusting and optimizing the feature vector by using a bidirectional Transformer model; in combination with a multi-layer label knowledge base system, performing supervised training on the nested Multi-Class classification model by utilizing softmax and a manual labeling label; and finally, outputting a hierarchical prediction label. According to the invention, a bidirectional Transformer model is adopted, and multi-level labels are combined and nested into a multi-class classification model for learning and training. According to the method, the learning of the multi-level classification labels of the text is realized, and the method can be used for automatic labeling (a plurality of specific labels in a certain field) of the network public opinion text and case analysis (layer-by-layer deepening of case skills) in a police service platform, so that the three-dimensional labeling of the text data is realized.

Description

technical field [0001] The invention relates to the fields of machine learning and text mining analysis, in particular to a method for extracting hierarchical labels based on a bidirectional Transformer model. Background technique [0002] With the rapid development of information technology, Internet information platforms and some business platforms (such as police comprehensive platforms) are flooded with massive data, which contains a lot of valuable information. How to quickly identify the subject tags corresponding to massive information is of great importance for efficient screening. Value information is of great significance. [0003] Labeling a piece of text is essentially a multi-class problem. In machine learning algorithms, SVM, Bayesian, and LSTM all play an important role, and statistical language models can also play a certain role. However, for the multi-layer label classification problem, that is, the classified label itself contains one level, two levels, o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/084G06N3/047G06F18/2414G06F18/2431G06F18/2415
Inventor 金勇陈宏明胡林利
Owner WUHAN YANGTZE COMM IND GRP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products