Extreme multi-label learning method based on space-time network clustering reduction integration

A learning method and multi-label technology, applied in the field of multi-label text mining, can solve problems such as ignoring label sparsity, label-level model training, and poor learning scalability, so as to solve time and space consumption, improve representation ability, and improve general chemical effect

Pending Publication Date: 2022-06-28
YUNNAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) The traditional multi-label model cannot adapt to the shortcomings of extreme multi-label scenarios
[0005] Traditional multi-label learning only focuses on a relatively small number of labels, such as less than 100, but with the increasing number of Internet data, the number of labels has exceeded ten thousand or one million. Due to the huge number of labels, the traditional multi-label learning method takes a long time The complexity is too large to adapt to extreme multi-label learning scenarios
[0006] (2) Low generalization performance of existing extreme multi-label learning models
[0007] Existing extreme multi-label learning methods are mainly based on tree ensembles, embeddings,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extreme multi-label learning method based on space-time network clustering reduction integration
  • Extreme multi-label learning method based on space-time network clustering reduction integration
  • Extreme multi-label learning method based on space-time network clustering reduction integration

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0058] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0059] The present invention provides a technical solution: an extreme multi-label learning method based on space-time network clustering reduction integration, comprising the following steps:

[0060] S1: Space-time network attention ensemble representation;

[0061] S11: Acquisition of original extreme multi-label data; learning based on extreme multi-label data acquired in different actual application scenarios;

[0062] S12: Phra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an extreme multi-label learning method based on space-time network clustering reduction integration in the technical field of multi-label text mining. The extreme multi-label learning method comprises the following steps: space-time network attention integration characterization; self-adaptive label relation enhancement and clustering reduction learning are carried out; carrying out unbalanced learning on the weighted reduction label set; according to the method, interactive attention among the words, the phrases and the labels in the multi-label text is integrated, the dependency relationship among the words, the phrases and the labels is explored, and the extreme multi-label text characterization capability is effectively improved; a self-adaptive label relation enhancement and clustering reduction learning mechanism is provided, through self-adaptive label relation enhancement, the dependency relation between the labels can be effectively mined, the generalization of the model is improved, and through clustering reduction learning, the labels of different magnitudes can be effectively adapted to the existing model for training; a weighted reduced label set imbalance learning mechanism is provided, and the problems of poor model generalization and expandability and the like caused by label sparsity and imbalance are solved.

Description

technical field [0001] The invention relates to the technical field of multi-label text mining, in particular to an extreme multi-label learning method based on space-time network clustering reduction integration. Background technique [0002] With the continuous development of Internet technology, more and more labels present a multi-label situation. Traditional multi-label learning only focuses on a relatively small amount of labels. With the increase of Internet data, the amount of labels has exceeded 10,000 or millions. , the most widely used is extreme multi-label text classification, such as short text classification such as Taobao, web page links, etc., with more than 10,000 tags; Wikipedia text classification, with more than one million tags. When faced with such a large amount of labels, the traditional multi-label learning method will not be applicable. Therefore, in view of the application limitations of traditional multi-label learning, it is of great practical s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F16/31G06N3/04G06N3/08
CPCG06F16/353G06F16/322G06N3/08G06N3/044G06N3/045
Inventor 夏跃龙杨云
Owner YUNNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products