Extreme multi-label learning method based on space-time network clustering reduction integration

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A learning method and multi-label technology, applied in the field of multi-label text mining, can solve problems such as ignoring label sparsity, label-level model training, and poor learning scalability, so as to solve time and space consumption, improve representation ability, and improve general chemical effect

Pending Publication Date: 2022-06-28

YUNNAN UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] (1) The traditional multi-label model cannot adapt to the shortcomings of extreme multi-label scenarios

[0005] Traditional multi-label learning only focuses on a relatively small number of labels, such as less than 100, but with the increasing number of Internet data, the number of labels has exceeded ten thousand or one million. Due to the huge number of labels, the traditional multi-label learning method takes a long time The complexity is too large to adapt to extreme multi-label learning scenarios

[0006] (2) Low generalization performance of existing extreme multi-label learning models

[0007] Existing extreme multi-label learning methods are mainly based on tree ensembles, embeddings, and deep network end-to-end methods. These methods simply use linear or nonlinear mapping for multi-label learning, ignoring the dependencies between labels, especially when labels When the amount is huge, the labels are usually presented in a graph, which makes the generalization performance of the existing multi-label learning model low

[0008] (3) Poor scalability of existing extreme multi-label learning

[0009] Existing extreme multi-label learning methods are mainly developed for specific label magnitudes, and cannot adapt to different label magnitude models for training, and ignore the problems of label sparseness and imbalance, resulting in poor generalization and scalability of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0058] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0059] The present invention provides a technical solution: an extreme multi-label learning method based on space-time network clustering reduction integration, comprising the following steps:

[0060] S1: Space-time network attention ensemble representation;

[0061] S11: Acquisition of original extreme multi-label data; learning based on extreme multi-label data acquired in different actual application scenarios;

[0062] S12: Phra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an extreme multi-label learning method based on space-time network clustering reduction integration in the technical field of multi-label text mining. The extreme multi-label learning method comprises the following steps: space-time network attention integration characterization; self-adaptive label relation enhancement and clustering reduction learning are carried out; carrying out unbalanced learning on the weighted reduction label set; according to the method, interactive attention among the words, the phrases and the labels in the multi-label text is integrated, the dependency relationship among the words, the phrases and the labels is explored, and the extreme multi-label text characterization capability is effectively improved; a self-adaptive label relation enhancement and clustering reduction learning mechanism is provided, through self-adaptive label relation enhancement, the dependency relation between the labels can be effectively mined, the generalization of the model is improved, and through clustering reduction learning, the labels of different magnitudes can be effectively adapted to the existing model for training; a weighted reduced label set imbalance learning mechanism is provided, and the problems of poor model generalization and expandability and the like caused by label sparsity and imbalance are solved.

Description

technical field [0001] The invention relates to the technical field of multi-label text mining, in particular to an extreme multi-label learning method based on space-time network clustering reduction integration. Background technique [0002] With the continuous development of Internet technology, more and more labels present a multi-label situation. Traditional multi-label learning only focuses on a relatively small amount of labels. With the increase of Internet data, the amount of labels has exceeded 10,000 or millions. , the most widely used is extreme multi-label text classification, such as short text classification such as Taobao, web page links, etc., with more than 10,000 tags; Wikipedia text classification, with more than one million tags. When faced with such a large amount of labels, the traditional multi-label learning method will not be applicable. Therefore, in view of the application limitations of traditional multi-label learning, it is of great practical s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F16/35G06F16/31G06N3/04G06N3/08

CPCG06F16/353G06F16/322G06N3/08G06N3/044G06N3/045

Inventor 夏跃龙杨云

Owner YUNNAN UNIV

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Extreme multi-label learning method based on space-time network clustering reduction integration

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology