Method and device for carrying out multi-label classification on text data

A technology of text data and data categories, applied in the field of data processing, can solve problems such as inefficiency and time-consuming, and achieve the effects of mitigating the impact, reducing the time for data prediction, and improving the effect of the model

Pending Publication Date: 2022-02-11
SHENGDOUSHI SHANGHAI SCI & TECH DEV CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the output space of multi-label classification increases exponentially with the number of label items, when the number of label items is large, this classification method of directly predicting all-level labels takes a long time and is relatively inefficient.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for carrying out multi-label classification on text data
  • Method and device for carrying out multi-label classification on text data
  • Method and device for carrying out multi-label classification on text data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Exemplary embodiments of the present application will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein; rather, these The concepts are fully conveyed to those skilled in the art. In the drawings, the size of some elements may be exaggerated or deformed for clarity. The same reference numerals in the drawings denote the same or similar structures, and thus their detailed descriptions will be omitted.

[0019] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of the embodiments of the present application. However, those skilled in the art will appreciate that the technical solutions of the present application may be prac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for classifying text data, which comprises the following steps: predicting a first label item string of the text data, which is used for representing a first-level category of the text data, and the first label item string comprises one or more levels of label items with a progressive relationship; according to the first-level category of the text data, predicting a second label item string, used for representing a second-level category of the text data, of the text data, where the second-level category belongs to a sub-category of the first-level category, and the second label item string comprises one or more levels of label items with progressive relations; and performing splicing according to the first label item string and the second label item string to obtain a classification result of the text data. The invention further relates to a device for classifying the text data, a computer readable storage medium and electronic equipment.

Description

technical field [0001] The present application relates to data processing, in particular to a method and a device for classifying text data whose category is indicated by a label implying a multi-level progressive relationship. Background technique [0002] In data classification, labels are often used to indicate the category to which the data corresponds. At present, there are two main label systems, one is a single and independent fragmented label system, such as the label system of NetEase Cloud Music, its label set includes "Chinese", "Japanese", "Cantonese", "popular", " Rock", "folk" and other labels. In the fragmented label system, each label contains only one label item, and they are independent of each other and have no joint relationship. The other is a label system that implies a multi-level progressive relationship, in which each label contains multiple label items separated by separators and arranged in a progressive relationship, so that a string is composed...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F16/35G06K9/62G06N3/04G06N3/08
CPCG06F40/289G06F16/35G06N3/04G06N3/08G06F18/241
Inventor 凌悦付宇赵新歌
Owner SHENGDOUSHI SHANGHAI SCI & TECH DEV CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products