Check patentability & draft patents in minutes with Patsnap Eureka AI!

Network big data long text multi-label classification method and system, equipment and medium

A classification method and long text technology, applied in the field of data processing, can solve the problems of poor classification efficiency and classification accuracy, and achieve the effect of improving efficiency and accuracy

Active Publication Date: 2021-12-24
CENT SOUTH UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the embodiments of the present disclosure provide a network big data long text multi-label classification method, system, device and medium, which at least partially solve the problems of poor classification efficiency and classification accuracy in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network big data long text multi-label classification method and system, equipment and medium
  • Network big data long text multi-label classification method and system, equipment and medium
  • Network big data long text multi-label classification method and system, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] Embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings.

[0055] Embodiments of the present disclosure are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present disclosure from the contents disclosed in this specification. Apparently, the described embodiments are only some of the embodiments of the present disclosure, not all of them. The present disclosure can also be implemented or applied through different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present disclosure. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other. Based on the embodiments in the present disclosure, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a network big data long text multi-label classification method and system, equipment and a medium and belongs to the technical field of data processing. The method comprises steps of acquiring an original data set; analyzing the original data set to construct a keyword table corresponding to each label, and performing preprocessing to obtain text data; converting the text data into word vectors and word vectors, and calculating position vectors corresponding to different words in the text data; obtaining an embedded vector; inputting the embedded vector into a void gate convolutional layer for coding to obtain a coding vector; and extracting features of the coding vectors according to the self-attention mechanism model to obtain relevance of each word in the text data, and forming a classification result. According to the scheme, the multi-label long text is segmented by constructing the keyword table, the text data is converted into different vectors and then coded, then the self-attention mechanism is used for extracting the features to obtain the relevance of each word, the classification result is formed, and the classification efficiency and accuracy are improved.

Description

technical field [0001] The embodiments of the present disclosure relate to the technical field of data processing, and in particular to a method, system, device and medium for classifying network big data with long text and multiple labels. Background technique [0002] At present, the advent of the Internet era has brought a series of new challenges to big data governance. However, due to the wide range of data in network big data, the processing methods for different types of network big data are not the same, so the network big data Fast splitting and classification processing are crucial for the next step of big data analysis and further processing. At present, methods based on traditional neural networks have good results in common network big data multi-label classification. Among them, LSTM is suitable for short-text network big data, and TextCNN is suitable for long-text network big data. However, when faced with document-level network big data, the effect of common ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/289G06N3/04
CPCG06F16/35G06F40/289G06N3/045
Inventor 李芳芳苏朴真龙军陈先来徐雪松毛星亮
Owner CENT SOUTH UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More