Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text classification method based on representation enhancement and fusion

A text classification and representation technology, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, etc., can solve problems such as difficulty in correct classification and unbalanced distribution of sample data

Pending Publication Date: 2020-10-23
南京睿晖数据技术有限公司 +1
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to provide a text classification method based on representation enhancement and fusion, which can effectively solve the problem of unbalanced distribution of sample data among categories and difficulty in correct classification when there is a lack of samples in a few categories in the existing text multi-category classification. question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method based on representation enhancement and fusion
  • Text classification method based on representation enhancement and fusion
  • Text classification method based on representation enhancement and fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In order to make the objects and advantages of the present invention clearer, the present invention will be specifically described below in conjunction with examples. It should be understood that the following words are only used to describe one or several specific implementation modes of the present invention, and do not strictly limit the protection scope of the specific claims of the present invention.

[0041] A text classification method based on representation enhancement and fusion, including constructing a text classification model based on representation enhancement and fusion (reference figure 1 ), the text classification model based on representation enhancement and fusion includes data representation layer, representation enhancement layer, representation abstraction layer, classification layer and fusion layer, and the processing steps of the input text in the text classification model based on representation enhancement and fusion are:

[0042] First, for ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a text classification method based on representation enhancement and fusion. The method comprises the steps that a text classification model based on representation enhancement and fusion is constructed, and the processing steps of an input text in the text classification model based on representation enhancement and fusion are as follows: discrete characters of the inputtext are converted into continuous feature vectors in a data representation layer to obtain multiple representation vectors; adding disturbance into the representation vector in the representation enhancement layer to obtain a representation enhancement vector; further extracting and abstracting the characterization enhancement vector in the characterization abstraction layer to obtain an abstractcharacterization vector; classifying the abstract representation vectors in a classification layer to obtain output text tags; and synthesizing each output text label in the fusion layer to obtain afinal text label. The method can effectively solve the problems that in existing text multi-class classification, distribution of sample data among classes is unbalanced, and correct classification isdifficult when the number of samples in a small number of classes is insufficient.

Description

technical field [0001] The invention relates to the technical field of text classification, in particular to a text classification method based on representation enhancement and fusion. Background technique [0002] With the advent of the era of big data, the number of electronic texts has accumulated rapidly. Facing such a large amount of text data, its maintenance, management and utilization are extremely challenging. Using text classification technology, it can automatically classify unstructured data such as a large number of complex texts, which can enable users to classify and archive materials more conveniently and efficiently. [0003] Text classification refers to the process in which a computer maps a text containing information to a predetermined category or categories of topics. It is a topic that has been studied in natural language processing for many years, and it is also a classic machine learning technology. It has important applications in the fields of sy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06K9/62G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06N3/045G06F18/2415
Inventor 刘峰陈一飞
Owner 南京睿晖数据技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products