Supercharge Your Innovation With Domain-Expert AI Agents!

Short text classification method and system based on multi-feature fusion

A technology of multi-feature fusion and classification method, applied in the field of short text classification method and system based on multi-feature fusion, can solve the problems of difficult to achieve classification results, difficult to obtain better results, sparse information content, etc., to improve the classification accuracy , the model results are stable, the use effect is good

Inactive Publication Date: 2020-07-28
NANJING UNIV OF POSTS & TELECOMM
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

(4) The text format is not standard
However, the current short text classification technology mainly has the following problems: (1) Faced with specific scenarios and resources based on machine learning methods, the feature engineering steps depend on domain knowledge, the calculation efficiency is low, and it is difficult to extend to other scenarios
Moreover, using the bag-of-words feature to represent the text will lose the word order information of the text, and will cause the text feature dimension to be too high and the information content to be sparse, which only represents the very shallow content of the text.
Especially in the face of short text classification, the sparsity of features is serious, and it is difficult to have better results
(2) The current model cannot effectively solve the feature sparsity problem caused by short texts, and it is difficult to achieve better classification results
It cannot effectively solve the feature sparsity problem of short texts, and it is difficult to achieve better results in classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text classification method and system based on multi-feature fusion
  • Short text classification method and system based on multi-feature fusion
  • Short text classification method and system based on multi-feature fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0029] Such as figure 1 As shown, the present invention involves a short text classification method based on multi-feature fusion, and short text classification must face the problem of feature sparsity brought about by text characteristics. Considering that the traditional feature extension method faces specific scenarios and resources, the feature engineering steps depend on domain knowledge, and the calculation efficiency is low, so it is difficult to extend to other scenarios. The utilization method of the word frequency inverse word order value proposed by the present invention has no such limitations, and combines the other two methods to extract features to solve the problem of feature sparsity. Considering the feature conflict problem caused by multi-feature fusion, which makes the focus of the method unclear an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a short text classification method and system based on multi-feature fusion, and the method comprises the steps: firstly carrying out the preprocessing of a text, including word segmentation, stop word processing and feature selection; secondly, extracting features of the processed text through a word frequency and inverse word order method, a convolutional neural networkand a long-short-term memory network algorithm, and forming three feature vectors; then, fusing the three types of features, using an attention mechanism to weight the fused features, and highlightingimportant features; and finally, enabling the fusion features to pass through a classifier to obtain a short text classification result. A feature dictionary is established by using word frequency inverse word order features, and vectorization representation is performed on a text; features are extracted in combination with a filter and a long-short-term memory network, the three kinds of features are fused to enrich short text features, an attention mechanism is used for distributing weights, and the classification effect is stabilized.

Description

technical field [0001] The invention belongs to the field of natural language processing, and in particular relates to a short text classification method and system based on multi-feature fusion. Background technique [0002] With the advent of the era of big data, paper documents are rapidly changing to electronic and digital, and text classification has become one of the most common tasks in natural language processing. With the advancement of network technology and the development of electronic social media, a new type of text—short text has become an important form of network information. Short texts have become an important form for individuals to express their opinions and share information on online platforms. Short text data has a wide range of applications, such as questions raised by users in question answering systems, chat records in social network exchange forums, sentiment analysis on review sites, etc. Short text refers to short text, which is relative to do...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/36G06F40/284G06F40/247G06K9/62G06N3/04G06N3/08
CPCG06F16/35G06F16/374G06N3/08G06N3/047G06N3/044G06N3/045G06F18/2415G06F18/253
Inventor 徐小龙刘聪
Owner NANJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More