Semi-supervised text classification method and system based on multi-granularity modeling

A text classification, semi-supervised technology, applied in the field of data processing technology and machine learning, can solve the problem of incomplete semantics, missing samples or features, unable to solve polysemy, etc., to achieve the effect of solving incomplete semantics

Active Publication Date: 2021-01-12
HEFEI UNIV OF TECH
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the deficiencies of the prior art, the present invention provides a semi-supervised text classification method and system based on multi-granularity modeling, which solves the problem of missing samples or features in traditional semi-supervised classification methods, as well as the incomplete semantics and inability to solve a problem. polysemy problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised text classification method and system based on multi-granularity modeling
  • Semi-supervised text classification method and system based on multi-granularity modeling
  • Semi-supervised text classification method and system based on multi-granularity modeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention are clearly and completely described. Obviously, the described embodiments are part of the embodiments of the present invention, not all of them. example. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0053] The embodiment of the present application provides a semi-supervised text classification method and system based on multi-granularity modeling, which solves the unsatisfactory effect of semi-supervised text classification caused by the loss of samples or features and the use of a single-granularity language model in traditional semi-supervised classification methods. The problem of achieving a better classification effe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a semi-supervised text classification method and system based on multi-granularity modeling, and relates to the technical field of data processing and machine learning. According to the invention, a three-channel text vector model layer is formed in a multi-granularity text modeling mode, text modeling is conducted on the same text from the character level, the word level and the sentence level; modeling of the three levels serves as three channels respectively, and input and output of the three channels are connected into three base classifier sets; therefore, divergence between samples is obtained under the condition that the samples or characteristics are not lost, and a traditional resampling and random subspace method is replaced. Meanwhile, nine base classifiers are integrated into a design of three base classifier groups, the advantages of different base classifiers are integrated, different features of the same sample are obtained by using different baseclassifiers, and divergence among the base classifiers is obtained, so that the classification result accuracy of the semi-supervised text classification method is effectively improved.

Description

technical field [0001] The invention relates to the fields of data processing technology and machine learning technology, in particular to a semi-supervised text classification method and system based on multi-granularity modeling. Background technique [0002] The rapid development of Internet technology has made information transmission more and more rapid and convenient. In the process of continuous generation and interaction of information, more new information has been derived. This information has been growing exponentially. In these Massive information often contains many valuable things, which not only reflect a large number of potential needs of users from the side, but also feed back many problems existing in enterprise services. If these massive user information are quickly mined and effectively extracted, it is possible to easily grasp user needs, improve a large number of problems in enterprise services, provide users with better services, and seize a wider mark...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N3/08
CPCG06F16/35G06N3/08
Inventor 余本功汲浩敏朱梦迪王胡燕王惠灵张子薇朱晓洁
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products