Supercharge Your Innovation With Domain-Expert AI Agents!

Active learning classification method based on Gaussian mixture model and sparse Bayesian

A mixed Gaussian model and sparse Bayesian technology, applied in the field of machine learning, can solve the problems of small sample size and poor prediction performance

Active Publication Date: 2019-09-03
WUHAN UNIV OF TECH
View PDF6 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

As a supervised learning method, the traditional correlation vector machine only uses the labeled data as the training set to construct the learning model, which is easy to cause problems such as small sample size and poor predictive performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Active learning classification method based on Gaussian mixture model and sparse Bayesian
  • Active learning classification method based on Gaussian mixture model and sparse Bayesian
  • Active learning classification method based on Gaussian mixture model and sparse Bayesian

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0119] A specific embodiment of the inventive method is as follows:

[0120] A specific implementation application of the present invention is to apply the method of the present invention to text classification, and classify texts according to document topics. The dataset used for data input is the text classification dataset 20Newgroup. The dataset contains about 20,000 articles from different newsgroups, each newsgroup is about a different topic, and there are 20 topics in total. In this implementation application, the data of 8 subjects are extracted as experimental data, and the experimental data is divided into two parts, one part is used as a training set (60%), and the other part is used as a test set (40%). For the data of these 8 topics, 8 different binary classification data sets can be constructed with each topic as the positive class. Each topic training set has about 600 samples, and the test set has about 400 samples.

[0121] The subjects of these 8 datasets a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an active learning classification method based on a Gaussian mixture model and sparse Bayesian, and the method comprises the following steps of 1) employing the Gaussian mixture model to train all samples including marked samples and unmarked samples, and obtaining the mixing coefficient, the mean value, and the covariance of each Gaussian component; 2) constructing an initial training set XL, taking the initial training set as a marked sample set, and updating an unmarked sample set XU; 3) constructing an initial GMM-FRVM (Gaussian mixture kernel-based direct-push typerelevance vector machine) model; 4) updating the marked sample set and the unmarked sample set based on the GMM-FRVM model; 5) retraining the GMM-FRVM model based on the updated marked sample set andthe updated unmarked sample set; and 6) finishing the classification marking of all samples by adopting the final GMM-FRVM model. According to the method disclosed by the invention, a better classification effect is obtained through artificial marking as few as possible by means of the active learning combining the Gaussian mixture model and the sparse Bayesian.

Description

technical field [0001] The invention relates to the field of machine learning, in particular to an active learning classification method based on a mixed Gaussian model and sparse Bayesian. Background technique [0002] With the rapid development of computer-related technologies, society has become more information-based, and a large amount of data is generated every day. In actual scenarios, a large amount of data obtained by people is unlabeled. When traditional supervised learning methods use a small amount of labeled data for training, it is difficult to achieve better prediction results due to small sample size and less information. Manually labeling samples takes a lot of time and energy, and in some cases it is even impossible to complete the labeling of a large number of samples. Based on these problems, the present invention proposes an active learning classification method based on mixed Gaussian model and sparse Bayesian. [0003] Active learning expands the sam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/10G06K9/62
CPCG06N20/10G06F18/24155G06F18/2411
Inventor 刘芳马登峰王洪海李政颖陈钢赵洋
Owner WUHAN UNIV OF TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More