Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Inductive non-negative projection semi-supervised data classification method and system

A data classification and semi-supervised technology, applied in the direction of character and pattern recognition, instruments, computer components, etc., can solve the problems of classification performance impact, various noises, data redundancy, etc., to maintain neighborhood information and spatial structure, Accurate data representation results, good performance expansion effects

Active Publication Date: 2018-03-06
SUZHOU UNIV
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the existing inductive label propagation methods effectively solve the out-of-sample problem through embedding, there are still obvious shortcomings. Most of the data in practical applications usually contain redundancy or various noises, which have a great impact on classification performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Inductive non-negative projection semi-supervised data classification method and system
  • Inductive non-negative projection semi-supervised data classification method and system
  • Inductive non-negative projection semi-supervised data classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] The present invention is tested on 6 real data sets, including MIT face, AR male and female face, YaleBface, ORL face and Yale face. Based on the consideration of computational efficiency, the size of all real images is compressed to 32x32; in the experiment, each image corresponds to a 1024-dimensional vector. In the experiment, any number of each class is randomly selected from each data set as labeled samples, and any number of each class is randomly selected as unlabeled samples. These datasets were collected from multiple sources, so the test results are generally descriptive.

[0035] see figure 1 As shown, an inductive non-negative projection semi-supervised data classification method includes the following steps:

[0036] (1), randomly divide the original data set into a training set and a test set, then initialize the training set and the test set to obtain an initial category label matrix;

[0037] Step (1) specifically includes:

[0038]The original sampl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method of the invention definitely combines semi-supervised data representation and classification errors into a conventional non-negative matrix factorization framework for joint minimization learning, thereby applying weight coefficient construction and a label propagation process to the projection non-negative matrix factorization, avoiding the negative effects of noise, damage, or aliens included in original data on similarity measures and label prediction results. In addition, the joint minimization process can maintain the neighborhood information and the spatial structure in the projection non-negative matrix factorization process, and obtain an accurate data representation result. In addition, the weight construction and the inductive learning are integrated into a unified model so as to obtain an adaptive weight coefficient matrix, thereby avoiding the difficulty in selecting the optimal neighbor in a traditional algorithm. The method of the invention is an inductive model, can complete the induction and prediction of the data outside a sample, prevents introduction of additional reconstruction process, and has good scalability performance.

Description

technical field [0001] The invention relates to an inductive non-negative projection semi-supervised data classification method and system, belonging to the technical fields of pattern recognition and data mining. Background technique [0002] Graph-based semi-supervised learning has been an important topic in the fields of data mining and pattern recognition. Since the model can learn from a small amount of labeled data and a large amount of unlabeled data, it is very suitable for the characteristics of practical application data. Existing models can be broadly categorized into transductive learning and inductive learning based on whether they can be efficiently extended to out-of-sample new data. [0003] Label propagation, as a typical classification model, has attracted considerable attention and interest in academia in recent years. Existing typical transductive label propagation algorithms include Gaussian field and harmonic functions, local and global consensus lear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/21322G06F18/21324G06F18/214
Inventor 张召贾磊李凡长王邦军张莉
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products