Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An inductive non-negative projection semi-supervised data classification method and system

A data classification and semi-supervised technology, applied in character and pattern recognition, instruments, calculations, etc., can solve various problems such as noise, classification performance impact, data redundancy, etc., achieve accurate data representation results, maintain neighborhood information and Spatial structure and good expansion performance

Active Publication Date: 2021-07-09
SUZHOU UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the existing inductive label propagation methods effectively solve the out-of-sample problem through embedding, there are still obvious shortcomings. Most of the data in practical applications usually contain redundancy or various noises, which have a great impact on classification performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An inductive non-negative projection semi-supervised data classification method and system
  • An inductive non-negative projection semi-supervised data classification method and system
  • An inductive non-negative projection semi-supervised data classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] The present invention is tested on 6 real data sets, including MIT face, AR male and female face, YaleB face, ORL face and Yale face. Based on the consideration of computational efficiency, the size of all real images is compressed to 32x32; in the experiment, each image corresponds to a 1024-dimensional vector. In the experiment, any number of each class is randomly selected from each data set as labeled samples, and any number of each class is randomly selected as unlabeled samples. These datasets were collected from multiple sources, so the test results are generally descriptive.

[0035] see figure 1 As shown, an inductive non-negative projection semi-supervised data classification method includes the following steps:

[0036] (1), randomly divide the original data set into a training set and a test set, then initialize the training set and the test set to obtain an initial category label matrix;

[0037] Step (1) specifically includes:

[0038]The original samp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method of the present invention explicitly combines the semi-supervised data representation and classification error with the existing projective non-negative matrix factorization framework for joint minimization learning, thereby applying the weight coefficient construction and label propagation process to the projective non-negative matrix factorization, which can effectively Avoid the negative impact of noise, corruption or heterogeneity that may be contained in the original data on the similarity measure and label prediction results. In addition, the above joint minimization process can also effectively preserve the neighborhood information and spatial structure during the projected non-negative matrix factorization process, resulting in more accurate data representation results. In addition, the weight construction and inductive learning are integrated into a unified model, and an adaptive weight coefficient matrix can be obtained, thereby avoiding the difficulty of selecting the best neighbor in traditional algorithms. The method of the invention is an inductive model, which can complete the induction and prediction of out-of-sample data without introducing an additional reconstruction process, and has good scalability.

Description

technical field [0001] The invention relates to an inductive non-negative projection semi-supervised data classification method and system, belonging to the technical fields of pattern recognition and data mining. Background technique [0002] Graph-based semi-supervised learning has been an important topic in the fields of data mining and pattern recognition. Since the model can learn from a small amount of labeled data and a large amount of unlabeled data, it is very suitable for the characteristics of practical application data. Existing models can be broadly categorized into transductive learning and inductive learning based on whether they can be efficiently extended to out-of-sample new data. [0003] Label propagation, as a typical classification model, has attracted considerable attention and interest in academia in recent years. Existing typical transductive label propagation algorithms include Gaussian field and harmonic functions, local and global consensus lear...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/21322G06F18/21324G06F18/214
Inventor 张召贾磊李凡长王邦军张莉
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products