Transfer learning method based on semi-supervised clustering

A semi-supervised clustering and transfer learning technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of low accuracy, large classification error, and low efficiency of transfer learning process, and achieve accuracy High and efficient effect

Active Publication Date: 2013-04-03
HARBIN ENG UNIV
View PDF3 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For new data, only the bridge refinement process can be performed again, which is very inefficient for the entire migration learning process
In addition, the current research on transfer learning methods is facing a common problem: the accuracy is not high
However, due to the limitations of knowledge and ability, the classification accuracy of the final classifier is not high, that is, the classification error is relatively large.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Transfer learning method based on semi-supervised clustering
  • Transfer learning method based on semi-supervised clustering
  • Transfer learning method based on semi-supervised clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The present invention adopts standard text data 20Newsgroup as experimental data. 20Newsgroup is a document set containing 20,000 newsgroups, divided into 20 subcategories and 7 top-level categories. In order to make the data set satisfy the migration scenario, the present invention reconstructs the data set. Since the dataset contains subcategories belonging to different top-level categories, subcategories of the same top-level category can be divided into subcategories of different top-level categories to form datasets in different fields. Based on this reconstruction, the source domain data and the target data come from different distributions, but because their subclasses have the same top-level class, there is a certain relationship between the two domain data. The source field and target field data are as follows:

[0020]

[0021]

[0022]

[0023] The present invention is described in detail below in conjunction with accompanying drawing example:

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a transfer learning method based on semi-supervised clustering. The transfer learning method based on the semi-supervised clustering comprises the following steps: calculating similarity and average similarity of data in each class of target data and auxiliary data; according to the average similarity, obtaining a similarity weight vector of the target data and a class tag; taking the vector with the maximum weight as a tag of the target data; with the target data as a centroid, performing K-means clustering into clusters, wherein the tag, having the maximum proportion of data in each cluster to the total data of the class to which the cluster belongs, is taken as a cluster tag; comparing a classification result with a pre-classification result; and in the finally-formed similarity weight vector of the target data, selecting a data tag with the maximum weight as the data tag of the target data so as to form a final classifier. The invention provides the transfer learning method based on the semi-supervised clustering, which can transfer a classifying method and a classifying technology from one field to another field and can improve the precision of the classification result.

Description

technical field [0001] The invention relates to a method in the field of machine learning, in particular to a migration learning method based on semi-supervised clustering. Background technique [0002] The traditional classifier in machine learning requires that the source data and the target data must have the same distribution. However, the information development in modern society is changing with each passing day. This assumption is difficult to realize in real life. When data with different distributions appears, traditional classifiers have to collect a large amount of data again, and experts will analyze and label these data. A lot of manpower and time will be invested, which is not only inefficient, but also expensive. Transfer learning can overcome the shortcomings of collecting data from scratch. It can transfer its useful methods and techniques from different but similar fields to the target field, and help the target field data to be classified. [0003] Ther...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 初妍陈曼沈洁夏琳琳王勇李丽洁高迪王兴梅
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products