An Image Annotation Method Based on Weak Matching Probability Canonical Correlation Model

A canonical correlation and matching probability technology, which is applied in the field of image annotation based on the weak matching probability canonical correlation model, which can solve the problems of time-consuming and laborious manual matching, data asynchrony, and missing data.

Active Publication Date: 2018-08-31
INST OF COMPUTING TECH CHINESE ACAD OF SCI +1
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In canonical correlation analysis, two groups of related random variables can come from multiple sources of information (such as a person's voice and image), or they can be different features extracted from the information of the same source (such as image color features and texture features) , but the training data must be strictly matched one-to-one. Many reasons make it difficult to obtain such strictly matched training data. For example, in a multi-sensor acquisition system, the sensor sampling frequency is not synchronized or the sensor failure will cause the data collected by different channels to be out of sync or The data of a certain channel is lost; single-modal data is relatively easy to obtain, but manual matching is very time-consuming and laborious. In practice, the multi-modal data often only has a small amount of one-to-one strict matching, and the rest of the large amount of data is not matched. Weakly matched multimodal data
[0004] There are two basic approaches to canonical correlation analysis for weakly matched multimodal data: discard unmatched data and only use canonical correlation analysis to process strictly matched multimodal data; match multimodal data according to specific criteria, but Neither approach is likely to achieve the desired result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Image Annotation Method Based on Weak Matching Probability Canonical Correlation Model
  • An Image Annotation Method Based on Weak Matching Probability Canonical Correlation Model
  • An Image Annotation Method Based on Weak Matching Probability Canonical Correlation Model

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0100] The following are specific embodiments of the present invention, as follows:

[0101] Experiments on artificial weak matching multimodal datasets are as follows:

[0102] In order to verify the effectiveness of the SemiPCCA model, we construct the following artificial data sets: sample set subject to N(0,I d ), where the dimension d=2, the number of samples N=300, the complete set of matching samples Obtained by constructing,

[0103]

[0104]

[0105]

[0106] In order to obtain a weakly matched sample set, we construct a discriminant function f(x 2 ) = a T x 2 -θ, where θ represents the discrimination threshold, for the sample If its discriminant function value then from remove the sample. It can be seen that the larger θ is, the more samples are removed;

[0107] When comparing SemiPCCA with traditional CCA and PCCA, we chose the following weighted cosine distance,

[0108]

[0109] in, and Respectively represent the "true" d typical p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image tagging method and system based on a typical correlation model of weak matching probability. The invention relates to the technical field of network cross-media information processing, including acquiring tagged images and unlabeled images in an image database, and extracting the tagged images respectively. With the image features and text features of the unlabeled image, a matched sample set and an unmatched sample set are generated, the matched sample set includes an annotated image feature set and an annotated text feature set, and the unmatched sample set includes An unlabeled image feature set and an unlabeled text feature set; according to the matched sample set and the unmatched sample set, train the weak matching probability typical correlation model; through the weak matching probability typical correlation model, treat Annotate images for annotation. The present invention simultaneously uses marked images and their keywords and unmarked images to learn the association between the visual modality and the text modality, and accurately marks unknown images.

Description

technical field [0001] The invention relates to the technical field of network cross-media information processing, in particular to an image labeling method based on a typical correlation model of weak matching probability. Background technique [0002] The Internet of Things and the Internet have rich multimedia information resources such as text, images, video, and audio. These information resources are heterogeneous, and it is difficult to directly find the correlation between them. Canonical correlation analysis (CCA) is a It is a statistical analysis tool used to analyze the correlation between two groups of random variables. Its correlation preservation feature has been proved in theory, and it is applied in the fields of economics, meteorology and genome data analysis. CCA finds two groups of random variables through statistical methods. The potential relationship between heterogeneous multimodal features uses a unified model to associate different types of multimodal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/5866G06F18/24
Inventor 张博史忠植王伟齐保元马刚
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products