Sparse coding algorithm suitable for multi-modal information and application thereof

A sparse coding algorithm for multi-modal information that addresses problems of existing sparse coding methods: loss of similarity information between data, instability of the resulting sparse codes, and reduced accuracy of cross retrieval.

Inactive Publication Date: 2015-07-08
HEFEI UNIV OF TECH +1

AI Technical Summary

Problems solved by technology

[0005] First, an over-complete codebook combined with independent coding of each sample loses the similarity information between data during coding: similar features can be encoded into very different sparse codes, which makes the sparse coding unstable.
[0006] Second, traditional sparse coding algorithms do not take the coding of multi-modal features into account. In cross retrieval of multi-modal information, the query item and the retrieved item are represented by features of different modalities, whose distributions can differ greatly. This also affects the stability of sparse coding and thereby reduces the accuracy of cross retrieval.



Examples

Embodiment Construction

[0046] In this embodiment, a sparse coding algorithm suitable for multimodal information is performed as follows:

[0047] Step 1: perform feature extraction on the multi-modal information D to obtain its feature matrix, denoted D = (X_I, X_T). Here X_I is the feature matrix of the social media images, and its i-th entry represents the features of the i-th social media image; t_1 is the dimension of the social media image features; m is the number of social media images. In this embodiment, the social media image features use the bag-of-words model: first extract SIFT features from the image, then obtain the cluster center points of the SIFT features by clustering, and project the SIFT features onto the cluster center points to obtain the bag-of-words feature. X_T is the feature matrix of the text information, and its i-th entry represents the features of the i-th text item; t_2 represents the dimension of the tex...
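The bag-of-words step described in [0047] can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the SIFT descriptors and the cluster centers are already available (for example from k-means), it assumes that "projecting" a descriptor means hard assignment to its nearest center, and the function names `nearest_center` and `bag_of_words` are hypothetical.

```python
from math import dist


def nearest_center(descriptor, centers):
    """Return the index of the cluster center closest to one descriptor."""
    return min(range(len(centers)), key=lambda k: dist(descriptor, centers[k]))


def bag_of_words(descriptors, centers):
    """Build an L1-normalized visual-word histogram for one image.

    descriptors: local feature vectors of one image (assumed precomputed).
    centers:     cluster centers of the descriptor space (assumed precomputed).
    """
    counts = [0] * len(centers)
    for descriptor in descriptors:
        # Assumption: hard assignment of each descriptor to its nearest center.
        counts[nearest_center(descriptor, centers)] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(centers)
    return [count / total for count in counts]
```

With two centers `(0, 0)` and `(10, 10)`, an image whose descriptors are `(1, 0)`, `(0, 1)`, `(1, 1)` and `(9, 10)` gets three votes for the first visual word and one for the second. Each image then contributes one such histogram as its entry in X_I.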

Abstract

The invention discloses a sparse coding algorithm suitable for multi-modal information and an application thereof. The algorithm comprises the following steps: (1) features are extracted from the images and texts of social media; (2) a Laplacian matrix is established for the features of the same modality; (3) a maximum mean discrepancy (MMD) matrix is established; (4) an objective function based on sparse coding is established; (5) the feature-sign search algorithm is used to update the sparse codes, yielding the feature representation of the multi-modal information; (6) the obtained feature representation is used to carry out cross retrieval. The multi-modal information is fully used for coding and the distribution difference between modalities is reduced, which improves the robustness of the sparse coding and the accuracy of cross retrieval.
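Steps 2 and 3 of the abstract can be illustrated with the textbook constructions of the two matrices. The sketch below is an assumption, since the patent's exact definitions are not visible in this excerpt: it uses the unnormalized graph Laplacian L = Deg - W for a given similarity matrix W, and the common MMD matrix whose quadratic form equals the squared distance between the mean codes of the two modalities. The function names are hypothetical.

```python
def graph_laplacian(weights):
    """Return L = Deg - W for a symmetric similarity matrix W (list of lists)."""
    n = len(weights)
    laplacian = [[-weights[i][j] for j in range(n)] for i in range(n)]
    for i in range(n):
        # Degree of node i is the sum of its similarity weights.
        laplacian[i][i] += sum(weights[i])
    return laplacian


def mmd_matrix(n_image, n_text):
    """Return the MMD matrix for n_image image samples followed by n_text text samples.

    Assumed standard form: 1/n_image^2 within images, 1/n_text^2 within texts,
    and -1/(n_image * n_text) across modalities.
    """
    n = n_image + n_text
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            i_is_image, j_is_image = i < n_image, j < n_image
            if i_is_image and j_is_image:
                matrix[i][j] = 1.0 / (n_image * n_image)
            elif not i_is_image and not j_is_image:
                matrix[i][j] = 1.0 / (n_text * n_text)
            else:
                matrix[i][j] = -1.0 / (n_image * n_text)
    return matrix


def quadratic_form(codes, matrix):
    """Compute s^T M s for a list of scalar codes s (one code per sample)."""
    n = len(codes)
    return sum(codes[i] * matrix[i][j] * codes[j] for i in range(n) for j in range(n))
```

For scalar codes `[1, 3]` (images) and `[5, 5]` (texts), the modality means are 2 and 5, and `quadratic_form([1, 3, 5, 5], mmd_matrix(2, 2))` gives their squared difference. Penalizing this term in an objective pulls the code distributions of the two modalities together, while the Laplacian term keeps similar samples of the same modality close in code space.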

Description

Technical field

[0001] The invention relates to multimedia information retrieval, in particular to a sparse coding algorithm for multi-modal information and its application.

Background art

[0002] In recent years, with the rise of social network platforms such as Weibo and Facebook, multimedia information has grown explosively, which places new requirements on traditional information retrieval technology. Simple text retrieval can no longer meet users' increasingly complex information retrieval needs. Users want to obtain data in different modalities such as text, image, audio, and video. Cross retrieval between multi-modal information, such as inputting an image and retrieving the related text, or inputting a piece of text and retrieving the image that best matches it, has become a hot topic in academia.

[0003] From the existing multi-modal information processing technology, it can be seen that the core problem is the modeling of differ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06K9/46
Inventor: 刘学亮, 刘菲
Owner: HEFEI UNIV OF TECH