Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Hierarchical tag-based cross-modal hash model construction method, search method and device

A technology of hierarchical labeling and construction methods, applied in the field of multimedia data search, can solve problems such as failure to achieve performance, inaccurate processing and utilization of label information, etc.

Active Publication Date: 2019-08-30
SHANDONG UNIV
View PDF11 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the inventors found in the research and development process that the existing technology still has some deficiencies in the use of tag information. Some tag information is treated independently, and the processing and utilization of tag information is not precise enough, so that the best performance cannot be achieved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hierarchical tag-based cross-modal hash model construction method, search method and device
  • Hierarchical tag-based cross-modal hash model construction method, search method and device
  • Hierarchical tag-based cross-modal hash model construction method, search method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0065] According to an aspect of one or more embodiments of the present disclosure, a method for constructing a cross-modal hash model based on hierarchical tags is provided.

[0066] Such as figure 1 As shown, a method for constructing a cross-modal hash model based on hierarchical labels, which includes:

[0067] S1 receives the multimodal data set, and preprocesses the samples in it to filter the interference data;

[0068] S2 inputs the pre-processed samples into the pre-trained multi-path neural network according to the different modes;

[0069] S3 extracts the feature data of different modalities according to the pre-trained neural network and multi-layer perceptron, and obtains the hierarchical hash representation of different modalities;

[0070] S4 constructs the similarity matrix of preprocessed samples at different levels according to the level label, and trains the inner product of the hash representation according to the median value of each level of similarity ...

Embodiment 2

[0134] According to an aspect of one or more embodiments of the present disclosure, there is provided a computer-readable storage medium.

[0135] A computer-readable storage medium stores a plurality of instructions, and the instructions are suitable for being loaded and executed by a processor of a terminal device to execute the method for constructing a cross-modal hash model based on hierarchical tags.

Embodiment 3

[0137] According to an aspect of one or more embodiments of the present disclosure, a terminal device is provided.

[0138] A terminal device, which includes a processor and a computer-readable storage medium, the processor is used to implement instructions; the computer-readable storage medium is used to store multiple instructions, and the instructions are suitable for being loaded by the processor and executing the described one A method for building cross-modal hashing models based on hierarchical labels.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a hierarchical tag-based cross-modal Hash model construction method, a search method and a device. The method comprises the following steps of receiving a multi-modal data set,and preprocessing; inputting the relative data of the preprocessed data of different modalities into a pre-trained multi-path neural network; according to the pre-trained neural network and the multi-layer perceptron, extracting the characteristic data of different modes respectively, and obtaining the hierarchical Hash representations of different modes; constructing the similarity matrixes of the preprocessed samples on different levels according to the level labels, and evaluating the semantic similarity among the samples according to the inner product of the Hash representation trained bythe median of each level of similarity matrix; adopting the hierarchical labels with different granularities, analyzing the influence of the hierarchical comparison on the neural network performance,and determining the optimal hierarchical ratio; obtaining a Hash code according to each layer of Hash representation; and training the dual-path neural network, optimizing and training the dual-pathneural network by using an SGD gradient descent method, and establishing a deep cross-modal hash model based on hierarchical tags for cross-modal search.

Description

technical field [0001] The disclosure belongs to the technical field of multimedia data search, and relates to a method for constructing a cross-modal hash model based on hierarchical tags, a search method and a device. Background technique [0002] The statements in this section merely provide background information related to the present disclosure and do not necessarily constitute prior art. [0003] With the vigorous development of multimedia devices, the amount of multimedia data on the Internet has increased unprecedentedly. Data often exists in multiple modalities, such as pictures, texts, and videos. There are often semantic correlations between these modalities of data, expressing the same object from different angles, so that people have a clearer and more complete understanding of the data. In real-life applications, such as major e-commerce platforms, people are more and more inclined to search for each other between different modal data. similar data. Theref...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/41G06F16/43G06N3/04G06N3/08
CPCG06F16/41G06F16/43G06N3/08G06N3/045
Inventor 王润琦宋雪萌孙畅畅崔超然关惟俐宓生润
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products