Method and system for cross-modal retrieval of multimedia data using tag hierarchy information
A multimedia data and modal technology, applied in the field of cross-media retrieval, can solve problems such as ignoring label level information and not considering cross-layer label related information, so as to achieve the effect of quality assurance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0035] This embodiment provides a method for cross-modal retrieval of multimedia data using label hierarchy information;
[0036] Such as figure 1 As shown, the cross-modal retrieval method of multimedia data using label hierarchical information includes:
[0037] S101: Obtain the first modal multimedia data to be retrieved;
[0038] S102: Perform feature extraction on the first modal multimedia data to be retrieved to obtain a first hash code;
[0039] S103: Calculate the distance between the first hash code and the known hash codes corresponding to all the pre-stored multimedia data of the second modality; select the multimedia data of the second modality corresponding to several hash codes with the closest distance, output as search results.
[0040] As one or more embodiments, in S101, the first modality multimedia data to be retrieved includes but not limited to: one or more of text data, image data, audio data or video data.
[0041] As one or more embodiments, in S1...
Embodiment 2
[0140] This embodiment provides a multimedia data cross-modal retrieval system utilizing label hierarchy information;
[0141] A cross-modal retrieval system for multimedia data utilizing tag-level information, including:
[0142] An acquisition module configured to: acquire the first modality multimedia data to be retrieved;
[0143] A feature extraction module configured to: perform feature extraction on the first modality multimedia data to be retrieved to obtain a first hash code;
[0144] The retrieval output module is configured to: calculate the distance between the first hash code and the known hash codes corresponding to all the multimedia data of the pre-stored second modality; Two-modal multimedia data is output as a retrieval result.
[0145]It should be noted here that the above acquisition module, feature extraction module and retrieval output module correspond to steps S101 to S103 in Embodiment 1, and the examples and application scenarios implemented by the ...
Embodiment 3
[0149] This embodiment also provides an electronic device, including: one or more processors, one or more memories, and one or more computer programs; wherein, the processor is connected to the memory, and the one or more computer programs are programmed Stored in the memory, when the electronic device is running, the processor executes one or more computer programs stored in the memory, so that the electronic device executes the method described in Embodiment 1 above.
[0150] It should be understood that in this embodiment, the processor can be a central processing unit CPU, and the processor can also be other general-purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other programmable logic devices , discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, o...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


