Cross-modal information retrieval method based on semantic fusion
An information retrieval, cross-modal technology, applied in the field of information retrieval, can solve the problems of difficult data representation and measurement, different feature spaces, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0090] refer to figure 1 , as an embodiment of the present invention, provides a cross-modal information retrieval method based on semantic fusion, including:
[0091] S1: Collect raw data and preprocess the raw data. It should be noted:
[0092] The original data includes the original image, audio and tactile signal, and the resolution of the original image is adjusted to 224×224×3; the audio is converted into a discrete digital signal; the tactile signal or discrete digital signal is preprocessed as a new signal;
[0093] Preprocessing includes,
[0094] (1) Pre-emphasis:
[0095] Set the new signal to x(n), 0≤n≤N-1, apply the pre-emphasis filter to the signal x(n), and get the pre-emphasis signal y(n):
[0096]
[0097] Among them, α represents the pre-emphasis filter coefficient, N is the signal length, and the signal x(n) sampling frequency is f s ;
[0098] (2) Framing:
[0099] Record the frame size FRAME_SIZE as N sz, the frame step size FRAME_STRIDE is reco...
Embodiment 2
[0154] refer to Figure 2-11 It is the second embodiment of the present invention, which is different from the first embodiment in that it provides a verification test of a cross-modal information retrieval method based on semantic fusion, in order to improve the technical effect adopted in this method Verification shows that this embodiment adopts the traditional technical scheme and the method of the present invention to carry out a comparative test, and compares the test results by means of scientific demonstration, so as to verify the real effect of the method.
[0155] Traditional technical solutions: The traditional six methods of CCA, KCCA, ICA, PCA, AE, and VAE have low retrieval accuracy when dealing with cross-modal retrieval problems involving three modalities; in order to verify that this method has higher In this embodiment, the traditional CCA, KCCA, ICA, PCA, AE, and VAE methods will be used to compare the MAP values with this method. The larger the MAP value,...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


