Video and text cross-modal retrieval method based on relational reasoning network
Patent Information
- Authority / Receiving Office: CN · China
- Current Assignee / Owner: CHENGDU KOALA URAN TECH CO LTD
- Publication Date: 2021-08-10
Description
Technical field
[0001] The invention relates to the field of cross-modal retrieval, and in particular to a video and text cross-modal retrieval method based on a relational reasoning network.
Background art
[0002] Cross-media retrieval means that a user can retrieve semantically related data across all media types by submitting query data of any media type. In the present invention, it refers specifically to the mutual retrieval of video and text. In general, a data set provides videos together with their corresponding video description texts. The task of cross-media retrieval is: for any video, retrieve the video description text most relevant to its content; or, for any video description text, retrieve the video most relevant to that description. With the increasing amount of multimedia data such as text, images, and videos on the Internet, retrieval across different modalities has become a new trend in information retrieval. The difficulty of this problem lies in how...
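The two retrieval directions described in paragraph [0002] can be illustrated with a minimal sketch. It assumes that videos and description texts have already been mapped to vectors in a shared embedding space and that relevance is scored by cosine similarity. Both are assumptions made for illustration only; the function names and the toy vectors are hypothetical, and the sketch does not represent the relational reasoning network of the invention.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def retrieve(query, candidates):
    """Return candidate indices ordered from most to least similar to the query."""
    scores = [cosine(query, c) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)


# Hypothetical toy embeddings (assumption: both modalities share one 3-d space).
video_embeddings = [[0.9, 0.1, 0.0], [0.0, 0.8, 0.6]]
text_embeddings = [[0.1, 0.7, 0.7], [1.0, 0.0, 0.1]]

# Video-to-text: rank every description text for each video.
video_to_text = [retrieve(v, text_embeddings) for v in video_embeddings]

# Text-to-video: rank every video for each description text.
text_to_video = [retrieve(t, video_embeddings) for t in text_embeddings]
```

In this toy data, the first entry of each ranking is the index of the item in the other modality judged most relevant to the query, which corresponds to the "most relevant" item named in the task definition.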