Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-media retrieval method based on Resnet-Bert network model

A network model and cross-media technology, applied in the field of cross-media retrieval, can solve the problems of low accuracy of cross-modal retrieval, and achieve good cross-media retrieval effect, strong knowledge representation ability, and enhanced association learning

Pending Publication Date: 2020-11-17
CETC BIGDATA RES INST CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing cross-media retrieval technology is limited to two kinds of media data. In fact, this kind of search can no longer meet people's increasing data retrieval needs, especially the problem of low accuracy of cross-modal retrieval.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-media retrieval method based on Resnet-Bert network model
  • Cross-media retrieval method based on Resnet-Bert network model
  • Cross-media retrieval method based on Resnet-Bert network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] Adopt the above scheme, combined with the technical means of the prior art, realize according to the following steps:

[0030] Step 1: Cross-media data collection. Through crawling, query, communication and other means, the present invention obtains cross-media data with related topics, consistent semantics and the same tags. The data includes four types of data: image, video, audio, and text, and each media type data includes 200 species of birds. The image data is the CUB-200-2011 dataset, with a total of 11,788 images, 5,994 training sets and 5,794 test sets. The video data uses the YouTube Birds dataset, with 12,666 videos for the training set and 5,864 videos for the test set. The text dataset consists of 4000 training sets and 4000 testing sets. The audio data includes 6000 training spectrograms and 6000 test spectrograms. Among them, the image CUB-200-2011 data is obtained by downloading from relevant websites, and the video YouTube Birds data is obtained by ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a cross-media retrieval method based on a Resnet-Bert network model. The method comprises the following steps of carrying out classified retrieval on at least one of image data,text data, video data and audio data by adopting a Resnet-Bert network model, and returning a corresponding classification result. Compared with the traditional mutual retrieval of two media types, the mutual retrieval of the four media types can realize wider market application; a Resnet convolutional neural network model with a good effect and a Bert model leading in 11 natural language processing aspects at present are adopted, and the model itself can obtain higher-layer, more abstract and richer feature expressions; the used four modals of data and information are mutually migrated, associated learning is enhanced, and stronger knowledge representation capability is realized; The invention benefits from the improvement of computer performance, the Resnet-Bert network model can achieve a better cross-media training effect and a better cross-media retrieval effect through complex calculation.

Description

technical field [0001] The present invention relates to a cross-media retrieval method based on Resnet-Bert network model, belonging to. Background technique [0002] In the era of big data, a variety of media data types, such as text, images, video, audio, etc., have become the main data forms for people to acquire knowledge. More and more users are eager to learn and master more comprehensive knowledge information through a variety of media data content and the interrelationships between them, so as to assist their own cognition and problem solving. [0003] Retrieval is one of the common ways for users to acquire knowledge. Traditional cross-media retrieval research mainly focuses on two types of media data: search for images by text and search for text by images. In fact, with the advent of the era of big data, people will generate a large amount of text data through the Internet, such as news reports, Weibo Taobao and other comment data, WeChat chat records, bullet scr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/43G06F16/45G06N3/04G06N3/08
CPCG06F16/43G06F16/45G06N3/08G06N3/045
Inventor 闫盈盈张婧慧洒科进曹扬丁剑飞
Owner CETC BIGDATA RES INST CO LTD
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More