An Image Annotation Method Based on Multimodal Deep Learning
A deep learning and image annotation technology, applied in the field of image processing, that addresses the difficulty of achieving satisfactory annotation results and improves both performance and annotation accuracy.
Embodiment Construction
[0050] The invention will be described in further detail below in conjunction with the accompanying drawings.
[0051] As shown in Figure 1, the present invention provides an image annotation method based on multimodal deep learning. The method comprises three steps: first, training a deep neural network on unlabeled images; second, optimizing each single modality with backpropagation; and finally, optimizing the weights between the different modalities online with the Power Gradient algorithm.
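The third step, online learning of the weights between modalities, can be illustrated with a small sketch. The text available here does not give the update rule of the Power Gradient algorithm, so the code below substitutes a generic multiplicative (Hedge-style) weight update as an assumption. The function names, the learning rate `eta`, the loss definition, and the example scores are all hypothetical and are not taken from the patent.

```python
import math


def combine_scores(weights, modality_scores):
    """Fuse per-modality label scores with a weighted sum (one weight per modality)."""
    num_labels = len(modality_scores[0])
    return [
        sum(w * scores[j] for w, scores in zip(weights, modality_scores))
        for j in range(num_labels)
    ]


def update_weights(weights, losses, eta=0.5):
    """Assumed Hedge-style update, not the patent's rule: scale each modality
    weight by exp(-eta * loss), then renormalize so the weights sum to 1."""
    scaled = [w * math.exp(-eta * loss) for w, loss in zip(weights, losses)]
    total = sum(scaled)
    return [s / total for s in scaled]


# Hypothetical example: two modalities scoring three candidate labels.
weights = [0.5, 0.5]
modality_scores = [
    [0.9, 0.05, 0.05],  # modality 0 favors label 0
    [0.2, 0.7, 0.1],    # modality 1 favors label 1
]
true_label = 0

# Illustrative per-modality loss: 1 minus the score given to the true label.
losses = [1.0 - scores[true_label] for scores in modality_scores]

weights = update_weights(weights, losses)
fused = combine_scores(weights, modality_scores)
```

In this sketch, the modality that scored the true label higher receives the larger weight after one update, so the fused scores lean toward its prediction. In an online setting, the update would be repeated as each new labeled example arrives.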
[0052] The deep neural network in the present invention is a convolutional neural network, whose model structure is shown in Figure 2. The performance of the proposed image annotation algorithm based on multimodal deep learning is evaluated through a series of experiments.
[0053] Step 1: Introduce the datasets used to evaluate the performance of the algorithm.
[0054] The experiment adopts three public image datasets, includi...