Image semantic understanding analysis method based on global interaction

A technology of semantic understanding and parsing methods, applied in neural learning methods, instruments, biological neural network models, etc., can solve the problems of slow model convergence, poor semantic analysis logic, etc. high precision effect

Pending Publication Date: 2022-05-27
SHENYANG INST OF AUTOMATION - CHINESE ACAD OF SCI
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the problems of the representative model of encoding and decoding structure - the baseline model (Neural Image Caption Generator, NIC), the model convergence speed is slow, the generated semantic analysis logic is poor, etc., the present invention proposes that the global image information The semantic analysis model is sent in real time to guide the semantic generation, and the bidirectional cyclic neural network model is used to analyze the semantics of the image, so as to obtain a more logical and precise image semantic description

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image semantic understanding analysis method based on global interaction
  • Image semantic understanding analysis method based on global interaction
  • Image semantic understanding analysis method based on global interaction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] specific implementation

[0026] The following is a further detailed description of the present invention.

[0027] Step 1: Image Feature Information Extraction and Coding

[0028] 1.1) Image feature extraction and encoding

[0029] In the model of the present invention, the image feature extraction encoder adopts the convolutional neural network VGG-16 model to perform feature extraction on the input image, obtains 4096-dimensional high-dimensional image feature information at the network output, and sends the feature vector as the global information of the image. into the decoding end for cross-modal interaction.

[0030] Step 2: Decoding of image feature information

[0031] 2.1) Gated recurrent unit

[0032] In the global interaction model of the present invention, in order to improve the accuracy and richness of language description, a bidirectional cyclic neural network model is adopted, and in order to avoid the sharp increase in parameter scale caused by the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention constructs a global interactive image semantic analysis method based on machine vision, is applied to the generation of an image semantic title, and comprises the following specific steps: 1) selecting a target image feature extraction model, and carrying out feature extraction and coding on image data; 2) establishing a global interactive bidirectional recurrent neural network to analyze image features; 3) carrying out standard regularization processing on the extracted image feature information, and sending the image feature information into a semantic analysis model in real time in a global information mode to carry out model training; and 4) performing semantic analysis on a new target image through the trained model. The image semantics generated by the image semantics understanding model and algorithm established by the invention has the characteristics of strong logicality and rich semantics, has the advantages of high model convergence speed, high semantic analysis precision and the like, and is successfully applied to the fields of content-based image retrieval, medical image analysis, auxiliary blind guiding, early education of children and the like.

Description

technical field [0001] The invention relates to an analysis method for constructing an image semantic understanding model based on global interaction, and is applied to the fields of content-based image retrieval, medical image analysis, auxiliary guidance for the blind, early childhood education and the like. That is, given a picture, generating a text description of the picture belongs to the technical fields of target detection and semantic analysis algorithms. Background technique [0002] Image semantic understanding is based on image recognition and integrates interdisciplinary research in computer science, psychology, and linguistics, and has also made important contributions to the study of cross-modal interactions between images and texts. Image semantic understanding technology wants to rationally or perceptually understand the target image as a whole and generate a natural language description that conforms to human habits. It is a very complex and challenging ta...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/764G06V10/82G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06N3/045G06F18/241
Inventor 库涛熊艳彬杨琦瑞南琳刘金鑫林乐新王海张志东马岩
Owner SHENYANG INST OF AUTOMATION - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products