Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic image semantic description method

A technology of semantic description and automatic image, applied in still image data retrieval, still image data query, still image data clustering/classification, etc. Advanced problems, to achieve the effect of high accuracy, high semantic description precision, and strong applicability

Active Publication Date: 2019-06-07
ZHENGZHOU UNIVERSITY OF LIGHT INDUSTRY
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the existing technology, the image features extracted by CNN are used as the input of the recurrent neural network (RNN), and the image semantic description information is used as the output of the RNN, and the image semantic description problem is regarded as the translation process from the image to the semantic description. An automatic image semantic description model based on CNN and RNN, but the accuracy of this method's understanding of image semantics is not high, the sentences marked with this model are not smooth enough, and the accuracy of the marked content is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0024] The present invention provides a technical solution: an automatic image semantic description method, comprising the following steps;

[0025] Step 1, utilize clustering algorithm to carry out clustering to image set according to visual characteristic, the algorithm of clustering algorithm is the image clustering algorithm of K-means, can improve the accuracy of clustering;

[0026] Step 2, dividing the clustered image set into several categories, and each category is divided into several images;

[0027] Step 3. Use CNN to perform image pre-description processing, and mark the purpose of pre-description. The process of image pre-description processing is: vectorization, attribute establishment, projection transformation, and data format conversion;

[0028] Step 4. Label the image with several categories, and the pre-description of the first layer is called the category label of the image. The category label of the image is mapped to the category space based on the feat...

Embodiment 2

[0035] The present invention provides a technical solution: an automatic image semantic description method, comprising the following steps;

[0036] Step 1. Use a clustering algorithm to cluster the image set according to the visual features. The algorithm of the clustering algorithm is the image clustering algorithm of isodata, which can automatically increase or decrease the number of categories during the clustering process, which speeds up the efficiency;

[0037] Step 2, dividing the clustered image set into several categories, and each category is divided into several images;

[0038] Step 3. Use CNN to perform image pre-description processing, and mark the purpose of pre-description. The process of image pre-description processing is: vectorization, attribute establishment, projection transformation, and data format conversion;

[0039] Step 4. Label the image with several categories, and the pre-description of the first layer is called the category label of the image. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic image semantic description method. the method comprises the following steps: utilizing a clustering algorithm to cluster the image set according to the visual features; dividing the clustered image set into a plurality of categories; performing image pre-description processing by using the CNN; the image is labeled with a plurality of categories, and the pre-description of the first layer is called as the category labeling of the image; constructing a classifier for each type of image by using an SVM; judging whether such descriptions are added to the imageor not by utilizing a classifier; utilizing an MBRM model labeling algorithm; and obtaining image semantics through the joint of the image areas obtained by the related training set. The invention provides an automatic image semantic description method. The method can effectively fuse the low-level features of the image and the semantic description high-level semantic information of the image, ishigh in precision and accuracy, has the characteristics of definiteness, formalization, sharing, summarization and the like, can be widely applied to a plurality of fields including information retrieval, information extraction, semantic networks and knowledge management, and is high in applicability.

Description

technical field [0001] The invention relates to the technical field of image semantic description, in particular to an automatic image semantic description method. Background technique [0002] Automatic description of image content (image captioning), that is, to automatically describe the content of images in natural language, has a wide range of application prospects for automatic description of image content, such as human-computer interaction and blind guide systems, and has recently become a field of computer vision and artificial intelligence. A new focus, unlike image classification or object detection, automatic image description aims at comprehensive description of objects, scenes and their relationships, involving visual scene parsing, content semantic understanding, and natural language processing, which is a cutting-edge technology in mixed tasks integrated design; [0003] In the existing technology, the image features extracted by CNN are used as the input of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/55G06F16/53G06K9/62
Inventor 李祖贺张涛钱晓亮曾黎金保华于泽琦田二林于源
Owner ZHENGZHOU UNIVERSITY OF LIGHT INDUSTRY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products