Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese image semantic description method combined with multilayer GRU based on residual error connection Inception network

A semantic description, network technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve the problems of sentence coherence and readability, and low recognition rate.

Inactive Publication Date: 2018-11-16
HARBIN UNIV OF SCI & TECH
View PDF12 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The machine learning classification algorithm that relies on the combination of traditional manual features can also realize the semantic description of images, but the generated Chinese sentences are not very coherent and readable from both objective and subjective perspectives.
The application of deep learning network has improved this problem, but there are still shortcomings such as low recognition rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese image semantic description method combined with multilayer GRU based on residual error connection Inception network
  • Chinese image semantic description method combined with multilayer GRU based on residual error connection Inception network
  • Chinese image semantic description method combined with multilayer GRU based on residual error connection Inception network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The specific embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0053] A Chinese image semantic description method based on a residual connection-based Inception network combined with a multi-layer GRU network is a Chinese image semantic description method based on the Inception-residual module combined with a multi-layer GRU network; the Inception-residual module is a pair of Inception_v3 (GoogleNet series The third version of the deep convolutional network) The core component of the network, the Inception Architecture module, combines the residual ideas proposed by the residual neural network (ResNet) to design and generate a brand new core component of the deep convolutional network, which makes the network structure While further deepening, the performance of the network will not be degraded, so that deeper features can be extracted. The present invention adopts the deep convolutional neura...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese image semantic description method combined with multilayer GRU based on a residual error connection Inception network, and belongs to the field of computer vision andnatural language processing. The method comprises the steps: carrying out the preprocessing of an AI Challenger image Chinese description training set and an estimation set through an open source tensorflow to generate a file at the tfrecord format for training; pre-training an ImageNet data set through an Inception_ResNet_v2 network to obtain a convolution network pre-training model; loading a pre-training parameter to the Inception_ResNet_v2 network, and carrying out the extraction of an image feature descriptor of the AI Challenger image set; building a single-hidden-layer neural network model and mapping the image feature descriptor to a word embedding space; taking a word embedding characteristic matrix and the image feature descriptor after secondary characteristic mapping as the input of a double-layer GRU network; inputting an original image into a description model to generate a Chinese description sentence; employing an evaluation data set for estimation through employing the trained model and taking a Perplexity index as an evaluation standard. The method achieves the solving of a technical problem of describing an image in Chinese, and improves the continuity and readability of sentences.

Description

technical field [0001] The invention belongs to the field of computer vision and natural language processing, and in particular relates to a Chinese image semantic description method based on a residual connection-based Inception network combined with a multi-layer GRU network. Background technique [0002] Chinese text description of images is a technology that combines computer vision and Chinese natural language processing (NLP). , how to more efficiently search for the image information required by users in a large number of images, only relying on the traditional keyword retrieval method not only has the problem of slow search speed but also has the problem of inaccurate image positioning. For scientific research, in In the era of big data, if you rely on manpower to label a large number of pictures, generating labels is particularly unrealistic. Accurate labeling of images can not only greatly improve people's image retrieval efficiency, but also provide accurate data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06N3/048G06F18/214G06F18/24
Inventor 谢金宝吕世伟刘秋阳李佰蔚梁新涛王玉静
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products