Image description generation method and device, model training method and device and storage media

A technology of image description and description information, applied in the field of machine learning, can solve the problem of inaccurate image description information, achieve the effect of accurate target image description information and improve accuracy

Active Publication Date: 2018-07-20
SHENZHEN TENCENT COMP SYST CO LTD
View PDF14 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the image description generation method provided by related technologies, the input parameters of the decoder only include the global feature vector and the set of label vectors of the tar...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image description generation method and device, model training method and device and storage media
  • Image description generation method and device, model training method and device and storage media
  • Image description generation method and device, model training method and device and storage media

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0030] For ease of description, terms involved in each embodiment are briefly introduced below.

[0031] CNN (Convolution Neural Network, Convolutional Neural Network), is a feed-forward neural network that directly starts from the underlying pixel features of the image, and extracts the features of the image layer by layer. It is the most commonly used implementation model of the encoder, responsible for encoding the image into a vector.

[0032] RNN (Recurrent Neural Network, recursive neural network), is a neural network with fixed weights, external inputs and internal states, which can be regarded as the behavioral dynamics of internal states with weights and external inputs as parameters . RNN is the most commonly used implement...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image description generation method and device, a model training method and device and storage media, and belongs to the technical field of machine learning. The image description generation method comprises that a target image is obtained; a first global characteristic vector and a first mark vector set of the target image are generated; the target image is input to a matching model, a first multi-mode characteristic vector of the target image is generated by the matching model; the matching model is obtained by training a training image and reference image description information of the training image; and description information of the target image is generated according to the first multi-mode characteristic vector, the first global characteristic vector andthe first mark vector set. The multi-mode characteristic vector of the target image is generated via the matching model which is obtained by training, and the multi-mode characteristic vector is theninput to a calculation model to obtain the description information of the target image, and thus, the generated image description information is more accurate.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of machine learning, and in particular to an image description generation method, a model training method, a device, and a storage medium. Background technique [0002] With the development of image recognition technology, the content information of images can be converted into text descriptions of images through algorithms. [0003] A related image description generation method includes: firstly, the acquired target image is encoded by an encoder, such as a feature extraction model, to generate a global feature vector and a set of annotation vectors of the target image, and then input the global feature vector and The annotation vectors are collected into the decoder, such as the calculation model, and finally the description information of the target image is obtained. [0004] In the image description generation method provided by the related art, the input parameters of the dec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06T9/00G06N3/04G06V10/776
CPCG06N3/049G06T9/002G06N3/045G06N3/084G06V10/454G06V10/82G06V10/776G06N3/044G06F18/214G06F18/217
Inventor 姜文浩马林刘威
Owner SHENZHEN TENCENT COMP SYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products