LSTM-based two-dimensional image target class identification method

A technology of two-dimensional images and target categories, applied in the field of image recognition, can solve problems such as low accuracy rate, inability to fully reflect one-dimensional vectors, and inability to fully reflect the two-dimensional topology of image data, etc., to achieve good explainability The effect of improving accuracy

Active Publication Date: 2018-03-02
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The one-dimensional vector obtained by such a single conversion method cannot fully reflect the spatial organization relationship of image pixels, that is, this method ignores other optional methods when matrix data is converted into one-dimensional data; and makes the two-dimensional topology of image data ( The distance relationship of different pixels in two-dimensional space) is not fully reflected, resulting in a low accuracy rate of discrimination

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • LSTM-based two-dimensional image target class identification method
  • LSTM-based two-dimensional image target class identification method
  • LSTM-based two-dimensional image target class identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described below in conjunction with the description of the drawings and specific embodiments.

[0030] An LSTM-based two-dimensional image target category identification method, which adds the transformation method when converting a two-dimensional image into a one-dimensional image, according to the circular series connection, row sequence series connection and column sequence connection method from outside to inside respectively. The two-dimensional image is converted into a one-dimensional vector, and then all are sent to the LSTM network for processing (training or identification). Since the LSTM network outputs a result for each conversion method. Then, use a specially designed weighted score fusion scheme to fuse the network output results of different conversion methods, and perform target identification based on the final fusion results. This approach can fully expand the spatial organization of two-bit data into one-bit da...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an LSTM-based two-dimensional image target class identification method comprising the following steps that S1, a two-dimensional image is converted into a one-dimensional vectoraccording to a ring series mode from outside to inside, the two-dimensional image is converted into the one-dimensional vector according to a row sequence series mode, and the two-dimensional image is converted into the one-dimensional vector according to a column sequence series mode; S2, the one-dimensional vectors obtained by the three conversion modes are all transmitted to an LSTM network tobe trained or identified, and then three results are outputted; and S3, the three results outputted by the LSTM network are fused and target identification is performed based on the final fusion result. The beneficial effects of the method are that different modes are applied to the two-dimensional image to obtain the one-dimensional vector so that the spatial organization relationship of the two-dimensional image can be fully utilized, the correct rate of identification can be obviously enhanced and the method has great interpretability.

Description

technical field [0001] The present invention relates to image recognition, in particular to an LSTM-based two-dimensional image target category recognition method. Background technique [0002] LSTM (Long Short Term Memory) deep network has important applications in object category recognition, but it is limited to the situation where the object is expressed as one-dimensional data. The main advantage of the LSTM network is that it can make full use of the associated information of the sequence in time sequence. figure 1 A schematic diagram of the overall structure of LSTM is given. In order to solve the problem of gradient disappearance in the traditional RNN network, that is, the perception of the later time nodes to the previous time nodes decreases. LSTM has designed a unique Cell structure to realize the memory function and maintain the perception ability of subsequent time nodes for previous time nodes. LSTM also designed a "Forget Gate" to selectively forget previous...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/049G06N3/08G06F18/2111G06F18/2413
Inventor 徐勇吴帅
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products