Unsupervised target detection method and system based on variational auto-encoder and Gaussian mixture model

A Gaussian mixture model and target detection technology, applied in the field of artificial intelligence, can solve the problems of insensitive object type information and insufficient detection accuracy, and achieve the effect of good performance and good effect.

Inactive Publication Date: 2021-08-13
SHANGHAI JIAO TONG UNIV
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention aims at the problem of insufficient detection accuracy of the existing unsupervised learning target detection for classification and multi-object scenes, and the existing target detection framework based on variational autoencoders is difficult to deal with scenes with a large number of objects and is not sensitive to object type information In order to solve the defects, an unsupervised object detection method and system based on variational autoencoder and Gaussian mixture model is proposed, which combines the spatial attention mechanism and Gaussian mixture model, which can not only achieve end-to-end object detection and classification, but also in the presence of In the case of a large number of objects, it still has good performance and has good scalability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unsupervised target detection method and system based on variational auto-encoder and Gaussian mixture model
  • Unsupervised target detection method and system based on variational auto-encoder and Gaussian mixture model
  • Unsupervised target detection method and system based on variational auto-encoder and Gaussian mixture model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] Such as figure 1 As shown, this embodiment involves an unsupervised target detection system based on a variational autoencoder and a Gaussian mixture model, including: a backbone network and a pres-prediction head, a depth prediction head, and a where-prediction head respectively connected to it ; Spatial transformation network and what-encoder and cat-encoder respectively connected to it; cell decoder, what prior network and differentiable renderer, wherein: the backbone network preprocesses the input image to obtain the feature map, pres -prediction head, depth prediction head and where-prediction head respectively get z according to the feature map pres Hidden variable, z where Hidden variables and z depth Hidden variable, the spatial transformation network according to the input image and z where Hidden variables undergo spatial transformation processing and output cell information to cat-encoder and what-encoder respectively, and cat-encoder obtains z according ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unsupervised target detection method and a system based on a variational auto-encoder and a Gaussian mixture model, and the method comprises the steps: converting an input image into a feature map of H * W dimensions, namely H * W cells, through a backbone network, coding the feature map into a hidden variable of which the prior distribution accords with the Gaussian mixture model, and carrying out the image reconstruction through a decoder according to the hidden variable, and comparing the reconstructed image with the input image, and calculating a loss function, so that a neural network is trained, an encoder obtains information such as the category and the position of an object in the image, and unsupervised target detection is realized. According to the method, the space attention mechanism and the Gaussian mixture model are combined, end-to-end target detection and classification can be achieved, meanwhile, good performance is still achieved under the condition that a large number of objects exist, and good expansibility is achieved.

Description

technical field [0001] The present invention relates to a technology in the field of artificial intelligence, in particular to an unsupervised target detection method and system based on a variational autoencoder and a Gaussian mixture model. Background technique [0002] Current supervised learning still requires large labeled datasets, and this processing requires a lot of work, making useful datasets hard to come by. At the same time, the generality of the deep learning model obtained by supervised learning is poor, and the performance on different data sets may be attenuated. In contrast, the biggest feature of unsupervised learning is that it does not need to label the data, which greatly reduces the workload. At the same time, unsupervised learning is committed to obtaining a general model and improving the versatility of the model. In the field of unsupervised learning, the variational autoencoder is a very important framework, although the AIR (Attend, Infer, Repea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/088G06V2201/07G06N3/045G06F18/23G06F18/214G06F18/24
Inventor 沈耀朱伟劲余林峰
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products