Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

72 results about "Fisher vector" patented technology

In Computer Vision, a Fisher Vector (see paper here) can be used to describe an entire image for image classification. High level picture: The Fisher Vector (FV) encodes the gradients of the log-likelihood of the features under the GMM, with respect to the GMM parameters.

Track and convolutional neural network feature extraction-based behavior identification method

The invention discloses a track and convolutional neural network feature extraction-based behavior identification method, and mainly solves the problems of computing redundancy and low classification accuracy caused by complex human behavior video contents and sparse features. The method comprises the steps of inputting image video data; down-sampling pixel points in a video frame; deleting uniform region sampling points; extracting a track; extracting convolutional layer features by utilizing a convolutional neural network; extracting track constraint-based convolutional features in combination with the track and the convolutional layer features; extracting stack type local Fisher vector features according to the track constraint-based convolutional features; performing compression transformation on the stack type local Fisher vector features; training a support vector machine model by utilizing final stack type local Fisher vector features; and performing human behavior identification and classification. According to the method, relatively high and stable classification accuracy can be obtained by adopting a method for combining multilevel Fisher vectors with convolutional track feature descriptors; and the method can be widely applied to the fields of man-machine interaction, virtual reality, video monitoring and the like.
Owner:XIDIAN UNIV

Human movement significant trajectory-based video classification method

The invention relates to a human movement significant trajectory-based video classification method. The human movement significant trajectory-based video classification method comprises the following steps that: a video set M is divided into a training set Mt and a test set Mv, and human movement information in each video is tracked in a multi-scale space by using SIFT and dense optical flow technologies, so that movement significant trajectories in each video can be obtained; feature description vectors of each trajectory are extracted; redundant information in the feature description vectors are eliminated through using a PCA method, and dimension reduction is performed on each class of feature description vectors; the feature description vectors in the training set Mt are clustered by suing a Gauss mixture model, and then, Fisher vectors of each video in the video set M are generated by using a Fisher Vector method; a linear SVM classification model is constructed on the training set Mt; and on the test set Mv, videos in the test set are classified through using the linear SVM classification model. Compared with the prior art, the human movement significant trajectory-based video classification method of the invention has the advantages of excellent robustness, higher computational efficiency and the like.
Owner:DEEPBLUE TECH (SHANGHAI) CO LTD

Improved method for learning discriminative segments in fine-grained identification

The invention discloses an improved method for learning discriminative segments in fine-grained identification. The method comprises the following steps of: extracting segments with discrimination properties in an original image: obtaining a feature map by the original image through a convolutional pooling layer in a convolutional neural network, considering a vector of each space fixed position as a detector of a corresponding position segment in the original image, assuming that a detector, a discriminative region of which has highest response, in the original image is learnt, carrying out convolutional operation on the detector and the feature map so as to obtain a new response map, and selecting a position with a maximum value in the new response map so as to obtain the segments with discriminative properties; and learning features of the segments with the discriminative properties and carrying out classification by using the features: obtaining a local saliency map according to the segments with the discriminative properties, and encoding the local saliency map by using a space weighted Fisher vector. The method is capable of learning discriminative features more suitable forfine-grained identification tasks, and decreasing the interferences of background information in the discriminative segments so as to improve the classification precision.
Owner:TIANJIN UNIV

Figure behavior identification method based on random projection and Fisher vectors

The invention discloses a figure behavior identification method based on random projection and Fisher vectors. The method employs a random projection theorem method to replace a principal component analysis method for characteristic dimension reduction, for the purpose of solving the problems of large time consumption, indeterminate reservation of principle components and the like. A random projection theorem indicates that through a compression measurement matrix, original signals with a sparse property can be projected to a certain low-dimension subspace, and the point distance between a vector after mapping and an original high-dimension characteristic vector maintains basically unchanged, i.e., data distortion is not generated in a whole compression process. Besides, different from hard division of a BoW model, the method provided by the invention employs a GMM-Fisher vector hybrid model for soft division of locus characteristic vectors, is integrated with the characteristics of a Fisher nucleus generation mode and a discrimination mode, can calculate the occurrence frequency of each characteristic descriptor, can also describe the probability distribution conditions of these characteristic descriptors in the perspective of statistics, enriches characteristic expression of behavior motion and also improves the behavior identification efficiency.
Owner:NANJING NANJI INTELLIGENT AGRI MASCH TECH RES INST CO LTD

Foundation cloud picture classification method based on heterogeneous feature fusion network

The invention discloses a foundation cloud picture classification method based on a heterogeneous feature fusion network. The foundation cloud picture classification method comprises the following steps: (1) preprocessing a plurality of foundation cloud pictures with noise; (2) features of the processed foundation cloud images are extracted respectively, and a manual feature extraction method is combined with Fisher vector coding to obtain a feature vector corresponding to each foundation cloud image; (3) inputting the output of the step (2) into a four-layer full connection layer network, wherein the output of the network is recorded as fc; and (4) after the training set is amplified, training a convolutional neural network model, fusing fc with the deep semantic feature fg obtained by the last pooling layer, and obtaining a classification probability corresponding to each class through a full connection layer. According to the method, the generalization ability of a foundation cloudpicture classification and recognition task can be remarkably improved, the robustness of the model is high, visual information is combined from multiple angles, the cloud shape can be accurately positioned even if noise is manually added, and a good recognition result is obtained.
Owner:HOHAI UNIV

Mobile visual search framework based on CRBM and Fisher network

The invention provides a mobile visual search framework based on a CRBM and Fisher network and relates to the image retrieval of a mobile terminal. The method comprises the following steps of 1) constructing and training a continuous restricted Boltzmann machine network; 2) constructing and training a Fisher layer network. According to the invention, the sub-space feature information of the localfeature nature of non-Gaussian distribution is found by adopting the nonlinear dimensionality reduction algorithm CRBM in the aggregation global compact binary feature algorithm. Meanwhile, more efficient global features are obtained through adopting a fisher-based network structure aggregation fisher vector. A compact self-adaptive feature is obtained by adopting the scalar quantization algorithmand the bit self-adaptation algorithm. Therefore, the length of the transmitted image feature information can be selected according to different self-adaptive selections of the network bandwidth of amobile terminal. During the retrieval stage, global features are roughly matched to obtain a candidate set, and the accurate matching of the geometric consistency test is conducted by adopting localfeatures. Therefore, the mobile visual search framework can be adapted to large-scale image retrieval tasks.
Owner:XIAMEN UNIV

Image classification method based on noise reduction sparse automatic encoder and density space sampling

ActiveCN108416389ALearning robustnessSolve the problem that requires a large number of manually labeled samplesCharacter and pattern recognitionHidden layerData set
The invention discloses an image classification method based on a noise reduction sparse automatic encoder and density space sampling. The image classification method based on a noise reduction sparseautomatic encoder and density space sampling includes the steps: constructing an image block training set; constructing a noise reduction sparse automatic encoder of single hidden layer, inputting the image block training set, and training the noise reduction sparse automatic encoder; performing density space sampling on each image in a training image data set and a test image data set; using thenoise reduction sparse automatic encoder to extract local characteristic set information from the space area obtained by performing density space sampling on each image; using two layers of laminatedFisher Vector to encode the characteristic set information, so as to obtain the final Fisher vector of each image; and training a classifier by means of the Fisher vector, so as to realize image classification. The image classification method based on a noise reduction sparse automatic encoder and density space sampling can accurately acquire the image information, can improve the classificationaccuracy of images, and can be used for construction of a large scale image classification and retrieval system.
Owner:YANCHENG TEACHERS UNIV

Compact visual descriptor deep neural network generation model in visual retrieval

The invention discloses a compact visual descriptor deep neural network generation model in visual retrieval, and relates to image retrieval. The model is implemented by the following steps of: constructing a Fisher layer network; constructing a grouping and secondary classification module; training a loss function on the basis of a maximum boundary condition; for image library images and a queryimage, firstly extracting local features of the images, carrying out aggregation and binary embedding on the local features of the images by using a trained network structure so as to obtain binary codes of the images, carrying out matching in an image library according to the binary code of the query image, returning images with high similarities as a matched candidate set, carrying out geometricconsistency inspection on the candidate set by using the local features so as to carry out accurate matching, and returning a final query result. According to the model, the flexible Fisher network is used to aggregate local features of images so as to generate more efficient global feature Fisher vectors, and the grouping and secondary classification module is used to carry out binary coding onthe Fisher vectors so as to obtain compact global binary features.
Owner:XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products