Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

309 results about "Feature aggregation" patented technology

Point cloud data classification method based on deep learning

ActiveCN110197223AGuaranteed affine transformation invarianceExcellent division effectCharacter and pattern recognitionPoint cloudData set
The invention discloses a point cloud data classification method based on deep learning. The method provides a multi-scale point cloud classification network, and comprises the steps of firstly, providing a multi-scale local area division algorithm on the basis of completeness, adaptivity, overlap and multi-scale characteristic requirements of the local area division, and obtaining a multi-scale local area by taking the point cloud and the characteristics of different levels as input; and then constructing the multi-scale point cloud classification network comprising a single-scale feature extraction module, a low-level feature aggregation module, a multi-scale feature fusion module and the like. The network fully simulates the action principle of the convolutional neural network, and hasthe basic characteristics that the local receptive field becomes larger and larger and the feature abstraction degree becomes higher and higher along with the increase of the network scale and depth.The method of the invention respectively obtains the 94.71% and 91.73% classification accuracies at the standard public data set ModelNet 10 and ModelNet 40, is in a leading or equivalent level in thesimilar work, and the feasibility and effectiveness of the method are verified.
Owner:BEIFANG UNIV OF NATITIES

Cross-mode pedestrian re-identification method and system based on a heterogeneous hierarchical attention mechanism

The invention provides a cross-modal pedestrian re-identification method and system based on a heterogeneous hierarchical attention mechanism, and the method comprises the steps: extracting pedestrianimage features and text description features, and enabling the pedestrian image features and the text description features to serve as initial global features of a pedestrian image channel and a textdescription channel; Establishing a heterogeneous hierarchical attention model, and enhancing pedestrian picture features and text description features by using a bidirectional cross-mode fine-grained matching attention mechanism and a context-guided local feature aggregation attention mechanism by the model; A heterogeneous hierarchical attention model is trained in a two-stage training mode, preliminary training is carried out in the first stage by utilizing pedestrian category supervision information, training in the second stage is carried out by utilizing cross-modal samples to match thepedestrian category supervision information on the basis, and pedestrian re-identification is carried out by utilizing the trained model. The pedestrian re-identification method and device can improve the accuracy of pedestrian re-identification.
Owner:中科人工智能创新技术研究院(青岛)有限公司

Target identification method based on quality evaluation

ActiveCN108765394ASolve the problem of object recognitionValid descriptionImage enhancementImage analysisImaging qualityGoal recognition
The invention provides a target identification method based on quality evaluation. The target identification method based on quality evaluation includes the steps: constructing a target identificationmodel which includes a quality evaluation network, a feature extraction network, and a feature aggregation network, wherein the target identification model is used for extracting the target feature from a video so as to characterize the overall structural information and local information of the target; training the target identification model, and adjusting the parameters of the quality evaluation network and the feature extraction network during the training process so as to enable the target identification model to output the target feature according with the preset demand; and performingtarget identification on the video through the trained target identification model. Therefore, the target identification method based on quality evaluation solves the target identification problem caused by changeable appearance and irregular image quality in a video sequence, and adds the interframe correlation information in quality evaluation so as to obtain more effective target information toenable characterization of the target to be more accurate, thus improving the identification accuracy.
Owner:SHANGHAI JIAO TONG UNIV

Optical flow multilayer frame feature propagation and aggregation method for video target detection

The invention provides an optical flow multilayer frame feature propagation and aggregation method for video target detection, and relates to the technical field of computer vision. The method comprises the following steps: firstly, extracting multilayer features of adjacent frames through a feature network, extracting optical flow through an optical flow network, then propagating multilayer framelevel features of a previous frame of a current frame and a next frame of the current frame to the current frame by utilizing the optical flow, and performing up-sampling or down-sampling on the optical flow by layers with different step lengths to obtain multilayer propagation features; and then sequentially aggregating propagation characteristics of each layer by layer, and finally generating multi-layer aggregated frame level characteristics for final video target detection. According to the optical flow multilayer frame feature propagation and aggregation method oriented to video target detection provided by the invention, the output frame level aggregation feature has the advantages of high shallow network resolution and deep network high-dimensional semantic feature, the detection performance can be improved, and the detection performance of the multilayer feature aggregation method on a small target is improved.
Owner:NORTHEASTERN UNIV

Remote sensing image semantic segmentation method, system and equipment and storage medium

The invention discloses a remote sensing image semantic segmentation method, system, equipment and a storage medium, belongs to the field of image processing, and aims to solve the technical problems of low semantic segmentation precision and low segmentation efficiency. The method comprises the following steps: constructing, training and testing a network, wherein the network is specifically a deep semantic segmentation network of an encoder-decoder structure constructed by a Pytorch deep learning framework; performing network training based on the remote sensing image data sample set; and taking a to-be-measured remote sensing image as network input to obtain a segmentation result of the remote sensing image. On one hand, model parameters are reduced through a bottleneck type module, depth separable convolution, asymmetric convolution, convolution with holes and the like, the calculation complexity is reduced, and the time of remote sensing image semantic segmentation is shortened; on the other hand, the semantic segmentation precision is improved through multi-scale feature aggregation and a mixed attention module, so that the provided remote sensing image semantic segmentation network can accurately and efficiently realize the semantic segmentation of the remote sensing image.
Owner:HUANENG CLEAN ENERGY RES INST +2

Pedestrian detection method and apparatus based on Haar-like intermediate layer filtering features

The present invention discloses a pedestrian detection method based on Haar-like intermediate layer filtering features, comprising the steps of: extracting object features of various training images in a training image set, training an Adaboost classifier based on decision trees by using the extracted object feature data to obtain a classification model; extracting object features of an image to be detected under a plurality of scales and inputting the object features to the classification model to obtain a pedestrian detection result, wherein a method for extracting the object features comprises the following steps: respectively extracting a plurality of different channel features of an original image to obtain multiple channel feature patterns of the original image; respectively performing downsampling for each channel feature pattern; respectively extracting corresponding Haar-like features of each channel feature pattern which has been subjected to the downsampling by using a group of preset Haar-like feature templates; and clustering all the Haar-like features of the original image into the object features of the original image. The invention also discloses a pedestrian detection apparatus based on Haar-like intermediate layer filtering features. The pedestrian detection method and the pedestrian detection apparatus can effectively improve pedestrian detection performance.
Owner:SOUTHEAST UNIV

Graph embedding method and device, and storage medium

The invention provides a graph embedding method and device, and a storage medium, and the method comprises the steps: reading graph structure data and node feature values in a target graph, and building a graph structure model; regarding each node in the graph structure model as a target node, and sampling a first-order neighbor node of each target node according to the non-uniform neighbor node sampling function to obtain a first-order neighborhood of each target node; constructing second-order neighborhoods of the target nodes according to the first-order neighborhoods of the target nodes, aggregating the second-order neighborhoods to the first-order neighborhoods corresponding to the target nodes, and inputting the aggregated features of the second-order neighborhoods into the fully-connected neural network to obtain new features of the first-order neighborhoods of the target nodes; and aggregating the new features to the corresponding target nodes, and inputting the aggregated newfeatures of the first-order neighborhood into the fully-connected neural network to obtain output features of the target nodes. The neighborhood can be flexibly and effectively constructed for each node in the graph, and feature aggregation can be rapidly carried out, so that the graph embedding effect based on the graph neural network is improved.
Owner:GUILIN UNIV OF ELECTRONIC TECH

Visual rich document information extraction method for actual OCR scene

ActiveCN112801010ASolve the problem of extraction error accumulationDecoupling prediction resultsSemantic analysisSpecial data processing applicationsPattern recognitionNamed entity classification
The invention discloses a visual rich document information extraction method for an actual OCR scene. The method comprises the following steps: collecting a visual rich text image in the actual scene; extracting text word embedding features and position embedding features of character levels and word levels by utilizing a pre-training word embedding model; training a named entity classification module; constructing a global document graph structure based on graph convolution GAT, and introducing a self-attention mechanism; training a named entity boundary positioning module; constructing a multi-feature aggregation structure; and training an error semantic correction module, adopting a decoding structure of a GRU, extracting a coding hidden state of a corresponding dimension feature according to an optimal path of a CRF, and guiding output of a decoder every time by taking category information of a named entity as prior guidance information to obtain entity naming information in a standard format. According to the visual rich document information extraction method, the precision of the visual rich document information extraction method in actual OCR detection and recognition application is effectively improved, and the visual rich document information extraction method is of great significance to structured storage of visual rich document information.
Owner:SOUTH CHINA UNIV OF TECH

A social network rumor identification method based on feature aggregation

ActiveCN109685153ASolve the problem that it is difficult to deal with heterogeneous informationSolve problems that are difficult to feed into machine learning modelsData processing applicationsCharacter and pattern recognitionStudy methodsData quality
The invention discloses a social network rumor identification method based on feature aggregation, and the method comprises the steps: designing time sequence propagation mode features acceptable by adeep neural network and time sequence text features, constructing a rumor detection model by using a feature aggregation technology, and carrying out the final detection and early detection of a rumor. The problem that propagation mode characteristics of social network event propagation are difficult to serve as machine learning model input is solved, the propagation mode characteristics do not depend on characteristic engineering and field knowledge, the influence of various factors in the actual propagation process is comprehensively embodied, and the method can be effectively applied to different rumor identification scenes; The defect that the quality of feature data is reduced due to huge difference of the number of messages contained in different samples is avoided, the problem thata single model is difficult to deal with heterogeneous information in a traditional machine learning method is solved, and compared with an existing rumor identification method, the accuracy is obviously improved.
Owner:WUHAN UNIV

Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal

PendingCN111967310AImprove pedestrian recognition rateBiometric pattern recognitionNeural architecturesTime domainAlgorithm
The invention provides a spatial-temporal feature aggregation method and system combined with an attention mechanism, and a terminal, and the method comprises the steps: extracting the spatial domainfeatures of a pedestrian in a deep network through a convolutional neural network, and obtaining the time domain features of the pedestrian through the spatial domain features comprehensively extracted through a recurrent neural network; respectively generating corresponding quality-sensitive attention scores and frame-sensitive attention scores by adopting a feature extraction network so as to dynamically fuse spatial domain and time domain features; carrying out linear superposition fusion to obtain quality-sensitive spatial domain features and frame-sensitive time domain features to obtainpedestrian space-time feature expression; carrying out network training on the upper, middle and lower parts of a pedestrian to obtain corresponding local features with complementary properties, and obtaining feature expressions with higher discrimination through splicing. The method and system have good robustness, and can better solve and adapt to the conditions of shielding, light change and the like; and by combining the spatial domain and time domain characteristics of the pedestrian, the detail characteristics of the pedestrian are mined, so that the method and system can play better performance and efficiency in the next step of pedestrian recognition.
Owner:SHANGHAI JIAO TONG UNIV

Panoramic segmentation method with bidirectional connection and shielding processing

The invention discloses a panoramic segmentation method with bidirectional connection and shielding processing. The method comprises the following steps: 1) obtaining a data set for training panoramicsegmentation, and defining an algorithm target; 2) performing feature learning on intra-group images by using a full convolutional network; 3) extracting semantic features from a feature map throughsemantic feature extraction branches; 4) extracting instance features from the feature map through instance feature extraction branches; 5) establishing connection from instance segmentation to semantic segmentation, and aggregating the semantic features and the instance features to perform semantic segmentation; 6) establishing connection from semantic segmentation to instance segmentation, and aggregating the instance features and the semantic features to perform instance segmentation; and 7) using an occlusion processing algorithm, fusing the results of semantic segmentation and instance segmentation, and outputting the result of panoramic segmentation. According to the method, the complementarity between the semantic segmentation and the instance segmentation is fully utilized, meanwhile, the occlusion processing algorithm provided by the apparent information of the bottom-layer features is applied, and the panoramic segmentation of the image is efficiently completed.
Owner:ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products