Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

134results about How to "Enhanced Feature Representation" patented technology

Vehicle type recognition method based on CNN and domain adaptive learning

The invention relates to a vehicle type recognition method based on a CNN and domain adaptive learning. The method comprises steps of: establishing a CNN-based initial model by adding a rotation-invariant layer in an Alexnet network, distinguishing a discriminant layer and designing a new objective function; separately extracting the feature maps of different-domain sample convolution layers by using the established initial model, calculating the cosine similarity between the sample feature maps, determining the shared convolution kernel or the non-shared convolution kernel of the CNN, retaining the weight and the offset of the shared convolution kernel, and updating the weight and the offset of the non-shared convolution kernel; based on a target-domain training sample, calculating the cosine similarity between respective feature map layers and the average similarity of the entire target domain, and clustering each type of similar feature maps according to the average similarity; expanding source-domain samples with similar distribution characteristics in the target domain to new samples in the target domain, adjusting the entire CNN model by using the new samples in the target domain, and then using a softmax classifier to classify the vehicle types of the test samples in the target domain.
Owner:NANJING UNIV OF INFORMATION SCI & TECH

Long-tail distribution image data identification method based on dual-channel learning

The invention discloses a long-tail distribution image data identification method based on dual-channel learning. The method comprises the following steps: 1) constructing a dual-channel learning model combining unbalanced learning and small sample learning; 2) updating all parameters in the dual-channel learning model by utilizing dual-channel learning total loss and back propagation, and storingoptimal dual-channel learning model parameters; and 3) inputting the image data of the test set to the optimal dual-channel learning model, and obtaining the prediction label of the image. Accordingto the invention, unbalanced learning and small sample learning are combined to solve the problem of long-tail distribution image data identification; the unbalanced learning channel can improve the identification accuracy of the unbalanced data set; the small sample learning channel can improve the feature representation of model learning, and the dual-channel total loss enables the model to focus on the unbalanced learning channel in the early stage of training and focus on the small sample learning channel in the later stage of training, thereby improving the recognition accuracy of the long-tail distribution image data on the whole.
Owner:SOUTH CHINA UNIV OF TECH

Android malicious software detection method and device based on capsule network

The invention belongs to the technical field of network security, and particularly relates to an Android malicious software detection method and device based on a capsule network, and the method comprises the steps: collecting an Android software file sample, decompressing a to-be-processed file, converting the to-be-processed file into an RGB three-channel color image, and enabling the RGB three-channel color image to serve as training sample data; constructing a capsule network, and training the capsule network by using the sample data to obtain a trained network model containing a graph structure and network parameters, the capsule network realizing transmission between feature vectors in a capsule layer through an iterative dynamic routing algorithm; and inputting the to-be-detected target file into the trained capsule network model for testing, and judging whether the to-be-detected target file is a malicious software file or not through an output result. The method and device canefficiently run on the Android operation platform, occupy few resources, are high in efficiency and accuracy, can realize high-accuracy classification detection tasks even under the condition of small-scale training samples, and achieve the purpose of protecting the Android intelligent mobile terminal.
Owner:PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU

Remote sensing image target detection method based on multi-scale feature fusion

The invention discloses a remote sensing image target detection method based on multi-scale feature fusion. The method comprises the following steps: constructing a target detection basic network composed of a truncation type VGG and a multi-scale feature map; constructing a multi-scale feature map fusion module composed of a deconvolution module and a prediction module, the multi-scale feature map fusion module being used for extracting context information of a multi-scale feature map in the target detection basic network and generating a multi-scale feature map; predefining a multi-length-to-width ratio and a multi-scale default frame according to the size of the receptive field of the multi-scale feature map, and completing positioning and classification of the target by using the default frame; training a target detection basic network; accessing a multi-scale feature map fusion module to the target detection basic network, fixing parameters of the target detection basic network during training, and only adjusting the parameters of the multi-scale feature map fusion module; and at the last stage, finely adjusting parameters of the target detection basic network and the multi-scale feature map fusion module at the same time to achieve a coupling effect.
Owner:TIANJIN UNIV

Pedestrian target detection and recognition method based on monocular vision and deep learning

The invention belongs to the technical field of pedestrian target detection, and discloses a pedestrian target detection and recognition method based on monocular vision and deep learning. The methodincludes: establishing a small sample pedestrian data set, and collecting road pedestrian images in a real scene; performing pedestrian detection based on the whole image candidate and a single regression target detection algorithm based on depth features; finely adjusting weight parameters of a higher layer of the network on the VOC data set and the small sample pedestrian data set through secondary transfer learning; based on multi-scale pyramid image feature extraction with consistent phases, extracting contour features of pedestrian images, and obtaining a multi-scale pyramid feature map;and adopting a balanced focus loss function to replace a cross entropy loss function to measure the classification accuracy of the target. According to the invention, the CNN is used to obtain depth features, a deformable component model is trained, and the detection precision is effectively improved; transfer learning is introduced and can be found by analyzing a hidden layer in the AlexNet model, and the accuracy of pedestrian target detection and recognition is improved.
Owner:SHANDONG VOCATIONAL COLLEGE OF IND

Traffic signal control method and system based on random strategy gradient, and electronic equipment

The invention discloses a traffic signal control method and system based on a random strategy gradient, and electronic equipment. The method comprises steps of obtaining the static road network data of at least one control signal intersection; visually drawing a traffic simulation road network according to the static road network data; acquiring real-time traffic operation state data of at least one control signalized intersection; performing parameter check on simulation parameters in the traffic simulation road network according to the traffic operation state data to obtain an optimized traffic simulation road network; inputting a traffic state obtained by observing the optimized traffic simulation road network into a value network to obtain an evaluation value of each signal control scheme in the traffic state; inputting the traffic state into a strategy network to obtain a probability value of each signal control scheme; and updating parameters of the strategy network through a random strategy gradient based on the evaluation value of each signal control scheme in the traffic state and one signal control scheme. According to the method provided by the invention, a problem of dimension explosion of signal control can be solved.
Owner:DUOLUN TECH CO LTD

Video pedestrian re-identification method based on multi-attention heterogeneous network

The invention discloses a video pedestrian re-identification method based on a multi-attention heterogeneous network, and belongs to the field of image processing. The method comprises the following steps: constructing and training a multi-attention heterogeneous network; and performing feature extraction on the video of the known pedestrian ID and the video of the undetermined pedestrian ID by using the trained network, and the pedestrian ID is judged according to the cosine distance between the two features. According to the method, Soft attention and non-local attention are introduced intothe OSNet network; according to the method, the Soft attention is utilized to pay attention to pedestrian area features in an image, the learning ability of non-local attention to space-time featuresin a video sequence is utilized to improve feature representation of the video sequence, features which are more robust and more discriminative are extracted, and the recognition accuracy is improved.And meanwhile, the features of the specific frame are selected as local feature learning network branches, so that the learning of pedestrian local features is enhanced while the pedestrian global features in the video sequence are learned, and the performance of the network in video pedestrian re-identification is improved.
Owner:HUAZHONG UNIV OF SCI & TECH

Group behavior identification method based on multi-modal information fusion and decision optimization

The invention discloses a group behavior recognition method based on multi-modal information fusion and decision optimization, and the method comprises the steps: firstly obtaining a group member candidate box sequence for a to-be-recognized video, extracting the corresponding optical flow features, and extracting the human body posture segmentation features as a third visual clue; then acquiringa double-flow model of the spatial and temporal features of the human body target and performing multi-modal information fusion (MMF) on the double-flow model; and finally, connecting the two branchesobtained after MMF fusion with a GRU, and performing decision optimization by adopting a multi-classifier fusion method based on adaptive category weight, thereby obtaining a group behavior label. According to the scheme of the invention, during feature fusion, an MMF feature fusion algorithm is designed, so that space-time features supplement each other, information supplements each other, and finally better feature representation is obtained; in the aspect of decision optimization, a multi-classification fusion method based on self-adaptive class weights is designed, classifier acceptance and rejection and each class weight are calculated more accurately, and therefore high recognition precision is obtained.
Owner:QINGDAO UNIV OF SCI & TECH

Hyperspectral image classification method based on spectral space attention fusion and deformable convolutional residual network

The invention relates to a hyperspectral image classification method, and concretely relates to a hyperspectral image classification method based on spectral space attention fusion and a deformable convolutional residual network. The objective of the invention is to solve the problem of low classification accuracy of hyperspectral images caused by insufficient spectrum and spatial feature extraction and overfitting under small samples due to the fact that the hyperspectral images contain abundant information in the existing hyperspectral image classification. The method comprises the following steps: 1, acquiring a hyperspectral image data set and a corresponding label vector data set; 2, establishing an SSAF-DCR network based on spectrum space attention fusion and a deformable convolution residual error; 3, inputting the x1, the x2, the Y1 and the Y2 into a network SSAF-DCR, and performing iterative optimization by adopting an Adam algorithm to obtain an optimal network; and 4, inputting x3 into the optimal network to carry out classification result prediction. The method is applied to the field of hyperspectral image classification.
Owner:QIQIHAR UNIVERSITY

Cloth defect detection method and system based on deep neural network

ActiveCN111462051AHigh location informationGood semantic informationImage enhancementImage analysisEngineeringNetwork model
The invention discloses a cloth defect detection method and system based on a deep neural network, and belongs to the technical field of pattern recognition. The method comprises the steps that a defect cloth image training set is used for training a deep neural network model, labels are defect types and real frame position information, the deep neural network model is composed of a backbone network and a detection network, and the backbone network is used for extracting three feature maps with different scales from defect cloth images; the detection network includesthree detection sub-networks and the detection result fusion module, wherein the three detection sub-networks are the same in structure. Each detection sub-network is used for detecting a defect type and prediction frame position information from the feature map, and consists of three dense connecting blocks, and the feature channel connection between the dense blocks is used for enhancing feature transfer, and the detection result fusion module is used for performing non-maximum suppression on the prediction result to obtain a final prediction frame and a defect type, and inputting to-be-detected cloth into the traineddeep neural network model to obtain a detection result, so that the type and the position of the defect in the cloth can be detected more quickly and accurately.
Owner:HUAZHONG UNIV OF SCI & TECH

Multi-scale face age estimation method and system embedded with high-order information

The invention relates to the field of face age estimation, in particular to a multi-scale face age estimation method and system embedded with high-order information, and the multi-scale face age estimation method comprises the steps: inputting a face image, and carrying out the preprocessing of the face image; inputting the face image into a residual network to perform global feature extraction soas to construct a global branch; inserting blocks for extracting high-order age information at different positions of the global branch; taking the output feature map of the first convolution layer of ResNets as the input of a long-term and short-term memory network, obtaining the position information of the age-sensitive region, and obtaining a local feature map through cutting to construct a local branch; minimizing a loss function through back propagation, performing joint optimization on the two branches, and performing iterative training on the neural network; and inputting the test setinto a trained neural network model, and calculating and outputting a final predicted age according to the age characteristics. The multi-scale face age estimation method and system have the advantages that the network model provided by the invention is relatively low in calculation cost and high in precision, and the applicability of related products is relatively high.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Image segmentation method, device and apparatus and storage medium

The embodiment of the invention provides an image segmentation method, device and apparatus, and a storage medium. The method comprises: extracting an initial feature map of a to-be-processed image through a base network of an image segmentation model; pooling the initial feature map through an average pooling sub-model of the image segmentation model to obtain a first feature map carrying short-distance dependency relationship information; processing the initial feature map through at least one branch sub-model to obtain at least one target feature map, wherein the target feature map comprises a second feature map carrying global dependency relationship information and/or a third feature map carrying long-distance dependency relationship information; and cascading the first feature map with the target feature map, and then performing convolution to obtain an image segmentation result and outputting the image segmentation result. According to the method, the branch sub-models are arranged in parallel with the average pooling sub-model to obtain the feature map carrying global dependency relationship information and/or long-distance dependency relationship information, and the feature map carrying global dependency relationship information and/or long-distance dependency relationship information is cascaded with the feature map carrying short-distance dependency relationship information obtained by the average pooling sub-model, so that the feature representation capability is enhanced, and the image segmentation accuracy is improved.
Owner:上海眼控科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products