Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

183 results about "Non maximum suppression" patented technology

A pedestrian and vehicle detection method and system based on improved YOLOv3

The invention discloses a pedestrian and vehicle detection method and system based on improved YOLOv3. According to the method, an improved YOLOv3 network based on Darknet-33 is adopted as a main network to extract features; the cross-layer fusion and reuse of multi-scale features in the backbone network are carried out by adopting a transmittable feature map scale reduction method; and then a feature pyramid network is constructed by adopting a scale amplification method. In the training stage, a K-means clustering method is used for clustering the training set, and the cross-to-parallel ratio of a prediction frame to a real frame is used as a similarity standard to select a priori frame; and then the BBox regression and the multi-label classification are performed according to the loss function. And in the detection stage, for all the detection frames, a non-maximum suppression method is adopted to remove redundant detection frames according to confidence scores and IOU values, and an optimal target object is predicted. According to the method, a feature extraction network Darknet-33 of feature map scale reduction fusion is adopted, a feature pyramid is constructed through feature map scale amplification migration fusion, and a priori frame is selected through clustering, so that the speed and precision of the pedestrian and vehicle detection can be improved.
Owner:NANJING UNIV OF POSTS & TELECOMM

Pedestrian detection and tracking method based on accelerated area Convolutional Neural Network

The invention relates to a pedestrian recognition and tracking method based on an accelerated area Convolutional Neural Network. Firstly, training and testing data set are preprocessed according to the requirements through a robot with an infrared camera to acquire a training dataset and a testing dataset at night, and then, actual target position labeling is conducted on all training and testing photos and is recorded to a sample file; then, the accelerated area Convolutional Neural Network is constructed, the accelerated area Convolutional Neural Network is trained by using the training dataset, and the final probability belonging to a pedestrian area and a bounding box of the area are calculated out from network output by the usage of a non-maximum suppression algorithm; the accuracy of the network is tested by the usage of the testing dataset, and a network model consistent with the requirements is obtained; photos collected by the robot at night are input to an accelerated area Convolutional Neural Network model, and the probability belonging to the pedestrian area and the bounding box of the area are online output by a model in real time. According to the pedestrian detection and tracking method based on the accelerated area Convolutional Neural Network, a pedestrian in an infrared image can be effectively recognized, and real-time tracking for a pedestrian target in an infrared video can be achieved.
Owner:DONGHUA UNIV

An aluminum material surface defect detection algorithm based on deep learning

The invention relates to an aluminum material surface defect detection algorithm based on deep learning, and the algorithm comprises the steps: (1) employing a camera to shoot the surface of an aluminum material, obtaining a related data set, employing a labelImg tool to label an image, and obtaining label information; (2) dividing the image into a training set and a test set, and performing dataenhancement on the training set; (3) inputting a defective image, a non-defective image and label information of the defective image into the network at the same time every time to carry out model training; and (4) inputting the test image into the trained aluminum material surface defect detection model, and obtaining the position and the corresponding category of the defect. According to the method, a defective image and a non-defective image can be effectively utilized; the generalization ability and the detection precision of the model are improved, the detection performance is further improved by fully utilizing context information around the candidate region, the detection performance of dense small defects can be improved by utilizing a soft non-maximum suppression algorithm, and the method is an efficient aluminum material surface defect detection algorithm.
Owner:SUN YAT SEN UNIV

A defect target detection method based on an attention mechanism

The invention belongs to the technical field of defect detection, and discloses a defect target detection method based on an attention mechanism, which comprises the following steps: marking various defects of all pictures in an original data set to obtain a standard training data set with marks; Obtaining a training label according to the standard training data set, determining a loss function, obtaining a network model, and performing training by using a reverse conduction method to obtain a defect regression detection network model based on an attention mechanism and having enhanced defectpart weight; Performing classification prediction and regression prediction on the to-be-detected picture by utilizing the defect regression detection network model; Carrying out non-maximum suppression processing on the predicted defect bounding box and filtering the defect bounding box to obtain an output result; According to the method provided by the invention, the weight of the defect area isimproved through an attention mechanism, so that the defect detection precision is improved; The industrial product surface defect classification and regression detection method can be applied to other types of surface defect detection frameworks to improve the detection precision, and is high in universality.
Owner:WUHAN JINGCE ELECTRONICS GRP CO LTD +1

Non-segmented character positioning and identification method based on deep learning

The invention discloses a non-segmented character positioning and identification method based on deep learning. The non-segmented character positioning and identification method comprises steps of constructing a deep convolution neural network, wherein the deep convolution neural network comprises a universal convolution network, a candidate positioning network and a classification identification network, constructing a target function in order to realize global end-to-end training of a whole network, adopting an artificially-calibrated training set and a progressive-combined training mode to perform training on the network, using a network obtained through training to extract possible areas of a plurality of characters in a test image and a classified identification result when applied to a test, and performing non-maximum suppression on a result obtained by the network and performing post-processing on score threshold determination to obtain a final detection result. The non-segmented character positioning and identification method is simple, does not need to perform character segmentation pre-processing, can be compatible with a plurality of character forms, has a strong capability of resisting background interference and can be used as a general character detection method.
Owner:南京汇川图像视觉技术有限公司

Deep convolutional neural network-based human face occlusion detection method

ActiveCN106485215AAccurate occlusion detectionJudging the occlusionCharacter and pattern recognitionNoseMultilayer perceptron
The invention discloses a deep convolutional neural network-based human face occlusion detection method. The method comprises the steps of performing block segmentation on an input image to obtain a target pre-selected region; constructing a first deep convolutional neural network, training the first deep convolutional neural network comprising a first deep convolutional network and a first multilayer perceptron connected with the first deep convolutional neural network to obtain required parameters, extracting features of the target pre-selected region, and performing classification; predicting the position of a human head through a second multilayer perceptron according to the extracted features; filtering the credibility of a classification type which is the human head and the predicted position of the human head through non-maximum suppression to remove an overlapped duplicate detection box; and obtaining a human head block in combination with original image segmentation, constructing a multi-task learning policy-based second deep convolutional neural network, and judging whether the left eye, the right eye, the nose and the mouth of the human head block are occluded or not. According to the method, the occluded human face can be accurately detected and the specific occluded part of the human face can be judged; and the method is mainly used for crime pre-warning of videos of a camera in front of an automatic teller machine.
Owner:XIAN JIAOTONG LIVERPOOL UNIV

Steel rail tread defect recognition method based on combination of gray image and depth image

The invention discloses a steel rail tread defect recognition method based on combination of a gray image and a depth image. The method comprises the steps of establishing a steel rail tread image data set by utilizing a registered gray image and depth image pair of a steel rail tread, dividing the data set into a training sample set and a test sample set, and preprocessing the images of the dataset; performing defect recognition by adopting a special convolutional neural network structure, wherein a front end of the network is provided with two branch structures, which can extract features from the gray image and the depth image of the steel rail tread respectively; fusing feature information of the gray image and the depth image through feature graph connection, and then outputting preliminary prediction results by adopting a prediction module; and finally, screening the preliminary defect prediction results by adopting a non-maximum suppression method to obtain a final steel rail defect recognition result. The method combines two-dimensional and three-dimensional features of the steel rail tread at the same time, has the capability of distinguishing real defects and pseudo-defects, can reduce the misjudgment rate and the missing detection rate, and is especially suitable for defect recognition in a complex environment.
Owner:BEIHANG UNIV

Scene text detection method based on end-to-end full convolutional neural network

The present invention discloses a scene text detection method based on an end-to-end full convolutional neural network, which is used for the problem of finding a multi-directional text position in animage of a natural scene. The method specifically comprises the following steps: obtaining a plurality of image data sets for training scene text detection, and defining an algorithm target; carryingout feature learning on the image by using a full convolution feature extraction network; predicting an affine transformation matrix in an instance level for each sample point on the feature map, andcarrying out feature expression on the text according to the predicted affine transformation deformation sampling grid; classifying feature vectors of a candidate text, and carrying out coordinate regression and affine transformation regression to jointly optimize the model; using the learning framework to detect the precise position of the text; and carrying out non-maximum suppression on the bounding box set output by the network to obtain a final text detection result. The method disclosed by the present invention is used for scene text detection of real image data, and has a better effectand robustness for multi-directional, multi-scale, multi-lingual, shape distortion and other complicated situations.
Owner:ZHEJIANG UNIV

Wheat field weed detection method based on deep learning

The invention discloses a wheat field weed detection method based on deep learning, and the method comprises the steps: collecting a large number of wheat and wheat field main weed pictures at different growth stages, building a data set, and dividing the data set into a training set and a test set; inputting the training set into a preset convolutional neural network model for training through atransfer learning method to obtain a crop weed classification recognizer, and testing the crop weed classification recognizer by using a test set to obtain a classification recognition result so as toperform fine adjustment; generating a large number of interest domains with different sizes on the to-be-detected picture by adopting a sliding window method, and inputting each interest domain intoa crop weed classification recognizer for classification and recognition to obtain a corresponding prediction category and a correct probability; and screening out an interest domain corresponding tothe local maximum correct probability of each type from all interest domains by applying a non-maximum suppression algorithm, and outputting a classification and positioning prediction result. According to the method, crops and weeds can be quickly and accurately identified and positioned, and the requirement for data is low.
Owner:WUHAN UNIV

A driving scene vehicle detection method based on an SSD neural network

The invention discloses a driving scene vehicle detection method based on an SSD neural network, comprising the following steps: constructing a data set and dividing the data set into a training set and a test set; based on Caffe's deep learning framework, using a SqueezeNet as a feature extraction network; selecting and merging six convolution layers of SqueezeNet network to be detected; after merging the six convolution layers of SqueezeNet network to be detected, adding a position regression layer and a class confidence discrimination layer to complete the construction of the training network model; obtaining a network pre-training model by initializing the training network model; using the network pre-training model, and obtaining the final training model by using the DSD method to carry out multiple rounds of training on the produced data set; capturing the forward image and inputting the image into the final training model, then using the non-maximum suppression algorithm to remove the redundant detection box so as to the detection results. The invention can quickly and accurately detect the vehicle target in front of the vehicle, and is a powerful measure for improving the environment perception ability of the intelligent driving vehicle.
Owner:CHONGQING UNIV

A multi-scale target detection method fusing context information

The invention discloses a multi-scale target detection method fusing context information, and the method comprises the steps of extracting the characteristics of an input image through employing a deep residual convolutional neural network, and obtaining a candidate box set which corresponds to the input image and is used for target detection through employing an RPN network and an improved non-maximum suppression method; for each candidate box, extracting to obtain convolutional features output by the deep residual convolutional neural network, and extracting the convolutional features outputted by the last convolutional layer of the deep residual convolutional neural network in four directions of upper, lower, left and right twice by adopting an LSTM method to obtain context feature information; performing regularization and splicing operation on the context information and the convolutional features to obtain multi-scale features fused with the context information; converting the multi-scale features into high-dimensional feature vectors by using a full connection layer, and carrying out target classification and border position detection by using a classification layer and a regression layer. The method has the advantages of being high in precision, good in robustness and strong in adaptability for target detection.
Owner:NANJING UNIV OF POSTS & TELECOMM

Target detection method, system and related equipment of underwater vehicle

The invention relates to the field of robot vision, pattern recognition and machine learning, in particular to an underwater robot target detection method, a system and related equipment, aiming at improving the robustness of target detection technology to underwater target occlusion, deformation and illumination changes. The object detection method of the invention comprises the following steps:obtaining an original image to be detected; normalizing the pixel value of the original image to be detected, and obtaining the image to be detected after preprocessing; the preprocessed image being input into the target detection network for detection, and the bounding frame of the region of interest and the probability of belonging to each target class being obtained; according to the bounding box of ROI and the probability of belonging to each target class, the improved non-maximum suppression algorithm being used to obtain the bounding box and the class of the target object, wherein, a deformable convolution neural network is used to extract feature map in the target detection network, and the candidate region method is used to detect the target. The detection method of the invention improves the detection precision under the condition of guaranteeing the speed.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products