Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

167results about How to "Reduce network parameters" patented technology

Small target detection method based on multi-scale images and weighted fusion loss

The invention belongs to the field of image and video processing, and relates to a small target detection method based on multi-scale images and weighted fusion loss, and the method comprises the steps: extracting a plurality of groups of feature vectors from a plurality of different-scale images based on an improved Mask RCNN model, carrying out the fusion of the plurality of groups of feature vectors, and constructing a feature pyramid; generating a candidate detection box based on the feature pyramid and screening to obtain a suggested detection box; correspondingly returning the suggesteddetection boxes to the feature pyramid to generate feature maps of the suggested detection boxes, and performing aligned interception on the feature maps; inputting the aligned suggested detection boxes into a classifier layer to obtain category confidence coefficients and position offsets of the suggested detection boxes; in the test stage, screening a certain suggested detection box according tothe category confidence score of the suggested detection box, and performing non-maximum suppression; in the training stage, weighting the loss function calculated by detecting the small target feature layer and fusing with the loss function of detecting the large target layer and the middle target layer, thus the sensitivity of the model to the small target object is enhanced.
Owner:SOUTH CHINA UNIV OF TECH

Fundus image optic cup and optic disk segmentation method and system for assisting glaucoma screening

ActiveCN110992382AEfficient Multi-Size ExtractionBoost backpropagationImage enhancementImage analysisInformation processingGlaucoma screening
The invention discloses a fundus image optic disc segmentation method and a system for assisting glaucoma screening, and relates to the technical field of image information processing. The fundus image optic disc segmentation method comprises the steps that a plurality of fundus images are collected and preprocessed, and a training image sample set and a verification image sample set are obtained;training of a constructed W-Net-Mcon full convolutional neural network by using the training image sample set to obtain an optimal W-Net-Mcon full convolutional neural networkis carried out; preprocessing the fundus image to be segmented, and inputting the preprocessed fundus image to be segmented into the optimal W-Net-Mcon full convolutional neural network to obtain a prediction target result image; Processing prediction target result graph by utilizing polar coordinate inverse transformation and ellipse fitting to obtain final segmentation result so as to obtain cup-to-disk ratio and finally obtain glaucoma preliminary screening result. According to the method, image semantic information can be effectively extracted in a multi-size mode, fusion of features of different levels, fusion of global features and detail features and encouragement of feature multiplexing are carried out, gradient back propagation is improved, and the image segmentation precision is improved.
Owner:SICHUAN UNIV

A multi-scale Hash retrieval method based on deep learning

Image pairing information and image classification information are optimized and a Hash code quantization process is used to realize a simple and easy end-to-end deep multi-scale supervision Hash method, and meanwhile design a brand new pyramid connected convolutional neural network structure, and the convolutional neural network structure takes paired images as training input and enables the output of each image to be approximate to a discrete Hash code. In addition, the feature map of each convolution layer is trained, feature fusion is carried out in the training process, and the performance of deep features is effectively improved. A neural network is constrained through a new binary constraint loss function based on end-to-end learning, and a Hash code with high feature representationcapability is obtained. High-quality multi-scale Hash codes are dynamically and directly learned through an end-to-end network, and the representation capability of the Hash codes in large-scale image retrieval is improved. Compared with an existing Hash method, the method has higher retrieval accuracy. Meanwhile, the network model is simple and flexible, can generate characteristics with strongrepresentation ability, and can be widely applied to other computer vision fields.
Owner:SHANDONG UNIV

Driving fatigue detection and early warning system and method based on vision

ActiveCN108791299AOptimize running timeGood early warning robustnessAlarmsControl devicesDriver/operatorVision based
The invention provides a driving fatigue detection and early warning system based on vision. The driving fatigue detection and early warning system comprises a face detection and identity verificationmodule, a normal driving baseline database module, a fatigue driving behavior detection module, a fatigue driving behavior early-warning module and a fatigue driving behavior management and control module; the face detection and identity verification module is used for collecting images of a driving cab and extracting face information and identity information; the normal driving baseline databasemodule is used for establishing and updating fatigue baselines of a driver; the fatigue driving behavior detection module is used for generating fatigue driving ROI based on the extracted face information, and conducting fatigue driving behavior identification; the fatigue driving behavior early-warning module is used for judging the driver fatigue state and the driving fatigue level according tothe monitoring results of the fatigue driving behavior detection module and issuing a fatigue early-warning signal for the driver; and the fatigue driving behavior management and control module is used for triggering driving data records and uploading data according to the fatigue early-warning signal.
Owner:ZHEJIANG LEAPMOTOR TECH CO LTD

Multi-objective fused educational resource personalized recommendation system and method

The invention discloses a multi-objective fused educational resource personalized recommendation system, which is characterized by comprising a recommendation system, and the recommendation system comprises a service layer, a recommendation layer, a strategy layer and a data layer. The data layer comprises data displayed and fed back by a user, a user interest model, resource content and overall resource content. The strategy layer comprises representation features of learner users and representation features of education resources. The recommendation layer comprises an SOM-CNN model, an ITEM-SOM model and a recommendation list generated according to the SOM-CNN model and the ITEM-SOM model. The service layer comprises education resource recommendation services for different users. The idea of integrating multi-task target learning is applied to the educational resource recommendation system, personalized recommendation of educational resources is realized, the recommendation accuracyand diversity are used as two to-be-learned target tasks of the network, and two different network module structures are adopted to concentrate on the accuracy and diversity of recommendation resultsrespectively. And a unified loss function is designed, and synchronous end-to-end learning and training are performed on multiple targets of the network recommendation result.
Owner:贵州开放大学贵州职业技术学院 +1

Expression recognition method based on multi-branch cross-connection convolutional neural network

The invention relates to an expression recognition method, in particular to an expression recognition method based on a multi-branch cross-connection convolutional neural network. The invention aims to solve the problems of low efficiency, serious resource waste and incomplete feature extraction of an existing traditional expression feature extraction method. The method comprises the following steps of: 1, preprocessing a facial expression image data set; 2, a multi-branch cross-connection convolutional neural network is constructed and used for extracting facial expression image features, andthe process is as follows: the multi-branch cross-connection convolutional neural network is composed of a first convolutional layer, a module 1, a module 2, a module 3, a forty-th convolutional layer, a batch standardization BN and a Relu activation function; and 3, classifying the image features extracted by the network by adopting a Softmax classification algorithm, namely connecting a globalmean value pooling after the constructed multi-branch cross-connection convolutional neural network, and carrying out multi-classification by using a Softmax function after a global mean value poolinglayer. The method is applied to the field of expression recognition.
Owner:QIQIHAR UNIVERSITY

Urban sound event classifying method based on N-DenseNet and high-dimensional mfcc features

The invention provides an urban sound event classifying method based on an N-DenseNet and high-dimensional mfcc feature. By the urban sound event classifying method, during audio data processing, richer and more efficient feature information can be provided, ta model has stronger generalization ability, and classification has higher accuracy. The urban sound event classifying method comprises thefollowing steps: S1, acquiring audio data to be processed, preprocessing an original audio signal, and outputting an audio frame sequence; S2, performing time-domain and frequency-domain analysis on the audio frame sequence, extracting a high-dimensional Mel-frequency cepstrum coefficient, and outputting a feature vector sequence; S3, constructing an acoustic model, and training the acoustic modelto obtain a well-trained acoustic model; S4, processing the feature vector sequence output in the step S2, and then inputting into the well-trained acoustic model for classification recognition to obtain a recognition result, namely a classification result of a sound event, wherein the acoustic model is a network model constructed by combining the characteristics of an N-order Markov model on thebasis of a DenseNet model, namely the acoustic model is an N-order DenseNet model.
Owner:JIANGNAN UNIV

Three-dimensional face and eyeball movement modeling and capturing method and system

The invention provides a three-dimensional face and eyeball movement modeling and capturing method and system, and the method comprises the steps: firstly extracting the position information of a feature point of a face in a shot scene image, and obtaining the information through employing a convolutional neural network method; after sparse facial feature points are obtained, a multilinear model of a human face serving as a prior condition of the model, the positions of the feature points detected on the image serving as observation results, and designing a maximum posteriori framework to optimize and solve the geometrical shape and posture of the human face in the image; aiming at the movement of a three-dimensional eyeball, using sparse two-dimensional feature points for obtaining an eyearea image block through cutout, achieving alignment of the image block through position information of the feature points, further marking the areas of an iris and a pupil, and reconstructing the eyeball movement in real time through an analysis method based on synthesis; in combination with the expression parameters obtained in the previous step, combining facial expression motion with eyeballmotion to obtain a complete facial expression animation.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Water turbine runner blade defect detection method based on YoloV4-Lite network

The invention relates to the technical field of water turbine runner blade defect detection, and particularly discloses a water turbine runner blade defect detection method based on a YoloV4-Lite network, and the method comprises the steps: S1, constructing a defect detection network based on the YoloV4-Lite network; S2, performing picture collection on different defect positions of the turbine runner blade to obtain thousand or more defect pictures; S3, preprocessing the defect picture acquired in the step S2 (processing by using LabelImg software according to a Pascal VOC 2012 format) to obtain a data set; and S4, training, testing and verifying the defect detection network by adopting the data set. According to the invention, the backbone extraction network CSPDarkNet53 network of YoloV4-Lite is replaced by the MobileNet network, and the MobileNet network is a real-time lightweight network, so that the network detection speed can be improved, and the network parameters can be greatly reduced. Experimental results show that the accuracy rate of the defect detection network can reach 97.48%, the network parameter quantity of the MobileNetV3 only needs 37.35 MB and is reduced by 206.94 MB compared with that of CSPDarkNet53, the FPS reaches 44.68, and the method has the advantages of being high in accuracy rate, low in memory storage and real-time.
Owner:CHONGQING UNIVERSITY OF SCIENCE AND TECHNOLOGY

Flame target detection method based on digital image and convolution features

Because the generalization of a flame detection model based on image features is not strong, and the requirement of a deep neural network model for the number of training samples is high, the invention provides a flame target detection method based on digital images and convolution features, and the method comprises the steps: firstly making a data set comprising video dynamic features; replacingthe standard convolution of the VGG16 in the classic Faster R-CNN with the depth separable convolution, and reducing the number of convolution layers; cutting 256 image blocks from the original imageaccording to a candidate box generated by the RPN, and extracting LBP features of each image block; reducing the size of an output feature map of the ROI pooling layer and the number of neurons of a full connection layer through convolution, and further reducing network parameters; and finally, combining the extracted LBP features, the dynamic features in the data set and the pooled tiled featurevectors, and sending the combined feature vectors to a full connection layer for classification and regression. The flame target detection model constructed by the patent has relatively high detectionprecision, is convenient to improve for overcoming the defects of a test result, and is high in flexibility.
Owner:NANJING FORESTRY UNIV

Method of recommending personalized treatment scheme for stroke patient

ActiveCN111524571ASolve the problem of inconsistent input lengthReduce training timeTherapiesMedical automated diagnosisMedical recordNerve network
The invention discloses a method of recommending a personalized treatment scheme for a stroke patient. The method comprises the following steps: S1, preprocessing text information about physical examination and evaluation results in electronic medical records of patients; S2, expressing words, sentences and documents in the physical examination and evaluation results in the electronic medical records of the patients in a vector manner; S3, training a neural network model based on document vectors to obtain a personalized treatment scheme recommendation model; and S4, carrying out unified dataexpression, word segmentation and text filtering processing on the physical examination and evaluation results in an electronic medical record of a new patient, then carrying out document vector representation, and inputting represented document vectors into the personalized treatment scheme recommendation model to obtain a recommended personalized treatment scheme. According to the method, evaluation and physical examination information in the electronic medical record of the patient is taken as a document, the process of personalized treatment scheme recommendation is converted into a multi-label classification problem, the personalized treatment scheme can be recommended according to the physical examination results and the evaluation results of the patient, an auxiliary decision is provided for a doctor, and the burden of the doctor is reduced.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Microalgae identification method based on improved YOLOv3

The invention provides a microalgae identification method based on improved YOLOv3, which comprises: collecting microalgae microscopic images, and making a data set of the microalgae images; performing data enhancement on the data set; dividing the enhanced data set into a training set, a verification set and a test set, labeling microalgae in the data set, and generating a labeled image; constructing an improved YOLOv3 target detection model; setting training parameters, and training the constructed YOLOv3 target detection model based on the data set; and classifying and positioning the test set images based on the trained YOLOv3 target detection model. According to the method, an improved YOLOv3 target detection model is adopted, a lightweight Mobilenet network is used for replacing an original feature extraction network darknet53 of YOLOv3, the operation speed can be remarkably increased, network parameters are greatly reduced, meanwhile, a spatial pyramid pool structure SPP is introduced, region features can be combined and connected in the same convolution layer with different scales, and the method is suitable for large-scale detection, so the position error is small when a small object is detected, and the CIoU is used for optimizing the loss function to further improve the detection precision.
Owner:DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products