Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

475results about How to "Reduce overfitting" patented technology

Cascaded residual error neural network-based image denoising method

The invention discloses a cascaded residual error neural network-based image denoising method. The method comprises the following steps of building a cascaded residual error neural network model, wherein the cascaded residual error neural network model is formed by connecting a plurality of residual error units in series, and each residual error unit comprises a plurality of convolutional layers, active layers after the convolutional layers and unit jump connection units; selecting a training set, and setting training parameters of the cascaded residual error neural network model; training the cascaded residual error neural network model by taking a minimized loss function as a target according to the cascaded residual error neural network model and the training parameters of the cascaded residual error neural network model to form an image denoising neural network model; and inputting a to-be-processed image to the image denoising neural network model, and outputting a denoised image. According to the cascaded residual error neural network-based image denoising method disclosed by the invention, the learning ability of the neural network is greatly enhanced, accurate mapping from noisy images to clean images is established, and real-time denoising can be realized.
Owner:SHENZHEN INST OF FUTURE MEDIA TECH +1

Gesture recognition method based on 3D-CNN and convolutional LSTM

The invention discloses a gesture recognition method based on 3D-CNN and convolution LSTM. The method comprises the steps that the length of a video input into 3D-CNN is normalized through a time jitter policy; the normalized video is used as input to be fed to 3D-CNN to study the short-term temporal-spatial features of a gesture; based on the short-term temporal-spatial features extracted by 3D-CNN, the long-term temporal-spatial features of the gesture are studied through a two-layer convolutional LSTM network to eliminate the influence of complex backgrounds on gesture recognition; the dimension of the extracted long-term temporal-spatial features are reduced through a spatial pyramid pooling layer (SPP layer), and at the same time the extracted multi-scale features are fed into the full-connection layer of the network; and finally, after a latter multi-modal fusion method, forecast results without the network are averaged and fused to acquire a final forecast score. According to the invention, by learning the temporal-spatial features of the gesture simultaneously, the short-term temporal-spatial features and the long-term temporal-spatial features are combined through different networks; the network is trained through a batch normalization method; and the efficiency and accuracy of gesture recognition are improved.
Owner:BEIJING UNION UNIVERSITY

Brain tumor segmentation network and segmentation method based on U-Net network

The invention discloses a brain tumor segmentation network and segmentation method based on a U-Net network. The tail of a contraction path of the segmentation network is connected with a spatial pyramid pooling structure; hole convolution of different scales is introduced into a network jump connection part of the segmentation network; an Add operation and original input are adopted to form a residual block with hole convolution; a receptive field of shallow feature information in the contraction path is expanded; fusing with an expansion path of a corresponding stage is carried out. The segmentation method comprises the following steps: cutting and preprocessing a training data set, then constructing a brain tumor segmentation network DCU-Net based on a U-Net network, then inputting a preprocessed two-dimensional image into a segmentation model for feature learning and optimization, obtaining an optimal parameter model of the segmentation model, and finally inputting a to-be-segmented test data set image into the segmentation model for tumor region segmentation. According to the method, the problems of over-segmentation and under-segmentation in brain tumor segmentation can be effectively solved, and the brain tumor segmentation precision is improved.
Owner:HENAN UNIVERSITY OF TECHNOLOGY

Feature extraction and state recognition of one-dimensional physiological signal based on depth learning

The present invention discloses a feature extraction and state recognition method for one-dimensional physiological signal based on depth learning. The method comprises: establishing a feature extraction and state recognition analysis model DBN of a on-dimensional physiological signal based on depth learning, wherein the DBN model adopts a "pre-training+fine-tuning" training process, and in a pre-training stage, a first RBM is trained firstly and then a well-trained node is used as an input of a second RBM, and then the second RBM is trained, and so forth; and after training of all RBMs is finished, using a BP algorithm to fin-tune a network, and finally inputting an eigenvector output by the DBN into a Softmax classifier, and determining a state of an individual that is incorporated into the one-dimensional physiological signal. The method provided by the present invention effectively solves the problem that in the conventional one-dimensional physiological signal classification process, feature inputs need to be selected manually so that classification precision is low; and through non-linear mapping of the deep confidence network, highly-separable features/feature combinations are automatically obtained for classification, and a better classification effect can be obtained by keeping optimizing the structure of the network.
Owner:SICHUAN UNIV

Image classification method and device

The invention discloses an image classification method and an image classification device. The image classification method comprises the following steps: based on a big image data set, training an AlexNet model structure; migrating the five trained convolution layers to a small database to form a lower-level feature extraction layer, and constructing together with a residual network layer, a multiscale pooling layer, a feature layer and a softmax classifier to obtain a migration model structure, wherein the residual network layer includes two convolution layers; inputting small image data set into the migration model structure, upgrading parameters by adopting a batch gradient descending method, and training an image classification hybrid model; and classifying according to the image classification hybrid model to obtain a classification result. By migrating the pretrained convolution layers on the big data set to the small data set, increasing the multiscale pooling layer, and serially connecting the feature quantity output by the residual network layer and the multiscale pooling layer and inputting to the classifier, the feature quantity is increased and the overfitting problem can be relieved; and through the hybrid model trained based on a convolution nerve network and a migration learning, the image classification accuracy can be effectively improved.
Owner:GUANGDONG UNIV OF TECH

High-resolution SAR terrain classification method based on multiscale convolution and feature fusion

ActiveCN108154192ARetain propertiesPreserve scattering propertiesCharacter and pattern recognitionSmall sampleData set
The invention discloses a high-resolution SAR terrain classification method based on multiscale convolution and feature fusion, and mainly aims at solving the problem in the prior art that the classification precision is low and overfitting easily occurs. The high-resolution SAR terrain classification method comprises the steps of 1, extracting textural features and wavelet features of to-be-classified images; 2, fusing the to-be-classified images, the textural features and the wavelet features to constitute a fusion feature matrix; 3, according to the fusion feature matrix, constructing a training dataset and a testing dataset; 4, adding a multiscale convolution layer and a shuffle layer to an existing CNN network, changing a full-joint layer into a convolution layer, and constructing a multiscale convolution fusion network; 5, using the training dataset to train the multiscale convolution fusion network to obtain model parameters; 6, using the model parameters to initialize the multiscale fusion network to classify a test set. By means of the high-resolution SAR terrain classification method based on the multiscale convolution and the feature fusion, the parameters of the networkare reduced, the overfitting phenomenon of a small sample problem is solved, the classification precision is improved, and the high-resolution SAR terrain classification method can be applied to high-resolution SAR image terrain classification.
Owner:XIDIAN UNIV

Video human action reorganization method based on sparse subspace clustering

The invention belongs to computer visual pattern recognition and a video picture processing method. The computer visual pattern recognition and the video picture processing method comprise the steps that establishing a three-dimensional space-time sub-frame cube in a video human action reorganization model, establishing a human action characteristic space, conducting the clustering processing, updating labels, extracting the three-dimensional space-time sub-frame cube in the video human action reorganization model and the human action reorganization from monitoring video, extracting human action characteristic, confirming category of human sub-action in each video and classifying and merging on videos with sub-category labels. According to the computer visual pattern recognition and the video picture processing method, the highest identification accuracy is improved by 16.5% compared with the current international Hollywood2 human action database. Thus, the video human action reorganization method has the advantages that human action characteristic with higher distinguishing ability, adaptability, universality and invariance property can be extracted automatically, the overfitting phenomenon and the gradient diffusion problem in the neural network are lowered, and the accuracy of human action reorganization in a complex environment is improved effectively; the computer visual pattern recognition and the video picture processing method can be applied to the on-site video surveillance and video content retrieval widely.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Method for reducing dimensions of hyper-spectral data on basis of pairwise constraint discriminate analysis and non-negative sparse divergence

ActiveCN103544507AAvoid opt-inAchieve knowledge transferCharacter and pattern recognitionHyperspectral data classificationSource field
The invention discloses a method for reducing dimensions of hyper-spectral data on the basis of pairwise constraint discriminate analysis and non-negative sparse divergence, and belongs to methods for processing hyper-spectral remote sensing images. The method aims to solve the problem of deterioration of the classification performance of most advanced algorithms for classifying hyper-spectral data on the basis of machine learning when source hyper-spectral data and target hyper-spectral data are distributed differently. The method includes firstly, performing pairwise constraint discriminate analysis according to pairwise constraint samples; secondly, designing a non-negative sparse divergence criterion to create a bridge among source-field hyper-spectral data and target-field hyper-spectral data which are distributed differently; thirdly, combining the pairwise constraint discriminate analysis with the bridge to transfer knowledge from the source hyper-spectral data to the target hyper-spectral data. The pairwise constraint samples containing discriminate information can be automatically acquired. The method has the advantages that the knowledge can be transferred among the hyper-spectral data acquired at different moments, in different areas or by different sensors; the information of the source-field hyper-spectral data can be effectively utilized to analyze the target-field hyper-spectral data, and high integral classification precision and a high Kappa coefficient can be acquired.
Owner:CHINA UNIV OF MINING & TECH

Financial reimbursement all-invoice picture recognition and processing method

The invention discloses a financial reimbursement all-invoice picture recognition and processing method. According to the method, a specific scene of invoice recognition is optimized to some extent, invoice recognition types are expanded, invoices of all types can be recognized, and recognition is more accurate and higher in efficiency. The method is mainly used for performing recognition and processing after the invoices of all types are scanned into pictures. The method comprises the specific steps that the invoice pictures obtained after scanning are subjected to color preprocessing, and input data is provided for picture contour preprocessing; the pictures obtained after color preprocessing are subjected to contour detection, and pixel interference outside an invoice paper range is eliminated; the pictures obtained after contour detection are subjected to text preprocessing, characters on the invoices are recognized; and a character set is formed and output; the output character set is subjected to picture recognition and processing, a recognition and processing result is organized into a result object with semantics according to semantics of invoice recognition, and the result object is used as a final recognition result.
Owner:北京天宇星空科技有限公司

Pedestrian re-identification method and system of binarized triple twin network model

The invention discloses a pedestrian re-identification and method for a binary triple twin network model, and the method comprises the steps: enabling three paths of convolutional neural networks to input positive and negative samples and detection sample images, extracting image features, and enabling each path of convolutional neural network to comprise a convolutional layer, a pooling layer, and a full connection layer; wherein the convolutional layer, the pooling layer and the full connection layer of each convolutional neural network are the same and share weight parameters, and binarizing the weight parameters and an activation function value between the convolutional layer and the pooling layer; connecting the Softmax layer with each full connection layer, and classifying and normalizying features output by the convolutional neural network and connecting the Triplet loss verification function module with the Softmax layer and using the module for receiving the sample characteristics output by the normalization classification layer and carrying out similarity calculation on the sample pairs. According to the invention, the deep learning model with stronger identification capability is obtained, and the problem of large content difference caused by illumination, scene change and human body posture diversification of similar and same pedestrian pictures on different pedestrian picture contents is better solved.
Owner:SHANGHAI MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products