Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

166results about How to "Solve vanishing gradient" patented technology

Human behavior recognition method based on attention mechanism and 3D convolutional neural network

The invention discloses a human behavior recognition method based on an attention mechanism and a 3D convolutional neural network. According to the human behavior recognition method, a 3D convolutional neural network is constructed; and the input layer of the 3D convolutional neural network includes two channels: an original grayscale image and an attention matrix. A 3D CNN model for recognizing ahuman behavior in a video is constructed; an attention mechanism is introduced; a distance between two frames is calculated to form an attention matrix; the attention matrix and an original human behavior video sequence form double channels inputted into the constructed 3D CNN and convolution operation is carried out to carry out vital feature extraction on a visual focus area. Meanwhile, the 3DCNN structure is optimized; a Dropout layer is randomly added to the network to freeze some connection weights of the network; the ReLU activation function is employed, so that the network sparsity isimproved; problems that computing load leap and gradient disappearing due to the dimension increasing and the layer number increasing are solved; overfitting under a small data set is prevented; and the network recognition accuracy is improved and the time losses are reduced.
Owner:NORTH CHINA ELECTRIC POWER UNIV (BAODING) +1

Chinese text classification method based on super-deep convolution neural network structure model

The invention provides a Chinese text classification method based on a super-deep convolution neural network structure model. The method comprises the steps of collecting a training corpus of a word vector from the internet, combining a Chinese word segmentation algorithm to conduct word segmentation on the training corpus, and obtaining a word vector model; collecting news of multiple Chinese news websites from the internet, and marking the category of the news as a corpus set for text classification, wherein the corpus set is divided into a training set corpus and a test set corpus; conducting word segmentation on the training set corpus and the test set corpus respectively, and then obtaining the word vectors corresponding to the training set corpus and the test set corpus respectively by utilizing the word vector model; establishing the super-deep convolution neural network structure model; inputting the word vector corresponding to the training set corpus into the super-deep convolution neural network structure model, and conducting training and obtaining a text classification model; inputting the Chinese text which needs to be sorted into the word vector model, obtaining the word vector of the Chinese text which needs to be classified, and then inputting the word vector into the text classification model to complete the Chinese text classification.
Owner:HEBEI UNIV OF TECH

Medical image segmentation method of residual full convolutional neural network based on attention mechanism

ActiveCN110189334ASolve the problem of lack of spatial features of imagesReduce redundancyImage enhancementImage analysisImage segmentationImaging Feature
The invention provides a medical image segmentation method of a residual full convolutional neural network based on an attention mechanism. The medical image segmentation method comprises the steps: preprocessing a to-be-segmented medical image; constructing a residual full convolutional neural network based on the attention mechanism, wherein the residual full convolutional neural network comprises a feature map contraction network, an attention network and a feature map expansion network group; inputting the training set data into a residual error type full convolutional neural network for training to obtain a learned convolutional neural network model; and inputting the test set data into the learned convolutional neural network model, and performing image segmentation to obtain segmented images. According to the medical image segmentation method, an attention network is utilized to effectively transmit image features extracted from a feature map contraction network to a feature mapexpansion network; and the problem of lack of image spatial features in an image deconvolution process is solved while the attention network can also inhibit image regions irrelevant to a segmentation target in a low-layer feature image, so that the redundancy of the image is reduced, and meanwhile, the accuracy of image segmentation is also improved.
Owner:NANJING UNIV OF POSTS & TELECOMM

Load optimization scheduling method and system for residential micro-grid, and storage medium

The invention provides a load optimization scheduling method and system for a residential micro-grid, and a storage medium. The method comprises the following steps of acquiring environmental data andtime data of the residential micro-grid in a preset future time period; inputting the environmental data and the time data into a pre-trained load prediction model in order to obtain electrical loaddata of the residential micro-grid in the future time period; inputting the environmental data and the time data into a pre-trained photovoltaic output power prediction model in order to obtain photovoltaic output power data of the residential micro-grid in the future time period; determining an objective function of the residential micro-grid in the future time period and a corresponding constraint condition, wherein the optimization objective of the objective function is that the total cost of the residential micro-grid is minimum; and solving the objective function through a particle swarmalgorithm in order to obtain a load scheduling scheme of the residential micro-grid in the future time period. The invention provides the load scheduling scheme suitable for the current micro-grid, and the operating cost of the residential micro-grid is reduced.
Owner:HEFEI UNIV OF TECH

Cement finished product specific surface area prediction method and system based on long-term and short-term memory network

ActiveCN111079906AEliminate the effects of specific surface area predictionsWith memory functionNeural architecturesNeural learning methodsStochastic gradient descentAlgorithm
The invention discloses a cement finished product specific surface area prediction method and system based on a long-term and short-term memory network. The method comprises the following steps: sorting training input data in a training input set according to a time sequence; inputting the sorted training input set into a pre-constructed long-term and short-term memory network model to obtain a cement finished product specific surface area prediction value at each moment; calculating a node error term of each neuron by adopting a time-based back propagation algorithm according to the trainingoutput set and the cement finished product specific surface area prediction value, wherein the node error term comprises a forgetting gate error term, an input gate error term and an output gate errorterm; training the to-be-trained parameters by adopting a random gradient descent method according to the node error term to obtain a trained long-term and short-term memory network model; and inputting the to-be-tested input set into the trained long-short-term memory network model to obtain a to-be-tested cement finished product specific surface area prediction value. The accuracy of cement finished product specific surface area prediction can be improved.
Owner:YANSHAN UNIV

Non-invasive load decomposition method and system

The invention discloses a non-invasive load decomposition method and system. According to the scheme, the method comprises the steps of acquiring a load total power sequence of a target user within aperiod of time; and based on the load total power sequence of the user and set load types, decomposing load total power to obtain power characteristic sequences of the load types in the time period according to pre-trained relations between the total power sequence and the power characteristic sequences of the load types, wherein the load types are determined by using conditions of household appliances by the user. The training process comprises the steps of performing receptive field amplification based on identity mapping and a time sequence. The non-invasive load is decomposed through the pre-trained relations between the total power sequence and the power characteristic sequences of the load types, so that the correctness and accuracy of a load decomposition result are improved; and inthe training process, the load total power is subjected to a receptive field method according to the time sequence through the identity mapping, so that the problem of gradient disappearance in the load decomposition process is solved, the receptive field is expanded, and the load decomposition result is more authentic.
Owner:CHINA ELECTRIC POWER RES INST +3

Deep learning-based method for predicting lithologic sequence model through using seismic data

ActiveCN109828304ASolve the situation where there is asymmetric data volumeFlexibleNeural architecturesSeismic signal processingObservation dataNetwork model
The invention discloses a deep learning-based method for predicting a lithologic sequence model through using seismic data. The method includes the following steps that: 1) the lithologic data of a target stratum section of a well in a work area and near-well trace seismic data are measured so as to be adopted as training data; 2) the near-well trace seismic data are normalized, so as to be converted to a range of -1 to 1; 3) a stacked recurrent neural network and a sequence-to-sequence recurrent neural network model are adopted to respectively train the processed near-well trace seismic dataand the well lithologic data, with the near-well trace seismic data adopted as observation data, and the well lithologic data adopted as target data, iterative computation is performed, so that a learning model reaches convergence; and 4) actual seismic data are inputted to the learning model which is obtained after calculation in the step 3), so that a predicted lithologic sequence is obtained. With the deep learning-based method for predicting the lithologic sequence model through using the seismic data of the invention adopted, a lithologic data body capable of effectively reflecting reservoir distribution can be generated under the control of a seismic data sequence, and an inter-well reservoir prediction problem can be solved, and a basis can be provided for exploration and development.
Owner:CHINA NAT OFFSHORE OIL CORP +1

Short-term water quality and quantity prediction method and system based on deep learning

The invention provides a short-term water quality and water quantity prediction method based on deep learning, and the method comprises the following steps: A, preprocessing original water quality andwater quantity data, and dividing the processed data into a training set and a test set; B, inputting the training set into an LSTM network for training, and updating the weight by using an adam algorithm to obtain a prediction model; C, predicting a prediction value in the test set by using a prediction model based on the original water quality and water quantity data; D, inputting the prediction error into the ARMA model to obtain an error correction model of an error sequence; E, inputting to-be-predicted data into the prediction model and the error correction model, and geometrically adding settlement results to obtain a predicted value; the invention further provides a water quality and quantity prediction system. The invention has the advantages that the LSTM neural network and theARMA model are used for calculating the water quality and water quantity at the moment to be predicted and the prediction error respectively, higher universality and stability are achieved, and the water quality and water quantity prediction result is more stable.
Owner:ANHUI ZEZHONG SAFETY TECH +2

Wind power prediction method and system integrating long-term and short-term memory network and extreme learning machine

PendingCN111832825ASolve vanishing gradientImproving the effectiveness of wind power forecastingForecastingNeural architecturesLearning machineFeature vector
The invention provides a wind power prediction method integrating a long-term and short-term memory network and an extreme learning machine, and the method comprises the steps of obtaining a wind power sequence and corresponding meteorological data, and carrying out the recombination of the feature data and meteorological feature data of the wind power sequence according to the frequency size, andforming a low-frequency combined input feature vector and a high-frequency combined input feature vector; inputting the low-frequency combined input feature vector into a trained long-term and short-term memory network prediction model to obtain a first prediction result, and inputting the high-frequency combined input feature vector into a trained extreme learning machine prediction model to obtain a second prediction result; and fusing prediction results of the long-term and short-term memory network prediction model and the extreme learning machine prediction model to obtain a final prediction result of the wind power. Different prediction models are set for components of different frequencies, and the prediction result of the prediction model is fused so that the wind power predictioneffect can be improved. Meanwhile, the strong coupling effect of the wind power meteorological information and the wind power is fully considered, and the accuracy of wind power prediction is improved.
Owner:SHANDONG UNIV

Electrocardiosignal atrial fibrillation detection method based on one-dimensional dense connection convolutional network

The invention discloses an electrocardiosignal atrial fibrillation detection method based on a one-dimensional dense connection convolutional network. The method comprises the following steps: step 1,acquiring a plurality of electrocardiosignal segments containing atrial fibrillation labels; 2, preprocessing the electrocardiosignal segments in the step 1 and taking the electrocardiosignal segments as training data for training a one-dimensional dense connection convolutional network model; 3, building the one-dimensional dense connection convolutional network model by utilizing a deep learning framework; 4, randomly selecting the size of an initial parameter, continuously sending the training data to the model in batches, and performing back propagation to update the network parameter toobtain an optimal parameter; 5, carrying out lightweight processing on the trained network, wherein the lightweight processing comprises parameter quantification and network pruning; and 6, collectingelectrocardiosignals of the patient, sending the signal waveform as input into the one-dimensional dense connection convolutional network model, outputting a result, and pre-judging whether the patient has atrial fibrillation or not.
Owner:BEIHANG UNIV

Electrocardiosignal atrial fibrillation detection device based on dense connection convolutional recurrent neural network

The invention discloses an electrocardiosignal atrial fibrillation detection device based on a dense connection convolutional recurrent neural network. The electrocardiosignal atrial fibrillation detection device comprises a data acquisition module, a preprocessing module, an atrial fibrillation detection module and a training module; the atrial fibrillation detection module is used for building adense connection convolutional recurrent neural network atrial fibrillation detection model; the model comprises a convolution layer, and a dense connection neural network, a bidirectional recurrentneural network and an output discrimination classification layer. The dense connection convolutional neural network effectively solves the problems of gradient disappearance and network ductility, and makes full use of characteristics at the same time; the bidirectional recurrent neural network enables the network to be more suitable for an analysis scene of time sequence signals; and the detection device detects electrocardiosignals from a space domain and a time domain successively, and considerable atrial fibrillation detection accuracy is finally achieved through combination and cascadingof the two networks. According to the technical scheme, compared with traditional atrial fibrillation segmented detection processes, the operation process is simpler, and robustness and algorithm stability are higher.
Owner:BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products