Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

121results about How to "Avoid Gradient Explosion" patented technology

Cutter wear state monitoring method based on deep gated cycle unit neural network

The invention discloses a cutter wear state monitoring method based on a deep gated cycle unit neural network. The method comprises the steps that vibration signals generated in the tool machining process are collected in real time through a sensor, after wavelet threshold denoising, the signals are input into a one-dimensional convolutional neural network for single time step time sequence signallocal feature extraction; then, inputting the time series signal into an improved deep gated recurrent unit neural network CABGRUs to carry out time series signal time series feature extraction; an Attention mechanism is introduced to calculate network weights and reasonably distribute the network weights, and finally, signal feature information with different weights is put into a Softmax classifier to classify tool wear states, so that complexity and limitation caused by manual feature extraction are avoided; meanwhile, the problem that a single convolutional neural network ignores correlation before and after a time sequence signal is effectively solved, and the accuracy of the model is improved by introducing an Attention mechanism. Therefore, the method has the characteristic of improving the real-time performance and accuracy of cutter wear state monitoring.
Owner:GUIZHOU UNIV

Two-dimensional recursive network-based recognition method of Chinese text in natural scene images

The invention discloses a two-dimensional recursive network-based recognition method of Chinese text in natural scene images. Firstly, a training sample set is acquired, and a neural network formed bysequentially connecting a deep convolutional network, a two-dimensional recursive network used for encoding, a two-dimensional recursive network used for decoding and a CTC model is trained; test samples are input into the trained deep convolutional network, and feature maps of the test samples are acquired; the feature maps of the test samples are input into the trained two-dimensional recursivenetwork, which is used for encoding, to obtain encoding feature maps of the test samples; the encoding feature maps of the test samples are input into the trained two-dimensional recursive network, which is used for decoding, to obtain a probability result of each commonly used Chinese character in each image of the test samples; and clustering searching processing is carried out, and finally, the overall Chinese text in the test samples is recognized. According to the method of the invention, space/time information and context information of the text images are fully utilized, the text imagepre-segmentation problem can be avoided, and recognition accuracy is improved.
Owner:SOUTH CHINA UNIV OF TECH

Time-space domain correlation prediction method for air pollutant concentration

ActiveCN109492822AAvoid the vanishing or exploding gradient problemEliminate degradation problemsForecastingNeural architecturesMachine learningPollutant
The invention relates to a time-space domain correlation prediction method for air pollutant concentration, which comprises the steps of S1, constructing a prediction model based on a residual error network and a convolutional LSTM network by taking PM2.5 as a sample for target pollutant prediction; s2, selecting appropriate training and testing data from the environment monitoring data to complete initialization of the prediction model; s3, training the prediction model stage by stage to obtain a neural network prediction model capable of accurately predicting PM2.5; s4, selecting hyper-parameters (the number of layers, the number of nodes and the learning rate) of the model by utilizing the verification set until the model is optimal; and S5, carrying out urban PM2.5 prediction by utilizing the verified prediction model. Compared with the prior art, the method has the advantages that the convolutional LSTM network is used as a middle layer, deep space-time association feature extraction is performed on spatial features extracted by the bottom ResNet network, accordingly, the prediction performance of the network model can be improved, the hidden state of the convolutional LSTM can be received by the aid of the full connection layer, and a final prediction result can be generated.
Owner:SHANGHAI NORMAL UNIVERSITY

Spatial spectrum fusion hyperspectral image classification method based on a three-dimensional deep residual network

InactiveCN109871830AAvoid the problem of training degradationSmall scaleCharacter and pattern recognitionSlide windowClassification methods
The invention belongs to the field of hyper-spectral intelligent perception, and particularly discloses a spatial spectrum fusion hyper-spectral image classification method based on a three-dimensional deep residual network, which comprises the following steps: S1, generating candidate frames by using a sliding window method, and generating a plurality of windows; S2, randomly dividing the windowinto a training set and test set data; S3, training a three-dimensional depth residual network (3D-CNN) based on the hyperspectral data in the training set; S4, inputting a test set sample into the classification model, The hyperspectral image classification method has the advantages that the characteristics of the input data are extracted and predicted, the spectral characteristics and the spatial spectrum characteristics of the hyperspectral image are extracted at the same time, the classification precision of the hyperspectral image is further improved, a residual network structure is introduced, and the problem of learning degradation in a traditional hyperspectral classification neural network is solved. The hyperspectral image target classification method is clear in structure and easy to implement, the structural characteristics of the hyperspectral image can be fully utilized, and the hyperspectral image target classification precision is remarkably improved while the calculation time is shortened.
Owner:NAT UNIV OF DEFENSE TECH

Face recognition model training method and device, face recognition method and device, equipment and storage medium

The invention relates to the field of biological recognition, trains a face recognition model based on deep learning, and particularly discloses a face recognition model training method and device, aface recognition method and device, equipment and a storage medium. The face recognition model training method comprises the steps: carrying out the training of a preset convolutional neural network,so as to construct a feature extraction network; establishing connection between the feature extraction network and a preset classification network to obtain a first convolutional neural network model; freezing a weight parameter of the feature extraction network of the first convolutional neural network model; performing iterative training on a classification network in the first convolutional neural network model to obtain a second convolutional neural network model; unfreezing the weight parameters of the feature extraction network of the second convolutional neural network model; and training the unfrozen second convolutional neural network model to obtain a face recognition model. The face recognition model training method can improve the face recognition speed and improve the stability of the model.
Owner:PING AN TECH (SHENZHEN) CO LTD

Aspect-level text sentiment classification method and system

The invention discloses an aspect-level text sentiment classification method and system, and the method comprises the steps: extracting the long-distance dependence features of a sentence text according to the obtained local feature vectors of the sentence text, and obtaining the context feature representation of the sentence text; constructing a syntactic dependency relationship among words in the sentence text according to the context feature representation of the sentence text to obtain aspect-level feature representation of the sentence text; and constructing a dependency tree-based graphattention neural network, and obtaining aspect-level emotion categories of the text according to aspect-level feature representation of the sentence text. The method comprises the steps of extractinglocal feature information in a sentence by adopting a convolutional neural network, learning pooled features of the convolutional neural network by utilizing a bidirectional long-short-term memory network, obtaining context information of the sentence, constructing a dependency tree-based graph attention network model, and modeling a sentence dependency relationship by utilizing syntactic information of a dependency tree, thereby improving the performance of sentiment classification.
Owner:SHANDONG NORMAL UNIV

Navigation reminding method for short-term traffic flow prediction based on SVD-PSO-LSTM

The invention relates to a navigation reminding method for short-term traffic flow prediction based on SVD-PSO-LSTM. Firstly, an LSTM model is trained and optimized; collecting historical traffic flowdata; preprocessing the LSTM model, and inputting the preprocessed LSTM model into the trained and optimized LSTM model; the method comprises the following steps: outputting a prediction result of ashort-time traffic flow by a traffic flow sensor, publishing the prediction result of the short-time traffic flow on navigation software, displaying the prediction result of the short-time traffic flow on a road section in different colors according to the traffic flow, providing displayed information for a driver to refer to congestion, and reasonably planning travel and selecting a navigation route according to the displayed information. When the LSTM model is trained and optimized, historical traffic flow data is collected, preprocessed and then divided into a training set and a test set, and then the LSTM model is trained and optimized by adopting a PSO algorithm and the training set; wherein the preprocessing comprises the step of performing noise reduction operation on the data by adopting an SVD algorithm. The method provided by the invention is more accurate in prediction effect, and can reasonably plan a navigation route.
Owner:DONGHUA UNIV

Hospitalization behavior prediction method and device based on time-varying attention improved Bi-LSTM

ActiveCN110334843AImprove accuracyMaintain long-term dependency association stabilityMedical data miningForecastingDiseaseState prediction
The invention provides a hospitalization behavior prediction method and device based on time-varying attention improved Bi-LSTM. The hospitalization behavior prediction method comprises the steps of extracting features from massive medical insurance data, wherein the correlation degree between the features and hospitalization behaviors is greater than a preset correlation degree threshold; constructing Bi-LSTM by utilizing the extracted hospitalization behavior characteristics and the corresponding weights of the hospitalization behavior characteristics; updating a weight value of each hospitalization state prediction data in the Bi-LSTM by adopting hospital-disease attraction data pre-generated by an attention mechanism based on the hospitalization state prediction data obtained by the Bi-LSTM; constructing a time adjustment function; outputting a multi-period multi-state hospitalization state prediction vector; utilizing the hospitalization state prediction vector to construct a softmax prediction function; calculating a loss function of an output value of the softmax prediction function, training learning parameters of Bi-LSTM by adopting back propagation, and completing training of the model; and after model training is completed, outputting a prediction result of the experimental sample set, comparing the prediction result with an actual hospitalization behavior, and feeding back and updating a weight value of hospitalization state prediction data.
Owner:SHANDONG UNIV

Method for recognizing lithology through reconstructed textures

The invention discloses a method for recognizing lithology through reconstructed textures. The method comprises the following steps: establishing an original specimen image library; establishing a standardized image library; extracting features of the rock specimen image by using discrete convolution and maximum pooling operation of a convolutional neural network MobileNet-V2 convolutional model,and effectively preventing the problems of gradient explosion and gradient disappearance in the image feature extraction process by using a new algorithm; and selecting the rock category with the maximum probability to output a result, and completing the recognition process. According to the method, the problems of low precision and low recognition efficiency caused by the fact that an existing algorithm is limited to two dimensions in the field of randomly recognizing rock lithology under the conditions of particle distribution and section texture development and imaging quality including rock color, definition and the like are solved. The method has the advantages that various lithology characters can be widely identified without being limited by external factors such as backgrounds andcolors, and the identification accuracy is improved; intelligent identification operation speed improvement.
Owner:长江岩土工程有限公司 +1

Unreal information detection method based on BERT model and enhanced hybrid neural network

The invention discloses an unreal information detection method based on a BERT model and an enhanced hybrid neural network. The method comprises the steps of preprocessing a to-be-detected text; performing convolution and pooling operation on the input matrix by using a CNN network, and splicing the input matrix into a feature sequence; taking the feature sequence as the input of a BiLSTM network,and comprehensively capturing the deep semantic features of the text from the front direction and the rear direction by using a forward LSTM unit and a backward LSTM unit respectively; generating a semantic code containing attention distribution by utilizing an attention layer, and optimizing a feature vector; and finishing classification detection of the feature vectors by utilizing a classifierof the output layer, and judging whether the feature vectors are the non-real information. According to the method, the CNN, the BiLSTM and the attention mechanism are combined, the detection precision of the unreal information is high, local phrase features and global context features of the text of the unreal information can be extracted, text keywords can also be extracted, and the unreasonable influence of irrelevant information on a detection result is reduced.
Owner:CHINA THREE GORGES UNIV

Road scene semantic segmentation method based on full residual cavity convolutional neural network

The invention discloses a road scene semantic segmentation method based on a full-residual cavity convolutional neural network. The method comprises the steps: constructing the full-residual cavity convolutional neural network at a training stage, and enabling the full-residual cavity convolutional neural network to comprise an input layer, a hidden layer and an output layer, and enabling the hidden layer to comprise one transition convolution block, eight neural network blocks, seven deconvolution blocks and four fusion layers; inputting each original road scene image in the training set intoa full residual cavity convolutional neural network for training to obtain 12 semantic segmentation prediction images corresponding to each original road scene image; calculating a loss function value between a set formed by 12 semantic segmentation prediction images corresponding to each original road scene image and a set formed by 12 one-hot coded images processed by corresponding real semantic segmentation images to obtain a full residual cavity convolutional neural network training model; in the test stage, a full residual cavity convolutional neural network training model is used for prediction; the method has the advantages of high segmentation accuracy and strong robustness.
Owner:ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY

Road intersection steering ratio prediction method based on LSTM neural network

The invention discloses a road intersection steering ratio prediction method based on an LSTM neural network. The road intersection steering ratio prediction method comprises the steps: firstly carrying out the statistics of traffic flows in different flow directions of all entrance lanes of a road intersection, and carrying out the data preprocessing; then, calculating the steering ratios of allthe entrance lanes of the road intersection in all the flow directions, and splitting a training set and a test set; then, designing a prediction model based on an LSTM neural network, and selecting an appropriate activation function and an appropriate loss function; and finally, sequentially selecting the training set and the test set of different entrance lanes, training and testing the corresponding prediction model, and completing the establishment of the prediction model, thereby being applied to the prediction of the road intersection steering ratio. According to the road intersection steering ratio prediction method, the time sequence characteristic of the traffic flow can be fully utilized, and the prediction precision of the road intersection steering ratio is effectively improved, and meanwhile, the problems of gradient explosion and gradient dispersion are avoided through the reasonable neural network structure and the good activation function characteristic, and the applicability of the prediction method is improved.
Owner:ZHEJIANG UNIV OF TECH

Video behavior identification method based on compression reward and punishment mechanism

The invention belongs to the field of computer vision, and relates to a video behavior identification method based on a compression reward and punishment mechanism. The technical problems that an existing video behavior recognition method is large in calculated amount, poor in robustness, low in accuracy and the like are mainly solved. According to the invention, a convolutional neural network containing a compression reward and punishment mechanism is designed for video behavior identification. The network is constructed based on a time segmentation network. The method comprises the followingsteps: firstly, dividing a video into three segments, randomly extracting an optical flow image and an RGB frame from each segment, respectively inputting the optical flow image and the RGB frame into a time and space network, weighting the extracted features through compression and reward and punishment operations, and respectively carrying out preliminary prediction on behaviors of the weightedtime and space features on a time channel and a space channel; fusing the preliminary prediction result of each segment to obtain a video-level prediction result; and finally, fusing the video-levelprediction results to obtain a video behavior recognition result. Experiments are carried out on data sets UCF101 and HMDB51, and results show that compared with other models, the model has higher accuracy.
Owner:SHANXI UNIV

A tail end space energy consumption prediction method based on building total energy consumption, a medium and equipment

The invention discloses a tail end space energy consumption prediction method based on building total energy consumption, a medium and equipment, and the method comprises the steps of sequentially transmitting preprocessed sample data of N tail end spaces at a moment t and first tau time steps into a tail end space energy consumption prediction model in each step of model training; enabling the model to obtain N tail end space energy consumption prediction values at the t moment through N times of forward calculation, adding the N tail end space energy consumption prediction values to the actual total energy consumption of the building to calculate a loss function, and adjusting parameters of the tail end space energy consumption prediction model through back propagation of a gradient descent method; repeating the training process on the sample data at all moments until the model converges to the prediction precision, and completing the training of the end space energy consumption prediction model; and predicting the energy consumption generated by the tail end space by using the obtained tail end space energy consumption prediction model through the controller parameters of the tail end devices, the temperature and humidity sensors, the Internet weather information, the people flow density, the power of the electric appliances and the lighting devices and the house structure parameters in the operation process of the building heating and ventilation system. According to the invention, the training of the tail end space energy consumption prediction model is realized underthe condition that the tail end space energy consumption historical data is missing.
Owner:XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products