Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

1082 results about "Hyper parameters" patented technology

The simplest definition of hyper-parameters is that they are a special type of parameters that cannot be inferred from the data. Imagine, for instance, a neural network. As you probably know, artificial neurons learning is achieved by tuning their weights in a way that the network gives the best output label in regard to the input data.

Condition monitoring data stream anomaly detection method based on improved gaussian process regression model

The invention relates to a condition monitoring data stream anomaly detection method, in particular to a condition monitoring data stream anomaly detection method based on an improved gaussian process regression model. The problem that an existing method for processing monitoring data stream anomaly detection is poor in effect is solved. The method comprises the steps that firstly, the historical data sliding window size is determined; secondly, the types of a mean value function and a covariance function are determined; thirdly, the hyper-parameter initial value is set to be the random number from 0 to 1; fourthly, q data closest to the current time t are extracted; fifthly, the gaussian process regression model is determined; sixthly, prediction is conducted by means of the nature of the gaussian process regression model; seventhly, PI of normal data at the time t+1; eighthly, monitoring data are compared with the PI; ninthly, whether the real monitoring data need to be marked to be abnormal or not is judged; tenthly, beta (xt+1) corresponding to the monitoring value at the time t+1 is calculated; eleventhly, the real value or prediction value and the t+1 are added into DT; twelfthly, new DT is created. The condition monitoring data stream anomaly detection method based on the improved gaussian process regression model is applied in the field of network communication.
Owner:HARBIN INST OF TECH

Urban road congestion degree prediction method based on time sequence traffic events

The invention relates to an urban road congestion degree prediction method based on time sequence traffic events. The method comprises the steps of: S1, acquiring historical traffic event data, real-time traffic event data and video monitoring data of an urban road section; S2, identifying traffic congestion forewarning events in the video data through the 3D CNN, and performing data space-time fusion according to the historical traffic events; S3, determining congestion degree classification labels, constructing a time sequence traffic congestion event data dictionary, and screening a training set, a verification set and a test set; S4, establishing an LSTM sequence data classification model, inputting the training set, and iteratively updating model parameters by utilizing a gradient descent method; S5, inputting the verification set into the model with updated parameters, optimizing and adjusting hyper-parameters, and selecting an optimal model; and S6, inputting the test set into the optimal training model, checking the effectiveness of the model, and carrying out road congestion prediction according to real-time traffic monitoring data. According to the method, a sequence dataclassification model is established by using LSTM, and the urban road congestion degree is predicted based on time sequence traffic events.
Owner:JILIN UNIV

Gaussian process regression-based method for predicting state of health (SOH) of lithium batteries

The invention discloses a Gaussian process regression-based method for predicting state of health (SOH) of lithium batteries, relates to a method for predicting the SOH of the lithium batteries, belongs to the fields of electrochemistry and analytic chemistry and aims at the problem that the traditional lithium batteries are bad in health condition prediction adaptability. The method provided by the invention is realized according to the following steps of: I. drawing a relation curve of the SOH of a lithium battery and a charge-discharge period; II, selecting a covariance function according to a degenerated curve with a regeneration phenomenon and a constraint condition; III, carrying out iteration according to a conjugate gradient method, then determining the optimal value of a hyper-parameter and bringing initial value thereof into prior distribution; IV, obtaining posterior distribution according to the prior part; V, obtaining the mean value and variance of predicted output f' without Gaussian white noise; and VI, together bringing the practically predicted SOH of the battery and the predicted SOH obtained in the step V into training data y to obtain the f', then determining the prediction confidence interval and predicting the SOH of the lithium battery. The method provided by the invention is used for detecting lithium batteries.
Owner:HARBIN INST OF TECH

Deep neural network multi-task hyper-parameter optimization method and device

The invention discloses a deep neural network multitask hyper-parameter optimization method. The method comprises: firstly, a data training set of each task being subjected to model training to obtaina multi-task learning network model; secondly, predicting all points in an unknown region, screening candidate points from a prediction result, finally evaluating the screened candidate points, adding the candidate points and target function values of the candidate points into the data training set, and establishing a model, predicting, screening and evaluating again; and so on, until the maximumnumber of iterations is reached, finally selecting a candidate point corresponding to the maximum target function value from the data training set, that is, the hyper-parameter combination of each task in the multi-task learning network model. According to the method, the Gaussian model is replaced by the radial basis function neural network model, and the radial basis function neural network model is combined with multi-task learning and is applied to the Bayesian optimization algorithm to realize hyper-parameter optimization, so that the calculation amount of hyper-parameter optimization isgreatly reduced. The invention further discloses an electronic device and a storage medium.
Owner:SHENZHEN UNIV

Use of machine learning for classification of magneto cardiograms

The use of machine learning for pattern recognition in magnetocardiography (MCG) that measures magnetic fields emitted by the electrophysiological activity of the heart is disclosed herein. Direct kernel methods are used to separate abnormal MCG heart patterns from normal ones. For unsupervised learning, Direct Kernel based Self-Organizing Maps are introduced. For supervised learning Direct Kernel Partial Least Squares and (Direct) Kernel Ridge Regression are used. These results are then compared with classical Support Vector Machines and Kernel Partial Least Squares. The hyper-parameters for these methods are tuned on a validation subset of the training data before testing. Also investigated is the most effective pre-processing, using local, vertical, horizontal and two-dimensional (global) Mahanalobis scaling, wavelet transforms, and variable selection by filtering. The results, similar for all three methods, were encouraging, exceeding the quality of classification achieved by the trained experts. Thus, a device and associated method for classifying cardiography data is disclosed, comprising applying a kernel transform to sensed data acquired from sensors sensing electromagnetic heart activity, resulting in transformed data, prior to classifying the transformed data using machine learning.
Owner:CARDIOMAG IMAGING

Deep-belief-network-based diagnosis method of distribution network

The invention discloses a deep-belief-network-based diagnosis method of a distribution network. The method comprises: original monitoring data of a distribution network are obtained; denoising and normalized modeling are carried out on the original data; hyper parameters of a distribution network fault diagnosis model are set; 70% of collection data are used as training samples to train the model;the rest of 30% of samples are used for testing the trained fault diagnosis model, wherein six outputted nodes are used for expressing probabilities of occurrences of two kinds of faults at three phases respectively and training is carried out again if the precision does not meet the requirement; the operation state of the distribution network is monitored by the model; and if a fault is caused,a fault type and a line are obtained. According to the invention, the deep learning theory is applied to the fault diagnosis of the distribution network; and changing characteristics of all collectiondata under various faults are learned automatically under the circumstances of complicated structure, many devices, and frequent data deficiency. Meanwhile, the good fault tolerance performance is realized; the fault diagnosis accuracy and timeliness of the distribution network are improved; and thus the distribution network can work stably and safely.
Owner:FOSHAN POWER SUPPLY BUREAU GUANGDONG POWER GRID +1

Weak supervision fine-grained image recognition method based on visual self-attention mechanism

The invention discloses a weak supervision fine-grained image recognition method based on a visual self-attention mechanism. The method involves a student-model module, a teacher-model module and a classification-model module. The student-model and the teacher-model are combined through a Teacher-Student loop feedback mechanism based on Pairwise Approach sorting learning so as to form a self-attention region recommendation network, so that the relation between discriminative region positioning and fine-grained feature learning is enhanced, the discriminative region in the fine-grained image can still be accurately detected under the condition of lacking a target bounding box and a part marking point, and the recognition accuracy is promoted to be remarkably improved; meanwhile, a convolution layer is shared by the three modules, namely, the student-model, the teacher-model and the class-model, so that the model storage space is effectively compressed, the calculation cost is reduced, the method meets the real-time recognition task requirement, and the method is suitable for a large-scale real scene; and besides, a dynamic weight distribution mechanism is adopted in multi-task jointlearning to reduce the amount of artificially set hyper-parameters and enhance the robustness of the model, finally, the whole model is trained and learned in an end-to-end single-stage mode, and thenetwork optimization difficulty is reduced.
Owner:SOUTHEAST UNIV

Estimation method of quasi-stationary broadband array signal direction of arrival based on block sparse Bayesian learning

The present invention discloses an estimation method of a quasi-stationary broadband array signal direction of arrival (DOA) based on block sparse Bayesian learning. An intra-frame correlation and aninterframe independence of a quasi-stationary broadband signal frequency spectrum are employed to set a corresponding a block sparse prior distribution model for signals, and a block sparse Bayesian model is employed to perform estimation of sparse signals, so that an estimation result with higher precision is obtained. Array receiving signals are subjected to appropriate framing processing, eachframe of the signal is subjected to Fourier transform, and each block sparse Bayesian model for each signal is established in a frequency domain; under an assumption of each frame of the signal is independent, information of all the frames is combined to establish a total Bayesian model, and hyper-parameter vectors are employed to control a sparsity of all the frames of signals to be reconstructed; and finally, the expectation maximization algorithm (EM) is employed to obtain an iterative update formula of the hyper-parameter vectors. The estimation method of a quasi-stationary broadband arraysignal direction of arrival based on block sparse Bayesian learning fully utilizes a short-time stability feature of quasi-stationary broadband array signals to establish a block sparse model, and therefore a higher DOA estimation precision can be obtained.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Reinforcement learning based anaphora resolution method

The invention discloses a reinforcement learning based anaphora resolution method, which comprises the following steps: data preprocessing: carrying out word segmentation, sentence segmentation, part-of-speech tagging, part-of-speech reduction, named entity identification, syntactic analysis and word vector conversion on text data to obtain candidate preceding words and analogy word related characteristics; constructing a neural network model: combining the characteristics of the word vectors and the relevant characteristics which can learn the fingering pairs and the relevant semantic information, better sorting and scoring the candidate preceding words and the fingering words, and finally obtaining an fingering chain; and using the trained model to carry out anaphora resolution, inputting text data, and outputting a resolution chain. According to the method, deep learning training is carried out by adopting a reward measurement mechanism for overcoming the defects of a heuristic lossfunction, the model effect is improved, hyper-parameter setting is automatically carried out for different language data sets, the necessity of manual setting is avoided, the practicability of the model is improved, and the application range is expanded.
Owner:NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1

Pedestrian re-identification method fusing random batch masks and multi-scale representation learning

The invention relates to a pedestrian re-identification method fusing random batch masks and multi-scale representation learning. The pedestrian re-identification method comprises the steps of constructing a pedestrian re-identification training network; performing network hyper-parameter adjustment according to preset training parameters to obtain a learning network; shielding multi-scale representation learning and random batch mask branches to obtain a test network, and inputting the test set into the test network to obtain a corresponding test identification result; judging whether the accuracy of the test recognition result is greater than or equal to a preset value or not, if so, inputting the actual data set into the learning network, and otherwise, retraining the network; and finally, shielding multi-scale representation learning and random batch mask branches to obtain an application network, and inputting the query image into the application network to obtain a correspondingidentification result. Compared with the prior art, the method has the advantages that a random batch mask strategy, multi-scale representation learning and loss function joint training are used, moredetailed discrimination features of pedestrian images can be captured, and local important suppressed features are extracted.
Owner:TONGJI UNIV

Training method of multi-moving object action identification and multi-moving object action identification method

The invention provides a training method of multi-moving object action identification, comprising the following steps of: extracting the movement track information of each moving object from video data; layering the movement track information of the moving objects; modeling for the movement mode of the multi-moving object action on each layer; carrying out characteristic description on the model of the movement mode by synthesizing the overall and local movement information in a video, wherein the characteristic at least comprises a three-dimensional hyper-parameter vector for describing the movement track by using a gaussian process; and training a grader according to the characteristic. The invention also provides a multi-moving object action identification method which identifies the multi-moving object action in the video by utilizing the grader obtained by using the training method. In the invention, the movement track of an object is represented by using the gaussian process from a probability angle, and a model is established for a multi-people action mode from three granularity layers, and the characteristics are extracted, which makes the representation of the multi-people action more practical.
Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

U-shaped cavity full-convolution integral segmentation network identification model based on remote sensing image

The invention discloses a U-shaped cavity full-convolution integral segmentation network identification model based on a remote sensing image. The model comprises a data preprocessing module, a modeltraining module and a model evaluation module. The model is characterized in that the data preprocessing module is used for performing data preprocessing operation on a remote sensing image to obtaina data set, and performing equal-ratio sampling on the data set to generate a training set, a verification set and a test set; the model training module is used for establishing a U-shaped cavity full-convolution integral cut network model, training parameters of the U-shaped cavity full-convolution integral cut network model by utilizing data of the training set, performing model learning and updating network weights, adjusting hyper-parameters involved in the U-shaped cavity full-convolution integral cut network model by utilizing the difference between the data of the verification set and the recognition effect, and judging the convergence degree of the U-shaped cavity full-convolution integral cut network model to achieve the purpose of deep training. The model can be widely applied toremote sensing image ground objects of different scales.
Owner:CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products