Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

42 results about "Random effects model" patented technology

In statistics, a random effects model, also called a variance components model, is a statistical model where the model parameters are random variables. It is a kind of hierarchical linear model, which assumes that the data being analysed are drawn from a hierarchy of different populations whose differences relate to that hierarchy. In econometrics, random effects models are used in the analysis of hierarchical or panel data when one assumes no fixed effects (it allows for individual effects). The random effects model is a special case of the fixed effects model.

A method and a device for identifying abnormal users based on a random forest model

The invention provides a method and a device for identifying abnormal users based on a random forest model, belonging to the technical field of big data. The method comprises the following steps: thesample data is counted from the information of a history user according to a preset attribute, wherein, the preset attribute comprises a first class attribute and a second class attribute, and the classification tag of the history user is obtained; a random forest model is trained using the sample data and the classification label, wherein in the training process, the first class of attributes corresponds to a first sampling probability, the second class of attributes corresponds to a second sampling probability, and the first sampling probability is greater than the second sampling probability; According to the preset attribute, the target data is counted from the information of the user to be identified, and the target data is processed through the trained random forest model to determine whether the user to be identified is an abnormal user. The present disclosure may reduce the amount of sample data required for the abnormal user identification method and improve the accuracy of identification.
Owner:CHINA PING AN LIFE INSURANCE CO LTD

Differential evolution random forecast classifier-based photovoltaic array fault diagnosis method

The invention relates to a differential evolution random forecast classifier-based photovoltaic array fault diagnosis method. The method comprises the steps of firstly, collecting photovoltaic array voltages under various working conditions and currents of photovoltaic strings, and performing identification on various working conditions by different identifiers; secondly, determining a quantity range of decision trees in a random forest model by adopting an out-of-bag data-based classification misjudgment rate mean value; thirdly, performing global optimization on the quantity range of the decision trees by utilizing a differential evolution algorithm to obtain an optimal decision tree quantity value; fourthly, substituting the calculated optimal decision tree quantity value into a randomforecast classifier, and training samples to obtain a random forecast fault diagnosis training model; and finally, performing fault detection and classification on a photovoltaic array by utilizing the training model. According to the method, the model training speed can be greatly increased while the optimal model classification accuracy is ensured, so that the fault detection and classificationof the photovoltaic power generation array are realized more quickly and accurately.
Owner:FUZHOU UNIV

Pedestrian recognition system and pedestrian recognition processing method based on random forest support vector machine

The invention relates to a pedestrian recognition system based on a random forest support vector machine. The pedestrian recognition system comprises a characteristic extraction module, a clustering module, a random forest creating module and a scoring model module. The invention also relates to a pedestrian recognition processing method based on the random forest support vector machine. A similarity ranking way is used for replacing the comparison of traditional similarity absolute values, a threshold value does not need to be delimited, and an obtained ranking result is convenient for users to judge; and since multiple characteristics are required for establishing a random forest model and samples can not be subjected to mutual classification perfection only from apparent characteristics, a K-means clustering algorithm is adopted to replace a phenomenon that a sample category is manually given, and potential relationships among samples can be mined. The method and the system exhibit robustness on pedestrian posture change and can eliminate interferences from other types of samples when the similarity is calculated, a ranking result of RankSVM (Support Vector Machine) is in the top, and recognition accuracy can be improved when the similarity is calculated. Compared with traditional algorithms including MCC, RankSVM and the like listed in the prior art, the pedestrian recognition system is high in recognition accuracy.
Owner:GUILIN UNIV OF ELECTRONIC TECH

A power industrial control attack classification method and system based on machine learning

The invention provides a power industrial control attack classification method and system based on machine learning. The method and the system are characterized by utilizing the historical message data of the electric power industrial control, after completing the default value of the data, extracting the characteristic variable, inputting the stochastic forest model for multi-fold cross-validation, and adjusting the model parameters according to whether the stochastic forest model has occurred fitting and/or under-fitting phenomenon to determine the optimal stochastic forest model to classifythe electric power industrial control attacks. Compared with that prior art, by collecting the history message data of electric power industry control for machine learning, the random forest model isbuilt, and the messages generated by the electric power industrial control system are imported into the random forest model to realize the classification of the electric power industrial control attacks, thereby improving the status quo of the passive defense of the industrial control system, enabling the system to detect and intercept the attacks before being attacked, and improving the safety performance of the electric power industrial control system.
Owner:CHINA ELECTRIC POWER RES INST +3

Chinese brain language area distribution graph construction method

The invention relates to the field of medical image processing and application, in particular to a Chinese brain language area distribution graph construction method. The invention provides a new group-based reliable Chinese language distribution graph construction method for a current situation of locating of a Chinese brain language area not universally accepted yet in the prior art. According to the method, a Chinese brain language area distribution graph is constructed based on intraoperative cortical electrical stimulation and can be applied to the neurosurgery to accurately locate the Chinese brain language area. The construction method mainly comprises the steps of constructing a two-dimensional probability graph, a three-dimensional surface distribution graph, and a random effect model-based statistic parameter graph; and displaying the distribution situation of the language function area in the brain from two-dimensional and three-dimensional perspectives. According to the method, the deviation caused by operation of locating the Chinese language function area only by MRI can be effectively avoided; the Chinese language area can be accurately located through intraoperative language locating; and better reference and guidance are provided for clinical surgeries.
Owner:AFFILIATED HUSN HOSPITAL OF FUDAN UNIV

Intranet transaction identification method based on random forest and naive Bayes model

The invention discloses an intranet transaction identification method based on a random forest and a naive Bayes model. The method comprises the following steps: acquiring an internal screen transaction sample data set under different time window periods, screening and constructing a characteristic index set by adopting a random forest model, constructing a Bayesian identification model of the internal screen transaction according to the screened characteristic index set, and performing internal screen transaction identification by adopting the Bayesian identification model to obtain a resultof whether the internal screen transaction exists or not; and after the event, supervising and verifying whether the internal transaction recognition result is correct, and training and updating the Bayesian recognition model according to the recognition result. According to the invention, the stock intranet transaction identification model is established, so that whether the test target is subjected to intranet transaction or not is accurately identified; a quasi-Newton method and a genetic algorithm are combined, so that parameters of the random forest model are quickly optimized to an optimal solution with high precision, and the solution of the optimal solution has small dependence on an initial value; the method is easy to implement and stable in performance, and robustness and accuracy can be further improved along with increase of sample data.
Owner:CHINA THREE GORGES UNIV

Method and system for optimizing random forest models

The invention is applicable to the technical field of data processing, and provides a method and a system for optimizing random forest models. The method includes creating heat distribution histograms of the random forest models and distribution histograms of decision trees, with different prediction accuracies, in the random forest models; computing similarity degrees among the decision trees by the aid of proportions of identical attribute nodes among the decision trees according to the heat distribution histograms and the distribution histograms of the decision trees, with the different prediction accuracies, in the random forest models; deleting the decision trees with the minimum prediction accuracies according to the distribution histograms of the decision trees, with the different prediction accuracies, in the random forest models, and/or deleting the decision trees with the highest similarity degrees among the decision trees in the random forest models according to the computed similarity degrees among the decision trees. The method and the system have the advantages that the random forest models optimized by the aid of the method and the system are small in scale and high in prediction accuracy and prediction speed, the prediction efficiency of the random forest models can be effectively improved, and the like.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Chromatographic overlapping peak analytical method based on wavelet transform and random forest model

The invention discloses a chromatographic overlapping peak analytical method based on wavelet transform and a random forest model. The chromatographic overlapping peak analytical method comprises thesteps that a plurality of chromatographic overlapping peak signals are generated in a simulation mode according to different parameters; for each of the overlapping peak signals, gaus1 wavelets are used for performing the wavelet transform to simulate a first-order derivative; a simulated first-order derivative curve is used for calculating and obtaining four curve inflection points of the original chromatographic overlapping peak signals; the curve inflection points are divided into a training set and a test set according to a certain proportion; the horizontal and vertical coordinates of thefour inflection points are used as the input, a sub-peak area ratio is used as the output, and the optimal parameter of a model is determined by using a cross-validation method in the training set; the random forest model is constructed and trained in a supervised mode according to the optimal parameter; and the test set is used for verifying the effect of the model; and the same method is used for detecting the inflection points of the actual overlapping peak signals, and the trained model is used for performing fitting calculation on the own sub-peak area ratio. The chromatographic overlapping peak analytical method improves the accuracy of the analytical result, and has the advantages of high model convergence speed, simple parameter adjustment and high training efficiency.
Owner:SOUTHEAST UNIV

Polarized SAR image random forest classification method integrating multiple features

The invention discloses a polarized SAR image random forest classification method integrating multiple features. The method comprises the steps that an SLIC superpixel generation algorithm is utilizedto segment a to-be-classified polarized SAR image; feature information of the polarized SAR image is extracted, and a high-dimensional polarized feature map is constructed; a random forest model is trained based on the high-dimensional polarized feature map, and a polarized SAR image random forest model is constructed; the polarized SAR image random forest model is utilized to perform statisticalanalysis on the number of votes for each category of pixels, and a superpixel category probability graph based on a random forest is constructed with the superpixels being units; each superpixel category probability is iteratively corrected based on a PLR model, and a superpixel category probability graph after iterative correction is obtained; and each superpixel category is calculated with thesuperpixels being units, and a classification result is output. According to the method, the improved SLIC algorithm is utilized to generate accurate and fine superpixels as classification units, so that interference of speckle noise in the polarized SAR image is effectively lowered; and by use of neighborhood features of the superpixels, the interference of the speckle noise is further reduced, and the precision of the classification result is improved.
Owner:CCCC SECOND HIGHWAY CONSULTANTS CO LTD

Key information infrastructure asset identification method combined with mixed random forest

The invention discloses a key information infrastructure asset identification method combined with a mixed random forest, and belongs to the technical field of computers and information science. The method comprises the following steps of carrying out the structured processing on the collected facility asset data and carrying out the feature optimization expression to obtain an extended feature vector; in combination with a Delphi expert consultation method and a principal component analysis method, analyzing the key influence factors of the asset facilities, and extracting the key feature vectors; combining the plurality of random forest judgment models with a gating function to obtain a mixed random forest judgment model; and based on the constructed mixed random forest model, identifying whether the traffic is a key asset infrastructure or not. According to the key information infrastructure asset recognition method provided by the invention, the asset feature construction and the key factor extraction are realized by combining a machine learning method under big data, and the respective expert models are constructed by partitions, so that the recognition accuracy and efficiency are improved, and the generalization ability and expandability of the model are improved.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

User authentication password security evaluation method and device based on random forest model

The invention discloses a user authentication password security evaluation method and device based on a random forest model. The user authentication password security evaluation device comprises a prefix feature extraction module, a training set reading and processing module, a model training module and a password generation module. The user authentication password security evaluation method includes the steps: improving the Markov model, taking each character of the password in the password training set as a category, extracting a prefix feature of the character as a feature vector, and training by adopting a random forest to obtain a probability model of a multi-classification problem; and for any character string, obtaining probability distribution of suffix characters of the prefix through the probability model, and generating a candidate password, thereby realizing security evaluation on the user password. According to the user authentication password security evaluation method and device, the problem that an original Markov model is prone to over-fitting due to the model fitting principle can be solved, and the attack effect is better, and the algorithm effect is more stable,and the password security can be evaluated more accurately.
Owner:PEKING UNIV

Power transmission and transformation suspicious data screening method and device based on random forest model

The invention provides a power transmission and transformation suspicious data screening method and device based on a random forest model, and the method comprises the steps: S1, selecting data of multiple dimensions according to the category and periodicity rule of power transmission and transformation equipment, and building a data feature item; S2, distributing different weights for the data according to the sampling time, respectively marking known normal data and abnormal data as positive and negative samples, and dividing a data set into K parts; S3, training a random forest model by adopting a K-fold cross validation method, iteratively adjusting the number T of trees in the random forest by taking an average value of positive and negative sample accuracy as a target, and obtaininga value of T when an index is optimal; and S4, screening suspicious data by using the trained model. Power transmission and transformation equipment is used as an object to construct a suspicious datascreening object, a random forest model of an optimization training set is used for learning data rules from a large amount of historical sampling data, power transmission and transformation suspicious data identification and screening are achieved, the workload of manual screening is reduced, and the data quality of an electric power regulation and control system is improved.
Owner:NARI TECH CO LTD +4
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products