Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

193 results about "Gradient boosting decision tree" patented technology

A Primer on Gradient Boosted Decision Trees. Gradient boosted decision trees are an effective off-the-shelf method for generating effective models for classification and regression tasks. Gradient boosting is a generic technique that can be applied to arbitrary 'underlying' weak learners - typically decision trees are used.

Enterprise industry classification method

ActiveCN107944480ASolve the tedious problem of manual classificationSolve classification problemsCharacter and pattern recognitionLearning basedCluster algorithm
The invention discloses an enterprise industry classification method. According to the method, main business keywords of enterprises are effectively extracted by utilizing semi-supervised learning-based image split clustering algorithm, the extracted keywords are used as features on the basis of a gradient enhancement decision-making tree, and a training cascade classifier is used for classifyingthe enterprises according to industries, so that the problem that artificial classification is tedious is solved. The method specifically comprises the following steps of: 1) extracting main businesskeywords of enterprises by utilizing a word vector and a semi-supervised image split clustering algorithm, getting rid of junk words and constructing a keyword library; and 2) inputting the extractedkeywords which are taken as features into a training cascade classifier, the enterprises are classified by each level of classifier, and the unclassified enterprises are classified according to the next level of classifier. According to the method, keywords can be automatically constructed, updated and classified, the problem of classifying millions and millions of enterprise industries is solved,and the problem of artificial labelling is effectively solved.
Owner:广州探迹科技有限公司

Hyperspectral image classification based on gradient lifting decision tree and semi-supervise algorithm integration

The invention discloses a hyperspectral image classification based on gradient lifting decision tree and semi-supervise algorithm integration in order to solve the technical problem that hyperspectral image classification based on active learning and semi-supervise learning is low in classification precision. Hyperspectral image classification includes the steps that firstly, hyperspectral image data is input; secondly, features of sample points are extracted; thirdly, parameters of a gradient lifting decision tree classifier are trained; fourthly, massed learning sample points are classified; fifthly, the confidence degree of the sample points are assessed; sixthly, the sample points are screened through sparse representation; seventhly, a marked training set is updated; eighthly, a classification result is output. Assessment is conducted on the confidence degree of the unmarked sample points through the prediction result of the classifier and sparse representation, according to the confidence degree of the unmarked sample points, the sample points are divided into two sets for different kinds of processing, burdens for manual marking are reduced while classification precision is improved, and hyperspectral image classification can be used in the fields of geological survey, atmospheric pollution and like.
Owner:XIDIAN UNIV

Risk scoring model construction method, device, storage medium and terminal

The present invention provides a risk scoring model construction method. The construction method includes the following steps that: a blacklist sample library and a whitelist sample library are constructed according to preset account data, wherein the blacklist sample library includes abnormal accounts, and the whitelist sample library includes normal accounts; cluster training is performed on theabnormal accounts in the blacklist sample library and the normal accounts in the whitelist sample library on the basis of a gradient boosting decision tree (GBDT) algorithm, and abnormal account classification features are screened out; the abnormal account classification features are trained on the basis of a random forest (RF) algorithm, and a contribution degree corresponding to each abnormalaccount classification feature is obtained; and a risk scoring model is constructed according to the abnormal account classification features and the contribution degrees corresponding to the abnormalaccount classification features, and the risk scoring model is used for identifying the abnormal accounts. The risk scoring model constructed by the method of the invention improves the timeliness ofclearing the abnormal accounts, reduces noise interferences caused by the abnormal accounts, and improves the calculation precision of many indicators of APPs.
Owner:CHINA PING AN LIFE INSURANCE CO LTD

Commodity purchase prediction modeling method

The invention discloses a commodity purchase prediction modeling method. The method comprises the steps that a purchase record marking training sample is used to predict whether to purchase or not; a sliding window commodity purchase sample is constructed; commodity purchase features are designed based on a time preference; a gradient improvement decision tree algorithm is used for training prediction; after the sample and the features are constructed, feature processing and selection need to be performed, and then the features are input into the gradient improvement decision tree algorithm for training prediction; and feature selection indicators include feature value distribution and relevancy, feature information gains, feature calling frequency, influences of feature knockout, etc. Ordering is performed on feature importance by integrating the indicators, and redundant features with low importance are eliminated. According to the method, a sliding window sample construction method and a feature system based on the time preference are proposed, the accuracy of a commodity purchase prediction model is effectively improved, and the method is used for realizing commodity personalized recommendation in a big data background to precisely recommend proper commodities to a user at a proper time and a proper place.
Owner:SOUTH CHINA UNIV OF TECH

Noise diagnosis algorithm for rolling bearing faults of rotary equipment

The invention discloses a noise diagnosis algorithm for rolling bearing faults of rotary equipment. Firstly, a sound pick-up device collects running noise signals of a rolling bearing, and the signalsare subjected to preliminary fault judgment through a bearing normality and anomaly pre-classification model based on an anomaly detection algorithm; secondly, according to a fault pre-judgment result, the abnormal signals (the faults occur) pass through a neural network filter to filter normal components in the signals of the bearing, the output net abnormal signals are connected to a subsequentfeature extraction module, and the normal signals (no faults occur) are directly connected to the feature extraction module; the feature extraction module extracts Mel-cepstrum coefficients (MFCC) ofthe signals to serve as eigenvectors, feature reconstruction is carried out by utilizing a gradient boosted decision tree (GBDT) to form composite eigenvectors, and principal component analysis (PCA)is used for carrying out dimensionality reduction on features; and finally, feature signals are input into an improved two-stage support vector machine (SVM) ensemble classifier for training and testing, and at last, high-accuracy fault type diagnosis is achieved. According to the algorithm, the bearing faults can be effectively detected and relatively high fault identification accuracy is kept;and the algorithm has relatively high effectiveness and robustness for detection and classification of the bearing faults.
Owner:CHINA UNIV OF MINING & TECH

Data evaluation method and device, terminal equipment and storage medium

The invention discloses a data evaluation method and device, terminal equipment and a storage medium. The method comprises the following steps of carrying out preprocessing on sample variables in a sample data set in order to obtain nominal variables which are ordered according to the sizes of characteristic values; carrying out one-hot coding on the nominal variables which are ordered according to the sizes of the characteristic values, and converting the nominal variables into digital variables; applying a gradient lifting decision-making tree algorithm to the sample data set containing thedigital variables; generating a decision tree model comprising n decision trees; and acquiring combined characteristics by adopting the gradient lifting decision-making tree algorithm. The accuracy ofprediction of the combined characteristics of sample data is improved, and the efficiency of acquisition of the combined characteristics is also improved, so that the combined characteristics are used as input characteristics of a binary logic regression model to carry out prediction of a preset event result, and thus the complexity and the uncertainty of manual searching of the characteristics are avoided, the prediction accuracy of the sample data for the preset event result is improved, and meanwhile, the accuracy and the efficiency of sample data evaluation are also improved.
Owner:CHINA PING AN LIFE INSURANCE CO LTD

Cloud-computing oriented network intrusion detect method and system

ActiveCN106899440AImprove dynamic update operation speedImprove real-time performanceData switching networksNetwork behaviorNetwork attack
The invention discloses a cloud-computing oriented network intrusion detect method. The method comprises the steps of generating a pseudo-gradient promotion decision-making tree set for judging network intrusion behaviors; determining weight information of pseudo-gradient promotion decision-making trees according to classification features of non-leaf nodes corresponding to the pseudo-gradient promotion decision-making trees; carrying out parallel judgment on received network behavior records through adoption of the pseudo-gradient promotion decision-making trees, thereby obtaining independent judgment results; and multiplying the independent judgment results by the corresponding weight information, thereby generating final result information for judging whether the network behavior records are network attacks or not. The method and the system are characterized in simplicity, legibility, high distinguishing precision and high smart comprehensive processing capability. The pseudo-gradient promotion decision-making tree can be generated in a distributed parallel mode under a cloud computing environment. The dynamic update operation speed of the decision-making threes is improved, and the timeliness and accuracy of detecting new-type intrusion events by an IDS are improved.
Owner:SUZHOU UNIV

Android malicious behavior dynamic detection method based on binary dynamic instrumentation

The invention relates to an Android malicious behavior dynamic detection method based on binary dynamic instrumentation, and belongs to the technical field of computer and information science. The method comprises the following steps: firstly, triggering all potential malicious behaviors of tested software through an Android dynamic detection framework; then, through a dynamic binary instrumentation technology, constructing a calling sequence of a program to a system API, using an N-Gram model to extract call timing relationship characteristics of a function; finally, inputting the generated time sequence relation characteristics into a trained GBDT (Gradient Boosting Decision Tree, Gradient Boosting Decision Tree) multi-classification algorithm detection model, identifying malicious software, and carrying out fine-grained classification on malicious behaviors of the software. According to the invention, a dynamic binary instrumentation technology is used.A system function calling timesequence feature of the software is extracted without knowing a program source code. Compared with the prior art, the Android malicious behavior detection method has high accuracy for Android malicious behavior detection, malicious behaviors of the software can be divided into six classes. More detailed detection conclusion granularity is achieved, and the detection efficiency of the Android malicious software is effectively improved.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Shopping mall building air conditioner cooling load prediction method based on GBDT, storage medium and equipment

PendingCN112001439AFlexible handlingSolve the problem of requiring a large amount of data trainingForecastingCharacter and pattern recognitionSimulationEngineering
The invention discloses a shopping mall building air conditioner cooling load prediction method based on GBDT, a storage medium and equipment, and the method comprises: collecting cooling load data, and carrying out the normalization processing to serve as the cooling load energy consumption prediction; establishing a load prediction model based on a gradient lifting decision tree algorithm; inputting the preprocessed data into a prediction model for training, selecting a grid search-cross validation mode, and optimizing the three hyper-parameters with the maximum influence on the performanceof the GBDT model; establishing a final cold load prediction model by completing parameter optimization of the prediction model, and obtaining a predicted cold load curve according to the parameters and the structure of the prediction model; and evaluating the prediction performance of the prediction model, adopting the prediction error for evaluation, enabling the deviation between the true valueand the prediction value to form the prediction error, and completing mall building air conditioner cooling load prediction. The method has good prediction precision, universality and applicability,and is especially suitable for large public buildings with periodically changing cold loads.
Owner:XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY

Dynamic electrocardiogram heart beat classification method based on gradient boosting decision tree

ActiveCN109303559AAccurate identificationAvoid the influence of abnormal heartbeatDiagnostic recording/measuringSensorsEcg signalSupraventricular Ectopic Beats
The invention relates to a dynamic electrocardiogram heart beat classification method based on a gradient boosting decision tree. The method comprises the steps that in actual dynamic electrocardiogram, classification is conducted on single heart beats in an electrocardiogram signal according to whether or not arrhythmia exists and the types of arrhythmia, specific classification categories comprise normal heart beats, supraventricular ectopic beat heart beats, ventricular ectopic beats, ventricular beat and normal beat fusion heart beats and pacemaker heart beats; the method comprises the following steps that 1, training data is obtained; 2, heart beat interception and feature extraction are conducted; 3, feature selection and classification model training are conducted; 4, classificationmodel application is conducted, wherein a tree-model-based feature selection method is adopted in step 3 to select features, and the classification model is trained through a gradient boosting decision tree classification method. The method is suitable for arrhythmia classification training of dynamic electrocardiogram and classification identification of different types of heart beats, and a doctor can be assisted in accurately reading and analyzing the electrocardiogram.
Owner:杭州质子科技有限公司

Short-term load prediction method based on GBDT (gradient boosting decision tree)

ActiveCN108539738AHigh precisionGeneralization error is controllableLoad forecast in ac networkData setOriginal data
An embodiment of the invention discloses a short-term load prediction method based on a GBDT (gradient boosting decision tree). The method comprises the following steps: acquiring historical load dataof N days before the prediction-waiting day, and forming an original data set A0; screening a data set B for constructing training samples from the original data set A0; constructing total sample set(X, Y) required for constructing a GBDT prediction model by the data set B; training and constructing a whole-day GBDT prediction model by the total sample set (X, Y), and prediction whole-day load vector of the prediction-waiting day according to the whole-day GBDT prediction model; segmenting the total sample set (X, Y) into 24 sample subsets by the hour, training and constructing hour GBDT prediction models respectively, and predicting 24-hour load vectors of the prediction-waiting day according to the hour GBDT prediction models; predicting the final load vector of the prediction-waitingday according to combination of the whole-day load vector and the 24-hour load vectors. The short-term load prediction precision is improved by sufficiently mining characteristics in the historical load data and constructing the different GBDT models.
Owner:国网山东省电力公司营销服务中心(计量中心) +2

Cigarette loose end rate prediction method and system based on improved gradient improvement decision tree

The invention discloses a cigarette loose end rate prediction method based on an improved gradient improvement decision tree, which comprises the following steps: acquiring process parameters of tobacco shreds in the same batch in each process of tobacco shred making and loose end rate data in a final package process, and forming an original data set by the process parameters and the loose end rate data; dividing the original data set based on the original data set in combination with a correlation coefficient analysis method to obtain a key process parameter set; performing normalization processing on the data in the key process parameter set, and performing random division on the normalized data to obtain a training data set and a test data set; on the basis of training samples in the training data set, constructing an improved gradient improvement decision tree model; and inputting the test data set sample into the improved gradient improvement decision tree model, and predicting the cigarette empty head rate. A connection is established between each process parameter of the tobacco shred making process and the index of the cigarette packet vacancy rate, so that more accurate prediction of the cigarette vacancy rate is realized.
Owner:HANGZHOU ANMAISHENG INTELLIGENT TECH CO LTD

Phosphoric acid production parameter control method based on gradient boosted decision tree

The present invention provides a method for software measurement of ground phosphate rock consumption and a phosphoric acid production parameter control method in a feed-grade calcium hydrophosphate production process. The phosphoric acid production parameter control method comprises the steps of: analyzing relevant factors for influencing consumption, based on the method theory of machine learning, transmitting and storing pulp flow real-time data and vitriol flow real-time data automatically collected by an internet-of-things collection device and pulp flow data manually collected by a lab to a cloud platform, allowing a python language-based analysis platform to be directly connected with a database to extract features based on time sequence data to perform analysis and modeling, and establishing a real-time soft measurement technology for the ground phosphate rock consumption to replace a ground phosphate rock physical measurement device with high investment and easy damaging. Theimplementation process of the method mainly comprises the steps of: pulp flow collection, vitriol flow, pulp storage tank intensity, mineral powder consumption historical data, data preprocessing, training of a gradient boosted decision tree (GBDT) regression model, and prediction of the mineral powder consumption to control phosphoric acid for generation of parameters through adoption of the trained GBDT regression model.
Owner:上海新增鼎数据科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products