Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

48 results about "Information gain ratio" patented technology

In decision tree learning, Information gain ratio is a ratio of information gain to the intrinsic information. It was proposed by Ross Quinlan, to reduce a bias towards multi-valued attributes by taking the number and size of branches into account when choosing an attribute.

Unbalanced data sampling method in improved C4.5 decision tree algorithm

The invention relates to an unbalanced data sampling method in an improved C4.5 decision tree algorithm. The method comprises the steps as follows: firstly, initial weights of various samples are determined according to the number of various samples; the weights of the samples are modified through the training result of the improved C4.5 decision tree algorithm in each round; the information gain ratio and misclassified sample weights are taken into account by a division standard of the improved C4.5 algorithm; the final weights of the samples are obtained after T iterations; the samples in minority class boundary regions and majority class center regions are found out according to the sample weights; over-sampling is carried out on the samples in the minority class boundary regions by an SMOTE algorithm; and under-sampling is carried out on majority class samples by a weight sampling method, so that the samples in the center regions are relatively easily selected to improve the balance degree of different classes of data, and the recognition rates of the minority class and the overall data set are improved. According to the unbalanced data sampling method in the improved C4.5 decision tree algorithm, weight modification is carried out through the improved C4.5 decision tree algorithm; and over-sampling and under-sampling are specifically carried out according to the sample weights, so that the phenomena of classifier over-fitting, loss of useful information of the majority class and the like are effectively avoided.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Detection method for automatic identification of syndrome types in traditional Chinese medicine

InactiveCN102298663AReduce redundancyDecision-making attributes are objectiveSpecial data processing applicationsNODALInformation gain ratio
The invention discloses a detection method for automatically identifying syndrome types of traditional Chinese medicine, which comprises the following steps: establishing a standard and objective database of TCM cases; aiming at the standardized TCM sample database, using an attribute screening method based on association relations to calculate the mutual relationship between each attribute Information and symmetric uncertainty, based on heuristic rules, select the symptom attribute set that contributes more to syndrome detection; use the selected key attribute set and sample information in the case database to construct a classification training sample set, and calculate the attribute information gain rate, determine the decision-making attributes, control the sample lower limit of each node and record the classification error at the same time, read all training samples and quasi-training samples in the way of incremental learning, and finally obtain the classification rules; use the obtained classification rules to carry out new Sample identification test. The invention can not only be applied to the problem of automatic syndrome differentiation of liver cirrhosis, but also can be extended to the field of automatic discrimination of other syndrome types in traditional Chinese medicine.
Owner:SHANGHAI UNIV OF T C M +1

Cyclotella identification method and apparatus

The embodiment of the invention discloses a cyclotella identification method and apparatus. The method comprises the steps of performing slice processing on a high-magnification microscopic image of an original water sample containing a to-be-tested alga to obtain a square identification image with a cyclotella shape as an internally tangent circle; performing polar coordinate transformation on the identification image, and extracting multiple identification characteristics of the image after the polar coordinate transformation; and inputting the extracted multiple identification characteristics to a decision-making tree model constructed in advance, and determining a category of the to-be-tested alga according to a decision-making tree model output result, wherein the decision-making treemodel performs classification using a C4.5 decision tree algorithm according to characteristics of a template cyclotella image and an information gain ratio of each training sample characteristics ina training sample image set, and performs identification on the to-be-tested alga according to a classification result. According to the technical scheme of the invention, that incision cannot be formed on an alga image under a complex background according to the prior art for identification is effectively solved, and the identification accuracy rate and efficiency on the alga are improved.
Owner:GUANGDONG UNIV OF TECH +1

Distributed denial of service attack detection method based on C4.5 decision tree algorithm

The invention discloses a distributed denial of service attack detection method based on a C4.5 decision tree algorithm in software defined network environment, and the method comprises the followingsteps: collecting flow table information returned back by an OpenFlow switch through an OpenFlow protocol; extracting field information related to a DDoS attack from the flow table information, converting the extracted information into parameters capable of analyzing network flow distribution variation and taking the parameters as attributes, and forming a training set of a decision tree; classifying flows with the C4.5 decision tree algorithm, calculating class information entropy according to training set data classes; orderly calculating conditional entropy of the attributes, gain of information, information entropy of the attributes and information gain ratio of the attributes; selecting the attribute with the highest information gain ratio as a root node of the decision tree, and selecting the attributes with highest information gain ratio from the residual attributes as a fork node, and repeating the steps above until forming the decision tree; and using the finally formed decision tree to perform classification operation for the new network flow, and detecting whether the DDoS attack exists. The method can detect the DDoS attack more accurately.
Owner:NANJING UNIV OF POSTS & TELECOMM

Rock burst danger level prediction method based on local weighting C4.5 algorithm

ActiveCN108280289AEasy to handleOvercome the disadvantage of biased selection of attributes with more valuesDesign optimisation/simulationSpecial data processing applicationsNODALInformation gain ratio
The invention provides a rock bust danger level prediction method based on a local weighting C4.5 algorithm and relates to the technical field of rock burst prediction. The method includes the steps of firstly, adopting an MDLP method for conducting discretization on continuous attribute data in sample data, then adopting a local weighting method for selecting a training set and calculating the weight of samples, utilizing the weight of the samples to calculate an information gain ratio of each attribute, and selecting sample attributes as root nodes of a C4.5 decision tree and splitting attributes of other branch nodes according to the information gain ratios; finally, adopting the weight of the samples to substitute the sample number to conduct pessimistic pruning on the created decisiontree, and correspondingly achieving prediction of rock burst dangers and the like in a predicted area. According to the provided rock bust danger level prediction method based on the local weightingC4.5 algorithm, the defect is overcome that the preference selection values have too many attributes when information gain is adopted for selecting node splitting attributes in an ID3 algorithm; an over-fitting problem is avoided, and the prediction accuracy of a model is high.
Owner:LIAONING TECHNICAL UNIVERSITY

Hidden Markov chain model based intelligent recommendation algorithm

The invention discloses a hidden Markov chain model based intelligent recommendation algorithm, which is applied to an intelligent recommendation system of a law net. The algorithm comprises the steps of for characteristics of documents, replacing an initial probability and a state transfer probability of a state in a hidden Markov chain model with an information gain ratio and a correlation degree respectively; calculating a partial probability of reading a <t>th document by a client; calculating an optimal probability and an optimal document sequence of each document when a document number t is equal to n, and selecting the optimal probability with the maximum value from all the optimal probabilities; and recording the optimal document sequences, namely, all the document sequences recommended to the client. According to the implementation scheme provided by the invention, the advantages of the information gain ratio and a hidden Markov chain are combined, and the shortcoming of high recommendation deviation quantity due to single use of a hidden Markov chain algorithm as a recommendation algorithm and the shortcoming of complexity in parameter calculation are made up for, so that the hidden Markov chain model based intelligent recommendation algorithm has relatively high practicality and relatively high accuracy in recommendation algorithms.
Owner:GUANGZHOU WANGLV INTERNET TECH CO LTD

Electric power business collaborative classification method and system based on C4.5 decision tree algorithm

The invention discloses an electric power business collaborative classification method and system based on a C4.5 decision tree algorithm, and the method comprises the following steps: obtaining an electric power business collaborative related database, and extracting a sample set S from the electric power business collaborative related database; extracting an index set A, wherein the index set Acontains indexes for evaluating the business collaboration data; calculating the information entropy and the information gain ratio of each index for the sample set S based on a C4.5 algorithm so as to select a proper root node and a proper intermediate node; constructing a decision tree according to the selected root node; evaluating and selecting each service cooperation scheme based on the decision tree. The invention further discloses a corresponding system. According to the invention, the information entropy and the information gain ratio are used for calculation, and classification rulesare easy to understand and high in accuracy. The method is applied to collaborative data calculation and analysis of businesses such as power outsourcing, optimal division features are selected as nodes to generate a decision-making tree and perform data classification, classification is quick and good in effect, and collaborative management of the businesses such as power outsourcing is effectively achieved.
Owner:STATE GRID ZHEJIANG ELECTRIC POWER CO LTD HANGZHOU POWER SUPPLY CO +1

Optimizing method and system of mobile service resources of operator

The invention discloses an optimizing method and system of mobile service resources of an operator. The method comprises the steps that historical dialing data of a client of the operator are counted, wherein the dialing data are continuous variables; the continuous variables are converted into discrete characteristic variables through chi-square analysis; whether the client opens the mobile service or not is used as a classified variable of the two value, and a C4.5 decision-making tree model between the characteristic variables and the classified variable is established, wherein in the decision-making tree model, the information gain ratio corresponding to each kind of cutting is calculated and the cutting threshold with the maximum information gain ratio is selected as the optimal cutting threshold of the attribute; according to the decision-making tree model, values of the classified variable are calculated so as to obtain the prediction result of whether the client opens the mobile service or not; according to the prediction result, optimizing processing operation is conducted on the mobile service resources of the operator according to the prediction result. Through the technical scheme, the requirements for the mobile service of the client can be obtained efficiently according to dialing behaviors of the current client so as to realize optimized deployment of the mobile service resources of the operator.
Owner:CHINA TELECOM CORP LTD

Customer risk feature screening method based on SVM-RFE and application thereof

ActiveCN112182331AOvercoming the disadvantages of screening customer risk characteristics with a single indexFinanceWeb data indexingInformation gain ratioSvm classifier
The invention discloses a customer risk feature screening method based on SVMRFE and application thereof. The method comprises the following steps: acquiring customer risk feature data comprising a plurality of customer features; for the customer risk feature Xj, calculating an importance comprehensive measurement index of the customer risk feature Xj based on the Gini coefficient, the informationgain, the information gain ratio, the mutual information and the feature weight of the optimal classification result of the SVM classifier; after importance comprehensive measurement indexes of all customer risk characteristics are calculated in sequence, sorting is carried out in sequence from large to small according to the indexes, and the first k characteristics are selected to form a customer risk characteristic set. According to the method, the correlation between the selected feature subset and the target variable and the redundancy of the feature subset are considered, the defect thatthe customer risk features are screened by using a single index is overcome, and the training result of the SVM classifier is introduced in the feature screening process, so that the screened customer risk features are more suitable for the characteristics of the SVM classifier, and application prospects are good.
Owner:SHANGHAI UNIV OF ENG SCI +1

Cloud environment data storage optimization method based on HDFS

The invention relates to the technical field of data storage, and discloses a cloud environment data storage optimization method based on an HDFS, which comprises the following steps: putting metadataand storage data of each node of an HDFS cluster into a pre-created data volume container; calculatingavailable storage space evaluation values of the HDFS cluster nodes in the physical machines andthe data volume containers respectively; calculating an availability value of each physical machine, and calculating a performance evaluation value of the HDFS cluster node based on the physical volume container according to the availability value of each physical machine; storing the data blocks by using a data storage copy placement algorithm; calculating a data attribute information gain ratioof the to-be-stored data; dividing to-be-stored data blocks in the HDFS cluster by utilizing a KADC-KNN algorithm based on an information gain ratio and weighting; and for the to-be-stored data blocksdivided by the KADC-KNN algorithm based on the information gain ratio and the weighting, storing the to-be-stored data blocks into a Feeder HDFS cluster according to different storage strategies. According to the invention, the optimization of data storage is realized.
Owner:汪礼君

A mobile phone feature detection optimization improvement algorithm based on a C4.5 decision tree

The invention discloses a mobile phone feature detection optimization improvement algorithm based on a C4.5 decision tree. The method is based on a C4.5 algorithm, and in the process of calculating the information gain and the information gain ratio for each factor, a weight parameter is given aiming at the division of different factors, so that the information gain degree data difference of different factors is greater than that of the prior art, the recognition of machine learning on the use characteristic difference of different users is improved, and the credibility of a result is improved. According to the method and the device, the use characteristics of the mobile phone user are analyzed and recorded, and the characteristics of the current mobile phone user are matched with the characteristics of the mobile phone user, so that whether the mobile phone user is legal or not is judged, and the mobile phone cannot be illegally used by a third-party user. The invention aims to improve the mobile phone feature detection accuracy by improving and optimizing a C4.5 decision tree machine learning algorithm and more accurately identify illegal use operations of the mobile phone.
Owner:ELECTRIC POWER RES INST OF STATE GRID ZHEJIANG ELECTRIC POWER COMAPNY +1

Method for determining the optimal segmentation scale of satellite image segmentation based on information gain ratio

The invention discloses a method for determining the optimal segmentation scale of satellite image segmentation based on information gain rate, includes obtaining high-resolution satellite remote sensing image, preprocessing, selecting representative region as sample region determined by optimal segmentation scale, classifying surface features of sample region, obtaining image classification result of sample region as reference image determined by optimal segmentation scale, and obtaining image classification result of sample region as reference image determined by optimal segmentation scale;setting a series of segmentation scale parameters from small to large, using multi-scale segmentation technology, the sample image is segmented in multi-scale to obtain a series of segmented object vectors; Based on Shannon information entropy formula, the information entropy of reference image and segmentation vector is calculated, and the conditional entropy of reference image with segmentationvector is calculated, then the information gain and information gain ratio of segmentation vector are calculated. The optimal segmentation scale is selected according to the principle of maximum information gain rate. Finally, based on the optimal segmentation scale, the original image is segmented at multiple scales.
Owner:ZHEJIANG UNIV

Information providing method and device

The invention provides an information providing method and device. The method comprises the following steps: targeting to a first set, judging whether the number of each data item in the first set isgreater than a first threshold value or not; if so, calculating an information gain ratio of each selected filling parameter in the selected filling parameter list, selecting an inquiry template corresponding to the selected filling parameter with the maximum information gain ratio to inquire the user; selecting and filling parameter information; deleting the data items which are not matched withthe filling parameter information in the first set; obtaining a new first set, and when the data items in the new first set are not greater than the first threshold value, deleting the selected filling parameter with the maximum information gain ratio in the selected filling parameter list to obtain a new selected filling parameter list, re-judging whether the number of the data items in the new first set is greater than the first threshold value or not until the number of the data items is not greater than the first threshold value, and providing the data items in the new first set for the user. According to the method provided by the invention, the filling parameter with the maximum information gain ratio is selected each time to inquire the user, so that the frequency of interaction with the user is reduced, and the service efficiency is improved.
Owner:沈阳民航东北凯亚有限公司

Distributed Denial of Service Attack Detection Method Based on C4.5 Decision Tree Algorithm

The invention discloses a distributed denial of service attack detection method based on a C4.5 decision tree algorithm in software defined network environment, and the method comprises the followingsteps: collecting flow table information returned back by an OpenFlow switch through an OpenFlow protocol; extracting field information related to a DDoS attack from the flow table information, converting the extracted information into parameters capable of analyzing network flow distribution variation and taking the parameters as attributes, and forming a training set of a decision tree; classifying flows with the C4.5 decision tree algorithm, calculating class information entropy according to training set data classes; orderly calculating conditional entropy of the attributes, gain of information, information entropy of the attributes and information gain ratio of the attributes; selecting the attribute with the highest information gain ratio as a root node of the decision tree, and selecting the attributes with highest information gain ratio from the residual attributes as a fork node, and repeating the steps above until forming the decision tree; and using the finally formed decision tree to perform classification operation for the new network flow, and detecting whether the DDoS attack exists. The method can detect the DDoS attack more accurately.
Owner:NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products