Method and device for generating decision tree in data mining system

A data mining and decision tree technology, applied in electrical digital data processing and special data processing applications, that addresses the problems of low computing efficiency and heavy use of system resources.

Inactive Publication Date: 2011-05-11
CHINA MOBILE COMM GRP CO LTD

AI Technical Summary

Problems solved by technology

[0026] In view of this, the embodiments of the present invention provide a method and device for generating a decision tree in a data mining system, which ar



Examples


Embodiment Construction

[0051] To provide an implementation that improves data classification efficiency and system performance, the embodiments of the present invention provide a method and device for generating a decision tree in a data mining system. The preferred embodiments of the present invention are described below with reference to the accompanying drawings. It should be understood that the preferred embodiments described here only illustrate and explain the present invention and do not limit it. Where no conflict arises, the embodiments in the present application and the features in those embodiments may be combined with each other.

[0052] According to an embodiment of the present invention, a method for generating a decision tree in a data mining system is first provided. As shown in Figure 4, the method includes:

[0053] Step 401, traversing the set data set, and determining the unclassified data set corresponding to each candid...


Abstract

The invention discloses a method and a device for generating a decision tree in a data mining system. The main technical scheme comprises the following steps: A, a set data set is traversed to determine the unclassified data set corresponding to each candidate unit on the current layer of the decision tree; B, the attribute values of the data attribute corresponding to each candidate unit are determined according to the determined unclassified data set; C, the decision tree branches of each candidate unit are generated according to the determined attribute values; and D, for the lower unit of each decision tree branch, it is judged whether all data in the corresponding unclassified data set have the same attribute value for a predetermined set data attribute. Units with a negative judgment are determined to be the candidate units of the next layer, the next layer then serves as the current layer, and step A is carried out again; units with a positive judgment are determined to be the last unit of their branch. With this technical scheme, the number of traversals of the data set is reduced; as a result, computational efficiency is improved and the occupation of system resources is lessened.
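Steps A to D can be illustrated with a minimal sketch that builds the tree one layer at a time, so that the data set is traversed once per layer rather than once per unit. This is not the patented implementation: the function names, the dictionary-based unit layout, the fixed attribute order (one attribute per layer), and the majority-label fallback when attributes run out are all assumptions made for illustration, since the excerpt does not say how the data attribute of each candidate unit is chosen.

```python
from collections import Counter, defaultdict


def build_tree_levelwise(rows, attributes, target):
    """Build a decision tree layer by layer (hypothetical sketch).

    rows: list of dicts; attributes: split attributes, one per layer
    (an assumption); target: the predetermined set data attribute.
    Returns (root_unit, number_of_data_set_traversals).
    """
    if not rows:
        raise ValueError("rows must not be empty")
    root = {"path": (), "attr": None, "children": {}, "label": None}
    candidates, depth, passes = [root], 0, 0
    while candidates:
        # Step A: one traversal routes every row to the candidate unit
        # (if any) whose branch path matches the row's attribute values.
        by_path = {unit["path"]: unit for unit in candidates}
        unclassified = defaultdict(list)
        for row in rows:
            key = tuple(row[a] for a in attributes[:depth])
            if key in by_path:
                unclassified[key].append(row)
        passes += 1

        next_candidates = []
        for unit in candidates:
            subset = unclassified[unit["path"]]
            if depth >= len(attributes):
                # Assumption: no attribute left, close with majority label.
                counts = Counter(r[target] for r in subset)
                unit["label"] = counts.most_common(1)[0][0]
                continue
            # Step B: collect the attribute values seen in the subset.
            attr = attributes[depth]
            unit["attr"] = attr
            groups = defaultdict(list)
            for r in subset:
                groups[r[attr]].append(r)
            # Step C: one branch per attribute value.
            for value, group in groups.items():
                child = {"path": unit["path"] + (value,), "attr": None,
                         "children": {}, "label": None}
                unit["children"][value] = child
                # Step D: identical target values -> last unit of the
                # branch; otherwise a candidate unit for the next layer.
                labels = {r[target] for r in group}
                if len(labels) == 1:
                    child["label"] = labels.pop()
                else:
                    next_candidates.append(child)
        candidates, depth = next_candidates, depth + 1
    return root, passes


def classify(tree, row):
    """Follow branches from the root until a labelled last unit is reached."""
    unit = tree
    while unit["label"] is None:
        unit = unit["children"][row[unit["attr"]]]
    return unit["label"]
```

In this sketch the number of traversals equals the number of layers that contain candidate units, regardless of how many units each layer holds. A resident-memory implementation could carry the subsets forward instead of re-reading the data in step A; the re-read is kept here only to mirror the per-layer traversal the abstract describes.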

Description

Technical field

[0001] The invention relates to the technical field of data mining, in particular to a method and device for generating a decision tree in a data mining system.

Background technique

[0002] Data mining, also known as knowledge discovery in databases, refers to extracting implicit, previously unknown, non-trivial, and potentially useful information or patterns from large amounts of incomplete, noisy, and fuzzy data. It draws on theories and technologies from databases, artificial intelligence, machine learning, statistics, and other fields. Data mining tools can predict future trends and behaviors, which supports decision-making well.

[0003] An important function of data mining is data classification, which maps data to predefined groups or classes. At present, a commonly used classification method is based on decision trees. When a decision tree is used for classification, the generated rules are easy to understand and ...


Application Information

IPC(8): G06F17/30
Inventor 邓超, 徐萌, 高丹, 罗治国, 周文辉, 孙少陵, 肖建明, 段云峰
Owner CHINA MOBILE COMM GRP CO LTD