Component classifying method based on net establishing software of decision tree

A network-structured software and classification method technology, which is applied in text database clustering/classification, computer components, unstructured text data retrieval, etc., and can solve problems such as difficult to identify true and false information, excessive information, inconsistent information forms, etc.

Inactive Publication Date: 2015-04-22
WENZHOU UNIVERSITY
View PDF4 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] A large amount of unstructured information is scattered throughout the Internet, which brings convenience to people but also brings many problems: too much information, di

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Component classifying method based on net establishing software of decision tree
  • Component classifying method based on net establishing software of decision tree
  • Component classifying method based on net establishing software of decision tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Take all known components of the entire component library as a training set, and use a decision tree to determine the recommended classification of newly added components (use yes / no to mark whether it is a component recommended by the system). The main attributes of the components in the Internet Architecture software component library are as follows: figure 1 shown. It can be seen that not all attributes are useful for building a decision tree, so we selected four attributes: ComType (component type), ValidTime (valid time), EntityType (entity type) and RepCount (number of replicas) to describe components, The purpose is to find out the relationship between these 4 attributes and the degree of recommendation (recommended / not recommended).

[0037]Decision tree technology is the main technology for classification and prediction, and decision tree learning is an example-based inductive learning algorithm. It looks at inferring classification rules in the form of a dec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a component classifying method based on net establishing software of a decision tree. A new component is added into a component bank of the net establishing software. A decision tree technology in data mining is used in classifying of the newly-added component. An ID3 algorithm based on information gain is used for carrying out analysis on the recommending degree of the newly-added component, the decision tree is established, and component classifying is completed. The decision tree technology is used in component classifying in the net establishing software, information gain is used for measuring component attribute values which are used as information amount provided for whole classifying, a classifying rule is visual, understanding and achieving are easy, and classifying efficiency is high.

Description

technical field [0001] The invention relates to a method for classifying components in a component library of network-structured software, in particular to a method for classifying components in network-structured software based on a decision tree. Background technique [0002] A large amount of unstructured information is scattered throughout the Internet, which brings convenience to people but also brings many problems: too much information, difficult to digest; difficult to identify true and false information; difficult to guarantee information security; inconsistent information forms, difficult to Unified processing. The same confusion exists in the component library of Internet-based software built on the Internet, so there should be an intermediate link between the component provider and the component consumer. Through this link, some preprocessing of component information is required, so that Component consumers can get the components they want quickly and convenient...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35G06F18/24323
Inventor 相徐斌叶修梓洪振杰张三元
Owner WENZHOU UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products