Constraint condition-based random forest recommendation algorithm

A random forest and constraint technology, applied in the field of improved random forest recommendation algorithm, can solve the problems of difficult prediction of continuous fields, increased overhead of classification algorithm, and limitation of algorithm memory size.

Inactive Publication Date: 2017-05-31
TIANJIN UNIV
View PDF0 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The decision tree algorithm has certain shortcomings. Due to the depth-first search, the algorithm is limited by the memory size and it is difficult to handle large training sets.
Various improved algorithms (discretization, sampling) to deal with large data sets or continuous quantities not only increase the overhead of classification algorithms, but also reduce the accuracy of classification. It is difficult to predict continuous fields. When there are too many categories, errors It may increase faster. For time-ordered data, a lot of preprocessing work is required.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Constraint condition-based random forest recommendation algorithm
  • Constraint condition-based random forest recommendation algorithm
  • Constraint condition-based random forest recommendation algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Below in conjunction with accompanying drawing, the present invention is described in further detail:

[0033] Such as figure 1 As shown, it is a schematic diagram of the overall flow of a random forest recommendation algorithm based on constraints in the present invention, including:

[0034] Step 101: Generate a decision tree according to given training data. In this process, the CART algorithm is used to solve the problem of large branch scale and slow modeling of the decision tree generated by the ID3 algorithm and the C4.5 algorithm. A binary recursive segmentation technology is used; in the CART algorithm, the Gini index is used to construct a binary tree. decision tree. The decision tree generated by the CART algorithm is a binary tree with a concise structure. In the CART algorithm, it is mainly divided into two steps: (1) recursively divide the sample for tree building process; (2) use the verification data for pruning. The actual recursive division process...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a constraint condition-based random forest recommendation algorithm. The algorithm comprises the steps of generating a binary decision tree according to a data set in a given initial training set; randomly selecting features to form a random forest; measuring time efficiency of a decision tree algorithm; measuring efficiency of a random forest algorithm; and improving the random forest algorithm. Compared with other recommendation algorithms, the random forest recommendation algorithm has the advantages that the superiority of the random forest classification algorithm is fully understood, and the random forest classification algorithm is autonomously realized, so that the understanding of a recommendation system is deepened; and finally, key points causing algorithm deficiencies are researched from algorithm construction, and the algorithm is improved, thereby enabling the algorithm to have better efficiency.

Description

technical field [0001] The invention relates to the technical field of data mining and recommendation algorithms, in particular to an improved random forest recommendation algorithm. Background technique [0002] With the advancement of science and technology, people have gradually entered the Internet of Things era from the Internet information age. The rise of social networking and service industry websites has made the amount of information grow rapidly at an exponential rate. The amount of data on the Internet has long been different from what it used to be, and the sources of information are still increasing. For the challenge faced by the Internet - information overload, the recommendation system has set off an upsurge in the field of machine learning. The recommendation system has extremely high application value and has been widely used in many fields. For example, with the rapid development of e-commerce in recent years, online shopping has become the mainstream, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/241G06F18/214
Inventor 喻梅安永利于健高洁徐天一马雄
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products