Unlock instant, AI-driven research and patent intelligence for your innovation.

Recommendation method of data analysis method in data mining

An analysis method and data analysis technology, applied in the field of data analysis, can solve problems such as laborious and time-consuming

Active Publication Date: 2019-12-27
FUDAN UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Moreover, in actual data mining scenarios, it is often time-consuming to find the optimal model and algorithm through multiple iterations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recommendation method of data analysis method in data mining
  • Recommendation method of data analysis method in data mining
  • Recommendation method of data analysis method in data mining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In the data mining problem, select the data set that needs to be analyzed, and make recommendations according to the method described above, and a list of recommended methods will be returned. Select methods at different stages in the list and combine them to build a set of analysis processes from [feature engineering] to [data preprocessing] to [model], [parameter adjustment] to [model fusion]. This set of processes is constructed using a commonly used machine learning framework, and then the data is input into the process to initially complete the data analysis task.

[0043] Now give an example. Make an analysis method recommendation for the game Titanic: Machine Learning from Disaster under kaggle (predict the survivors of the Titanic based on the personal information of the passengers and the training data given).

[0044] First, the nearest neighbor algorithm based on the data set is used to predict the score of each analysis method under the competition:

[004...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of data analysis, and particularly relates to a recommendation method of a data analysis method in data mining. The recommendation method of a data analysis method mainly comprises four parts: (1) nearest neighbor recommendation based on a data set; (2) collaborative filtering based on an analysis method; (3) nerve collaborative filtering; and (4) fusion of recommendation results. According to the method, based on the interaction history of data analysis, the latent semantics of the data set and the analysis method are mined as a recommendation basis, and finally the analysis method suitable for the data set is returned. According to the method, a user can be helped to quickly find a proper analysis method, and information in a data set is mined.

Description

technical field [0001] The invention belongs to the technical field of data analysis, and in particular relates to a method for recommending data analysis methods in data mining. Background technique [0002] With the continuous development of data science, the problems solved by data mining are becoming more and more complex, and the corresponding technologies are emerging in an endless stream. Choosing among many algorithms and models for data analysis has become a thorny issue. In addition, for data of different types and distributions, whether the selected model is appropriate and whether the data analysis process is reasonable will have a decisive impact on the results of data mining. Moreover, in actual data mining scenarios, it is often labor-intensive and time-consuming to find the optimal model and algorithm through multiple iterations. [0003] With the improvement of computing power and the development of technologies such as databases and cloud services, the nu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/2458
CPCG06F16/2465Y02D10/00
Inventor 孙振远荆一楠何震瀛王晓阳
Owner FUDAN UNIV