Data mining method and system

A data mining and data technology, applied in the network field, can solve problems such as low data mining efficiency and inability to guarantee the correctness of data mining results, and achieve the effects of saving data mining costs, improving data mining efficiency, and ensuring correctness

Inactive Publication Date: 2012-06-27
亿赞普(北京)科技有限公司
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved in this application is to provide a data mining method and system to solve the problem tha...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data mining method and system
  • Data mining method and system
  • Data mining method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] refer to figure 2 , shows a flowchart of steps of a data mining method according to Embodiment 1 of the present application.

[0027] The data mining method of the present embodiment comprises the following steps:

[0028] Step S102: Obtain multiple program scripts corresponding to different data mining sub-processes.

[0029] Wherein, a data mining sub-process includes at least one program script, and each program script belongs to a data mining sub-process. That is, there may be one program script to implement a data mining sub-process; there may also be multiple program scripts combined to implement a data mining sub-process; there may also be multiple program scripts, some of which can be combined in different ways Realize a data mining sub-process, if there are five scripts A, B, C, D, E, among them, ABC combination can realize the data preprocessing sub-process, ADC can also realize the data pre-processing sub-process, AEC is the same, the difference The diffe...

Embodiment 2

[0039] refer to image 3 , shows a flowchart of steps of a data mining method according to Embodiment 2 of the present application.

[0040] The data mining method of the present embodiment comprises the following steps:

[0041] Step S202: According to the data mining sub-process to which each program script belongs, set the prefix character of the script name for multiple program scripts in the system as the script identifier of the program script.

[0042] Wherein, there may be multiple program scripts executing the same data mining sub-process, and the same script name prefix characters are set for these program scripts. For example, program scripts a1, a2, and a3 all execute the data preprocessing sub-process, then the same script name prefix character "P_" can be set for these three scripts, and they are named "P_a1", "P_a2" and "P_a3" respectively. A data mining sub-process corresponds to multiple different program scripts, and different data mining models can be comp...

Embodiment 3

[0054] refer to Figure 4 , shows a schematic flowchart of a data mining method according to Embodiment 3 of the present application.

[0055] Such as Figure 4 As shown, this embodiment provides an automation framework for integrating and sorting program scripts. The automation framework is divided into three main parts, namely, data selection and preprocessing and transformation part, model calculation part and data analysis part, each part has its own execution module. The basis for the division of the three parts can be the coupling between the data mining sub-processes, for example, the coupling between the three sub-processes of data selection, data preprocessing and data conversion is relatively large, so they are integrated together. However, those skilled in the art should understand that in practical applications, those skilled in the art may perform appropriate integration and division according to actual needs, or may not perform integration and division. One dat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data mining method and a system. The data mining method comprises: obtaining a plurality of program scripts corresponding to various data mining sub-procedures, wherein one data mining sub-procedure comprises at least one program script, each program script is provided with one corresponding scrip mark, and the script mark is used for indicating the data mining sub-procedure corresponding to the program script, wherein the program script belongs to the data mining sub-procedure; obtaining a plurality of script marks corresponding to the plurality of program scripts; determining calling sequence of the plurality of program scripts corresponding to the plurality of script marks according to the execution sequence of various data mining sub-procedures; calling the plurality of program scripts according to the determined calling sequence and performing data mining. According to the invention, the correctness of a data mining result is guaranteed, the data mining efficiency is improved, and the data mining cost is saved.

Description

technical field [0001] This application relates to the field of network technology, in particular to a data mining method and system. Background technique [0002] With the rapid development of information technology, the amount of data accumulated by people has increased rapidly, and how to extract useful knowledge from massive data has become a top priority. Data mining is a data processing technology developed to meet this need. [0003] Data mining refers to the process of extracting hidden, unknown but potentially useful information and knowledge from a large number of incomplete, noisy, fuzzy, random data. In the Internet, the process of data mining such as figure 1 As shown, it includes: data selection sub-process, data preprocessing sub-process, data conversion sub-process, model calculation sub-process, data analysis and same sub-process. At present, each sub-process in this process is implemented independently, and they are generally processed separately without...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/48
Inventor 罗峰黄苏支李娜
Owner 亿赞普(北京)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products