Big data mining tool and method based on a drag-and-drop process
A data mining and big data technology, applied in digital data processing, structured data retrieval, database management systems, etc. It addresses the high cost of big data mining applications, the high requirements for professional knowledge, and integration difficulties, with the effects of lowering the threshold for data mining, optimizing computing efficiency, and lowering the threshold for use.
Examples
Embodiment 1
[0050] Referring to Figure 2, this embodiment takes big data cleaning as an example to describe the specific application of the present invention in detail.
[0051] (1) Add start and finish operators. The data mining tool uses the start and finish operators to mark the beginning and end of a mining process and to guide the calculation of the entire process. If the start and finish operators are not set, no valid calculation can be performed.
[0052] (2) Add a data source. The data mining tool uses data source operators to represent the data to be mined. Dragging a data source operator onto a blank area of the process view imports the data.
[0053] (3) Add a big data mining operator. Drag the "Data Selection" operator plug-in onto a blank area of the process view to load the data selection function.
[0054] (4) Connect the operators. Connect the data source to the "data selection" operator to transfer data between them; the connection sequence from t...
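Steps (1) to (4) can be pictured as building a small graph of operators and connections. The following is only a minimal sketch under stated assumptions, not the tool's actual implementation: the `Process` class, its method names, and the direction of the connection (data source to "data selection") are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Process:
    """Hypothetical in-memory model of a drag-and-drop mining process."""
    operators: set[str] = field(default_factory=set)
    connections: list[tuple[str, str]] = field(default_factory=list)

    def add_operator(self, name: str) -> None:
        # Stands in for dragging an operator onto the process view.
        self.operators.add(name)

    def connect(self, source: str, target: str) -> None:
        # Stands in for drawing a connection; data is assumed to flow
        # from source to target.
        if source not in self.operators or target not in self.operators:
            raise ValueError(f"unknown operator in {source!r} -> {target!r}")
        self.connections.append((source, target))

    def is_computable(self) -> bool:
        # Step (1): without both marker operators, nothing can be calculated.
        return {"start", "finish"} <= self.operators


process = Process()
for name in ("start", "finish", "data source", "data selection"):
    process.add_operator(name)  # steps (1) to (3)
process.connect("data source", "data selection")  # step (4)
```

In this sketch, a process that lacks either the start or the finish operator reports `is_computable()` as `False`, mirroring the requirement in step (1).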
Embodiment 2
[0059] Referring to Figure 3, this embodiment takes neural network regression prediction as an example to describe the big data submission method and the calculation engine selection method in detail.
[0060] (1) The process operators shown in Figure 3 function as follows: the "data splitting" operator divides the data into training and test sets; the "neural network" operator trains the model; the "attribute selection" operator removes the labels from the test set; and the "model application" operator makes regression predictions on the test set based on the trained model;
[0061] (2) Full submission. The calculation engine calculates the output of all operators from "start" to "finish";
[0062] (3) Partial submission. The calculation engine calculates only the output of a specified operator, such as the "data selection" operator. The path that the "data selection" operator depends on is obtained by parsing the process XML file. ...
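The difference between full and partial submission can be illustrated with a dependency graph read from a process file. This is a hedged sketch only: the XML element and attribute names (`operator`, `connection`, `name`, `from`, `to`), the wiring of the example process, and the function names are all assumptions, since the source does not show the tool's XML schema.

```python
import xml.etree.ElementTree as ET
from graphlib import TopologicalSorter

# Hypothetical process file; the schema and the wiring are assumptions.
PROCESS_XML = """
<process>
  <operator name="start"/>
  <operator name="data source"/>
  <operator name="data selection"/>
  <operator name="data splitting"/>
  <operator name="finish"/>
  <connection from="start" to="data source"/>
  <connection from="data source" to="data selection"/>
  <connection from="data selection" to="data splitting"/>
  <connection from="data splitting" to="finish"/>
</process>
"""


def load_dependencies(xml_text):
    """Map each operator name to the operators it directly depends on."""
    root = ET.fromstring(xml_text)
    deps = {op.get("name"): set() for op in root.iter("operator")}
    for conn in root.iter("connection"):
        deps[conn.get("to")].add(conn.get("from"))
    return deps


def full_submission(deps):
    """All operators, ordered so that inputs come before their consumers."""
    return list(TopologicalSorter(deps).static_order())


def partial_submission(deps, target):
    """Only the target operator and everything it transitively depends on."""
    needed, stack = set(), [target]
    while stack:
        name = stack.pop()
        if name not in needed:
            needed.add(name)
            stack.extend(deps[name])
    sub = {name: deps[name] for name in needed}
    return list(TopologicalSorter(sub).static_order())


deps = load_dependencies(PROCESS_XML)
print(full_submission(deps))
print(partial_submission(deps, "data selection"))
# → ['start', 'data source', 'data selection']
```

Under these assumptions, partial submission skips every operator downstream of the target, which is where the saving in computation would come from.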