Method and system for automatically generating machine learning sample features

A machine learning and automatic generation technology, applied in the field of artificial intelligence, can solve problems such as inability to conveniently generate machine learning samples, and achieve the effects of lowering the threshold of use, improving ease of use, and improving efficiency

Pending Publication Date: 2018-05-29
THE FOURTH PARADIGM BEIJING TECH CO LTD
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Exemplary embodiments of the present invention provide a method and system for automatically generating features of machine learning samples to solve the problem in the prior art that features of machine learning samples cannot be easily generated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatically generating machine learning sample features
  • Method and system for automatically generating machine learning sample features
  • Method and system for automatically generating machine learning sample features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like numerals refer to like parts throughout. The embodiments are described below in order to explain the present invention by referring to the figures.

[0047] Here, machine learning is an inevitable product of the development of artificial intelligence research to a certain stage. It is dedicated to improving the performance of the system itself by means of calculation and using experience. In computer systems, "experience" usually exists in the form of "data". Through machine learning algorithms, "models" can be generated from data. Model, when faced with a new situation, the model will provide the corresponding judgment, that is, predict the result. Whether training a machine learning model or making predictions using a trained machine learning model, the data needs to be converted into machine learning samples inclu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and system for automatically generating features of a machine learning sample. The method comprises the steps that (A) a user-specified data table is acquired, whereinone row of the data table corresponds to a data record, and one column of the data table corresponds to a field; (B) the feature type corresponding to each non-target value field in the data table isdeclared, wherein the feature type comprises a discrete feature and / or a continuous feature; (C) each non-target value field is processed into a unit feature according to the declared feature type; (D) feature combination is carried out based on the generated unit features to generate a combined feature; and (E) the features of the machine learning sample are acquired based on the generated unit features and the combined feature. According to the invention, the method and the system can automatically generate the features of the machine learning sample based on the data table; the use threshold of feature engineering is reduced; the usability of the feature engineering is improved; and the efficiency of feature engineering is improved.

Description

technical field [0001] The present invention generally relates to the field of artificial intelligence, and more specifically, relates to a method and system for automatically generating features of machine learning samples. Background technique [0002] With the emergence of massive data, people tend to use machine learning techniques to mine value from the data. [0003] The basic process of training a machine learning model mainly includes: [0004] 1. Import a dataset (for example, a data table) containing historical data records; [0005] 2. Complete feature engineering, in which various features are obtained by performing various processing on the attribute information of the data records in the data set, and the feature vector formed by these features can be used as a machine learning sample; [0006] 3. Training the model, wherein, according to the set machine learning algorithm (for example, logistic regression algorithm, decision tree algorithm, neural network al...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N99/00
CPCG06N20/00G06F18/2155
Inventor 杨强戴文渊陈雨强孙迪杨慧斌刘守湘
Owner THE FOURTH PARADIGM BEIJING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products