Unlock instant, AI-driven research and patent intelligence for your innovation.

High-order feature automatic generation method, system and device and medium

An automatic generation, high-level technology, applied in computing models, machine learning, computing, etc., can solve problems such as opacity and non-interpretation of human beings

Active Publication Date: 2020-07-28
上海携程国际旅行社有限公司
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to overcome the defects in the prior art that the new features generated by deep learning technology are opaque to humans, and these new features are not well interpretable, and provide an automatic generation of high-order features Methods, systems, devices and media

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-order feature automatic generation method, system and device and medium
  • High-order feature automatic generation method, system and device and medium
  • High-order feature automatic generation method, system and device and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0065] like figure 1 As shown, this embodiment discloses a method for automatically generating high-order features, including the following steps:

[0066] Step S101. Obtain an input feature set, which contains several input features; each of the input features has a corresponding name and meaning;

[0067] Step S102, generating high-order features by performing operator operations on the input features in the current input feature set;

[0068] The operator in this embodiment includes the name of the operator, the meaning of the operator, and the execution mode of the operator, and a cross feature candidate is generated according to the operator. Operators mainly include unary operators, binary operators and multivariate operators.

[0069] Step S103, adding each generated high-order feature to the input feature set to form several sets of candidate feature sets, and evaluating the several sets of candidate feature sets using the selected machine model;

[0070] Step S104,...

example 1

[0079] Example 1: For example, the newly generated feature name is [age][disc5]. Age is the parent feature, representing the age of the user. Look up the table, disc5 is a unary operator, which means feature discretization, then the new feature is expressed as discretizing age into 5 levels.

example 2

[0080] Example 2: The newly generated feature name is [[age][disc5], gender, level][groupThenAvg]. Among them, [age][disc5], gender, and level are the parent features, and [age][disc5] is a new feature after the unary operator in (1), and it also acts as the parent feature in the process of feature generation. Features, gender indicates the user's gender, and level indicates the user's consumption level. Look up the table, groupThenAvg is a multivariate operator, and the meaning of the feature is to take the average after the feature is grouped, then the new feature is expressed as grouping by the two features of [age][disc5] and gender, and then take the average of the level in the group, the feature describes The average consumption level of users in different ages and genders.

[0081] Use the model to evaluate candidate features. Since each candidate feature needs to be evaluated individually, a simple and fast model will greatly improve time efficiency. Logistic regressi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a high-order feature automatic generation method, system and device and a medium, and the method comprises the following steps: obtaining an input feature set which comprises aplurality of input features; performing operator operation on the input features in the current input feature set to generate high-order features; respectively adding each generated high-order feature into an input feature set to form a plurality of groups of candidate feature sets, and evaluating the plurality of groups of candidate feature sets by utilizing a selected machine model; adding high-order features in a plurality of candidate feature sets with optimal evaluation results into the input feature set to obtain an updated input feature set; evaluating an input feature set by using themachine model; and outputting the updated high-order features in the input feature set and the specific meanings corresponding to the features. According to the method, effective high-order featurescan be automatically generated, and the generated high-order features can be named and explained.

Description

technical field [0001] The present invention relates to the field of artificial intelligence and machine learning, in particular to a method, system, device and medium for automatically generating high-order features. Background technique [0002] In recent years, more and more cases have shown that effective features can greatly improve the indicators of a machine learning task. The feature in machine learning refers to the performance of some outstanding properties of things, which is the key to distinguish things. Many experts can use their domain knowledge, combined with specific business scenarios, to design useful features to promote business development. On the other hand, model interpretability is very important in some scenarios. For example, in the search ranking of tourism products, good interpretability is very important for suppliers of tourism products to understand the ranking results of their products. [0003] Usually, the design of an effective feature o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
CPCG06N20/00Y02P90/30
Inventor 王育添江文斌李健
Owner 上海携程国际旅行社有限公司