Variable binning method and device, terminal equipment and storage medium

A variable binning technology, applied in the computer field, that addresses inaccurate binning results and low binning efficiency. It reduces manual intervention and time consumption, improves binning efficiency, and enables fast and accurate feature extraction.

Active Publication Date: 2018-12-07
CHINA PING AN LIFE INSURANCE CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a variable binning method, device, terminal equipment, and storage medium to solve the problems of inaccurate binning results and low binning efficiency of equal-frequency binning or equal-width binning in the prior art.

Examples

Embodiment 1

[0031] Figure 1 shows the implementation flow of the variable binning method provided by this embodiment. The method is applied in the feature encoding process on the Spark platform to bin sample data automatically. While preserving as much of the original sample data information as possible, it extracts features quickly and accurately and enables rapid modeling. The details are as follows:

[0032] S1: Obtain sample data.

[0033] In the embodiment of the present invention, sample data is collected from a preset database, and the sample data is mainly insurance business data.

[0034] S2: According to the preset variable configuration, determine from the sample data the nominal variable to be binned and the m feature values corresponding to the nominal variable, where m is a positive integer greater than 1.
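Step S2 can be illustrated with a minimal sketch. The record layout, the column names, the shape of the configuration dictionary, and the function name `feature_values_to_bin` are all assumptions made for illustration; the source does not specify the format of the preset variable configuration.

```python
from typing import Dict, List

# Hypothetical sample data: each record is one row of insurance business data.
SAMPLE_DATA = [
    {"occupation": "teacher", "region": "east", "claimed": 1},
    {"occupation": "driver", "region": "west", "claimed": 0},
    {"occupation": "teacher", "region": "west", "claimed": 0},
    {"occupation": "nurse", "region": "east", "claimed": 1},
]

# Hypothetical preset variable configuration: names the nominal variables to bin.
VARIABLE_CONFIG = {"nominal_variables": ["occupation", "region"]}


def feature_values_to_bin(rows: List[dict], config: dict) -> Dict[str, List[str]]:
    """Return the distinct feature values of each configured nominal variable."""
    result: Dict[str, List[str]] = {}
    for name in config["nominal_variables"]:
        # Collect the distinct values observed for this variable.
        values = sorted({row[name] for row in rows if name in row})
        # The method requires m > 1 feature values; skip constant columns.
        if len(values) > 1:
            result[name] = values
    return result


if __name__ == "__main__":
    print(feature_values_to_bin(SAMPLE_DATA, VARIABLE_CONFIG))
```

Sorting the values is only there to make the output deterministic; nominal values have no inherent order.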

[0035] In the embodiment of the present invention, the preset variable configuration is used to configure the v...

Embodiment 2

[0108] Figure 4 shows the variable binning device corresponding to the variable binning method provided in Embodiment 1. For convenience of description, only the parts related to the embodiment of the present invention are shown.

[0109] As shown in Figure 4, the variable binning device includes an acquisition module 41, a determination module 42, a storage module 43, a calculation module 44, a removal module 45, and a loop module 46. The functional modules are described in detail as follows:

[0110] An acquisition module 41, configured to acquire sample data;

[0111] A determination module 42, configured to determine, from the sample data and according to the preset variable configuration, the nominal variable to be binned and the m feature values corresponding to the nominal variable, where m is a positive integer greater than 1;

[0112] A storage module 43, configured to store m feature values...

Embodiment 3

[0133] This embodiment provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, it implements the variable binning method in Embodiment 1, or it implements the functions of the modules of the variable binning device in Embodiment 2. To avoid repetition, details are not repeated here.

[0134] It can be understood that the computer-readable storage medium may include any entity or device capable of carrying the computer program code, such as a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, computer memory, read-only memory (ROM), random access memory (RAM), an electrical carrier signal, or a telecommunication signal.

Abstract

The invention relates to the technical field of computers, and provides a variable binning method, a variable binning device, terminal equipment and a storage medium. The variable binning method comprises the steps of: acquiring sample data; determining, according to a preset variable configuration, the nominal variables to be binned and the feature values corresponding to the nominal variables from the sample data; storing the feature values in a preset feature value set; for each feature value in the feature value set, dividing the nominal variable into two bins by using the feature value as a test split point, and computing the association index value corresponding to that feature value; using the feature value with the maximum association index value as the target split point to execute a binning operation, and removing that feature value from the feature value set; and stopping binning when the binning result reaches a preset bin number threshold, or otherwise continuing to execute the binning operation. According to the technical scheme provided by the invention, the binning operation is performed automatically on the nominal variables based on the association index values, manual intervention and time consumption are reduced, and the efficiency of the binning operation is improved.
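The loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the source does not name the association index, so information value (IV) against a binary label is used as a stand-in; it also does not say how a nominal value induces a two-way split, so the sketch assumes a one-versus-rest split of the remaining value set. The function names and the 0.5 smoothing constant are hypothetical.

```python
import math
from collections import Counter
from typing import List, Sequence, Set, Tuple


def information_value(groups: Sequence[Tuple[int, int]]) -> float:
    """IV over bins given (positives, negatives) per bin, with 0.5 smoothing."""
    total_pos = sum(p for p, _ in groups)
    total_neg = sum(n for _, n in groups)
    iv = 0.0
    for pos, neg in groups:
        pos_rate = (pos + 0.5) / (total_pos + 0.5 * len(groups))
        neg_rate = (neg + 0.5) / (total_neg + 0.5 * len(groups))
        iv += (pos_rate - neg_rate) * math.log(pos_rate / neg_rate)
    return iv


def greedy_nominal_binning(
    values: Sequence[str], labels: Sequence[int], max_bins: int
) -> List[Set[str]]:
    """Greedily split off the value with the highest index until max_bins is reached."""
    pos = Counter(v for v, y in zip(values, labels) if y == 1)
    neg = Counter(v for v, y in zip(values, labels) if y == 0)
    candidates = set(values)  # the preset feature value set
    bins: List[Set[str]] = []
    # The remaining candidates always count as one bin, hence the "+ 1".
    while len(bins) + 1 < max_bins and len(candidates) > 1:
        rest_pos = sum(pos[v] for v in candidates)
        rest_neg = sum(neg[v] for v in candidates)
        scores = {}
        for v in sorted(candidates):
            # Test split: {v} versus every other remaining value (assumption).
            one = (pos[v], neg[v])
            rest = (rest_pos - pos[v], rest_neg - neg[v])
            scores[v] = information_value([one, rest])
        best = max(sorted(scores), key=scores.get)  # target split point
        bins.append({best})
        candidates.remove(best)  # remove it from the feature value set
    bins.append(candidates)
    return bins


if __name__ == "__main__":
    vals = ["a"] * 10 + ["b"] * 10 + ["c"] * 10 + ["d"] * 10
    labs = [1] * 9 + [0] + ([1] * 5 + [0] * 5) * 2 + [1] * 4 + [0] * 6
    print(greedy_nominal_binning(vals, labs, max_bins=3))
```

Any other association measure, such as a chi-square statistic, could be substituted for `information_value` without changing the loop structure.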

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a variable binning method, device, terminal equipment and storage medium.

Background technique

[0002] At present, the common binning methods are equal-width binning and equal-frequency binning. Equal-width binning divides the value range of a feature into a intervals of equal width (a being the number of bins), and each interval is regarded as one bin. Equal-frequency binning arranges the feature values in ascending order and divides them into a parts according to the number of feature values, and each part is regarded as one bin. However, both equal-width binning and equal-frequency binning require the number of bins to be set manually in advance. If the number of bins is too small, more information is lost; if the number of bins is too large, the purpose of binning cannot be achieved.

[0003] If after equal-frequency binning or equal-width binning, manual merging is p...
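The two prior-art methods described in the background can be illustrated with a short sketch. The function names are hypothetical, and `n_bins` plays the role of the manually chosen number of bins that the text identifies as the drawback of both methods.

```python
from typing import List


def equal_width_edges(values: List[float], n_bins: int) -> List[float]:
    """Split [min, max] into n_bins intervals of equal width; return the edges."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def equal_frequency_bins(values: List[float], n_bins: int) -> List[List[float]]:
    """Sort values ascending and cut them into n_bins parts of near-equal size."""
    ordered = sorted(values)
    size, extra = divmod(len(ordered), n_bins)
    bins, start = [], 0
    for i in range(n_bins):
        # The first `extra` bins take one additional value each.
        end = start + size + (1 if i < extra else 0)
        bins.append(ordered[start:end])
        start = end
    return bins


if __name__ == "__main__":
    data = [0.0, 10.0, 2.0, 4.0]
    print(equal_width_edges(data, 2))
    print(equal_frequency_bins([5, 1, 4, 2, 3], 2))
```

In both functions the caller must pick `n_bins` up front, which is the manual step the invention aims to avoid.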

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/18, G06K9/62
CPC: G06F17/18, G06F18/24
Inventor: 黄严汉, 曾凡刚
Owner: CHINA PING AN LIFE INSURANCE CO LTD