Unordered categorical variable processing method and device

A method and device for processing unordered categorical variables. It addresses the low efficiency of existing grouping approaches and their inability to ensure that the grouping is effective or that downstream results are reliable.

Status: Inactive
Publication date: 2018-09-28
GUOXIN YOUE DATA CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, grouping by experience is inefficient and cannot ensure that the grouping is effective, and using multi-categorical variables with a wide range of categories directly, without grouping, cannot ensure the accuracy of subsequent modeling and other applications.


Embodiment Construction

[0055] To make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of this application, not all of them. The components of the embodiments, as generally described and illustrated in the figures herein, may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments of the present application, all other embodiments obtained by those skilled in the art without...


Abstract

The invention provides a method and device for processing unordered categorical variables. The method comprises the following steps: obtaining an unordered categorical variable set, wherein the set comprises at least two categories of unordered categorical variables and the corresponding dependent variable is a binary variable; for each category in the set, computing the category proportion, that is, the proportion of unordered categorical variables in that category whose dependent variable takes the target value of the binary variable; and clustering the unordered categorical variable set on the basis of the category proportion of each category to obtain a plurality of unordered categorical variable subsets, wherein each subset comprises at least one category and corresponds to one ordered categorical variable. Because the grouping requires no human experience, it is processed more efficiently, and the grouping results are more objective and accurate.
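The steps in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not name a clustering algorithm, so a plain one-dimensional k-means is swapped in here, and the function names (`target_proportions`, `cluster_1d`, `group_categories`), the number of groups `k`, and the default target value `1` are all hypothetical choices rather than anything taken from the patent.

```python
from collections import defaultdict


def target_proportions(categories, targets, target_value=1):
    """Per category, the share of rows whose binary dependent variable equals target_value."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    for cat, y in zip(categories, targets):
        totals[cat] += 1
        if y == target_value:
            hits[cat] += 1
    return {cat: hits[cat] / totals[cat] for cat in totals}


def cluster_1d(values, k, iterations=100):
    """Plain 1-D k-means (an assumed stand-in for the unspecified clustering step)."""
    lo, hi = min(values), max(values)
    # Deterministic start: centers spread evenly across the observed range.
    centers = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    labels = None
    for _ in range(iterations):
        new_labels = [min(range(k), key=lambda j: abs(v - centers[j])) for v in values]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            if members:
                centers[j] = sum(members) / len(members)
    return labels, centers


def group_categories(categories, targets, k, target_value=1):
    """Map each category to an ordinal group code, ranked by cluster center."""
    props = target_proportions(categories, targets, target_value)
    names = sorted(props)
    labels, centers = cluster_1d([props[n] for n in names], k)
    # Rank only the clusters that received members, lowest proportion first.
    used = sorted(set(labels), key=lambda j: centers[j])
    rank = {j: r for r, j in enumerate(used)}
    return {n: rank[lab] for n, lab in zip(names, labels)}
```

With four categories whose target shares are 0.25, 0.0, 0.75 and 1.0 and `k=2`, the two low-share categories receive group code 0 and the two high-share categories receive group code 1. Ranking the clusters by their mean proportion is what turns each unordered subset into a level of an ordered variable.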

Description

Technical field

[0001] The present application relates to the technical field of computer processing and, in particular, to a method and device for processing unordered categorical variables.

Background

[0002] With the advent of the era of big data and the rapid development of Internet technology, the amount of data in various industries has grown explosively. Unordered categorical variables account for a large proportion of this data. To make it easier to mine the potential value in the data, these unordered categorical variables need to be processed effectively. Variable grouping is a common problem in variable processing.

[0003] For the problem of variable grouping, most research concerns the grouping of continuous variables. There are few related studies on the grouping of unordered categorical variables, and two processing methods are basically adopted: one is to grou...


Application Information

Patent type and authority: Applications (China)
IPC(8): G06F17/30
Inventors: 夏耘海, 李燕伟, 王甲樑, 夏虎
Owner: GUOXIN YOUE DATA CO LTD