Data processing method in learning modeling

A data processing technology, applied to data processing in learning modeling, that achieves the effects of improving accuracy and reducing feature dependence.

Pending Publication Date: 2020-08-25
INSPUR SOFTWARE CO LTD

AI Technical Summary

Problems solved by technology

[0005] The invention proposes a data processing method in learning modeling. It addresses the problem of discretizing, on the basis of the power-law relationship, a dependent variable that follows a long-tail distribution during machine learning modeling.




Embodiment Construction

[0032] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on these embodiments, without creative work, fall within the scope of protection of the present invention.

[0033] The present invention proposes a data processing method in learning modeling. Based on the power-law relationship, the method applies a log transformation to a y variable that follows a long-tail distribution so that it approximately follows a normal distribution, and then performs a log transformation on the transformed continuous variab...
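As a rough illustration of the log-transformation step in [0033], the sketch below draws a synthetic long-tailed variable and compares its skewness before and after the transform. The lognormal sample, the helper name `skewness`, and the choice of `np.log1p` are assumptions made for illustration; the source does not specify the data or the exact form of the logarithm.

```python
import numpy as np


def skewness(values):
    """Sample skewness: third standardized moment (about 0 for symmetric data)."""
    values = np.asarray(values, dtype=float)
    centered = values - values.mean()
    return float((centered ** 3).mean() / values.std() ** 3)


rng = np.random.default_rng(seed=0)

# Hypothetical long-tailed target: many small values, a few very large ones.
y = rng.lognormal(mean=3.0, sigma=1.0, size=10_000)

# log1p(y) = log(1 + y); it stays finite when y contains zeros.
y_log = np.log1p(y)

skew_raw = skewness(y)
skew_log = skewness(y_log)
print(f"skewness before log transform: {skew_raw:.2f}")
print(f"skewness after log transform:  {skew_log:.2f}")
```

A skewness close to zero after the transform suggests the variable is closer to symmetric, which is what the later discretization step relies on.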


Abstract

The invention provides a data processing method in learning modeling and belongs to the technical field of Python machine learning modeling. For a y variable that follows a long-tail distribution, the method uses a log transformation and equal-width discretization, on the basis of the power-law relationship, to convert a regression problem that requires a large amount of information into a multi-class classification problem that requires less, so that subsequent modeling work can proceed smoothly.
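The conversion described in the abstract can be sketched as follows, under stated assumptions: the target is non-negative, the logarithm is `np.log1p`, and five bins are used. The function name `log_equal_width_labels` and the synthetic data are hypothetical and do not come from the patent text.

```python
import numpy as np


def log_equal_width_labels(y, n_bins=5):
    """Log-transform y, then cut the transformed range into equal-width bins.

    Returns (labels, edges): integer class labels in 0..n_bins-1 and the
    n_bins + 1 bin edges, expressed in log space.
    """
    y = np.asarray(y, dtype=float)
    if np.any(y < 0):
        raise ValueError("this sketch assumes a non-negative target")
    y_log = np.log1p(y)
    edges = np.linspace(y_log.min(), y_log.max(), n_bins + 1)
    # Pass only the interior edges so that the maximum falls in the last bin
    # and every label lies in 0..n_bins-1.
    labels = np.digitize(y_log, edges[1:-1])
    return labels, edges


rng = np.random.default_rng(seed=0)
y = rng.lognormal(mean=3.0, sigma=1.0, size=1_000)  # synthetic long-tailed target

labels, edges = log_equal_width_labels(y, n_bins=5)
print("class counts:", np.bincount(labels, minlength=5))
print("bin edges in original units:", np.expm1(edges).round(1))
```

The integer labels can then serve as the target of any multi-class classifier. Mapping the edges back with `np.expm1` shows that bins of equal width in log space cover progressively wider ranges in the original units.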

Description

Technical field

[0001] The invention relates to the technical field of Python machine learning modeling solutions, and in particular to a data processing method in learning modeling.

Background technique

[0002] The power law comes from the analysis of the frequency of English words in the 1920s. Very few words are truly in common use, and many words are rarely used. Linguists found that the frequency with which a word is used is inversely proportional to a constant power of its usage rank. Put simply, the power law has two popular interpretations. One is the "long tail" theory: only a few large portals attract the attention of many people, but there is also a long tail of small websites and small companies. Another popular interpretation is the Matthew effect, in which the poor get poorer and the rich get richer.

[0003] Discretization is a commonly used technique in programming, which can effe...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06N20/00
CPC: G06N20/00
Inventor: 马秀霖
Owner: INSPUR SOFTWARE CO LTD