Feature data processing method and device

A technology of characteristic data and processing method, which is applied in the field of data processing, can solve the problem of small discrimination of characteristic data, and achieve the effect of improving the degree of discrimination and ensuring the difference

Active Publication Date: 2019-04-05
ADVANCED NEW TECH CO LTD
View PDF12 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of this specification provides a feature data processing method and device, which are used to solve the problem of low discrimination of feature data after normalization processing caused by the long-tail distribution of feature data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature data processing method and device
  • Feature data processing method and device
  • Feature data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the purpose, technical solution and advantages of the present application clearer, the technical solution of the present application will be clearly and completely described below in conjunction with specific embodiments of the present application and corresponding drawings. Apparently, the described embodiments are only some of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0034] Such as figure 1 As shown, an embodiment of this specification provides a feature data processing method 100, which is used to solve the problem of low discrimination of feature data after normalization processing caused by the long-tail distribution of feature data. This embodiment 100 includes the following step:

[0035] S102: Determine ou...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a feature data processing method and device. The method comprises the following steps: determining outlier data in specified features of a sample set; Performing scaling processing on the outlier data in the sample set to obtain a scaled sample set, the scaled outlier data being greater than non-outlier data in specified features of the sample set before scaling; Performing clustering processing on the zoomed sample set; And based on the plurality of clusters subjected to clustering processing, respectively performing normalization processing on the specified feature data of the zoomed sample set in the specified feature interval corresponding to each cluster.

Description

technical field [0001] The embodiments of this specification relate to the field of data processing, and in particular, to a method and device for processing feature data. Background technique [0002] With the continuous development of the Internet, more and more characteristic data are produced by users in the process of using the Internet. These characteristic data can be widely used and converted into useful information, such as the amount of fund purchases based on users, the number of purchases According to characteristic data such as browsing records and board browsing records, the user access loyalty score, user value score or user browsing board stickiness score can be obtained. The above scoring scores can provide a reference for product operations, and can also be used as discretized data for model training. [0003] In the process of scoring users, it is found that many feature data, such as users' fund purchase amount, have an obvious long-tail distribution, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/28
Inventor 刘松吟董扬
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products