Data processing method and device

A data processing device and data processing technology, applied in the field of data processing, can solve problems such as affecting model construction, affecting off-grid user analysis, and unbalanced number of positive and negative samples.

Active Publication Date: 2019-12-27
HUAWEI TECH CO LTD
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The absence of specific types of sample data can significantly affect the construction of models and thus data analysis
For example, in the off-grid user prediction application, the number of off-grid users is very small, which leads to a high imbalance in the number of positive and negative samples, which affects the analysis of off-grid users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device
  • Data processing method and device
  • Data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The technical terms or nouns involved in the embodiments of the present application will be introduced below.

[0047] (1) Tabular data, data displayed in the form of a table, can be displayed in the form of a wide table, or data displayed in the form of a narrow table. Among them, a wide table literally means a database table with many fields. A wide table usually refers to a database table that associates indicators, dimensions, and attributes related to a business theme. For example, Table 1 below is tabular data in the telecom domain.

[0048] Table 1

[0049] username mobile phone number Attribution Package Type … Zhang San XXXXXXXXXXXX A place Package 1 … Li Si XXXXXXXXXXXX B land Package 2 … Wang Wu XXXXXXXXXXXX C land Package 3 … … … … … …

[0050] Table data is all the sample data displayed in the table, a row is a sample, and a column is a feature. For example, the row of data in Table 1 wher...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a data processing method and device, and the method can comprise the following steps: carrying out the standardized coding of input table data, obtaining firsttable data, and an object description feature of the first table data being a numerical type object description feature; based on the first table data, using a generative adversarial network model togenerate second table data, and the similarity between the second table data and the first table data reaches a first threshold value; and performing inverse standardized encoding on the second tabledata to obtain output table data, the output table data having the same object description feature as the input table data. By adopting the embodiment of the invention, the output data very close tothe input data can be constructed, and the data analysis can be realized even if the data is separated from the data local point.

Description

technical field [0001] The embodiments of the present application relate to the technical field of data processing, and in particular to a data processing method and device thereof. Background technique [0002] With the rapid development of big data technology, telecom operators have also begun to pay more and more attention to how to transform the messy and massive telecom domain data into valuable information, so as to realize applications such as package recommendation, customer retention, and base station traffic prediction. However, due to the following particularities of telecommunication domain data, it will bring difficulties to the analysis of telecommunication domain data. [0003] Particularity one, the telecom domain data cannot be taken away from the telecom office site, resulting in the inability to build a model for the telecom domain data and analyze the telecom domain data when it leaves the telecom site. Particularity two, the data of specific types of sa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q30/02G06Q50/30
CPCG06Q30/0201G06Q30/0203G06Q50/30G06Q30/02
Inventor 刘诗凯张旭王佳佳
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products