Method and device for realizing attribute normalization

A technology of attributes and attribute values, applied in the computer field, can solve problems such as labor consumption, errors, and untimely update of corresponding tables.

Active Publication Date: 2021-07-06
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1) Manpower consumption: Maintaining the corresponding table requires continuous manpower, because whenever the original words of the same attribute appear in different ways, they must be manually maintained and added to this corresponding table
[0006] 2) Low accuracy and real-time performance: In massive data, it is difficult to manually accurately, comprehensively, and quickly find the original words of a certain attribute that appear in different ways. Classification errors caused by untimely and inaccurate updates, or situations that do not correspond at all

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for realizing attribute normalization
  • Method and device for realizing attribute normalization
  • Method and device for realizing attribute normalization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054]Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0055] figure 1 is a schematic diagram of the main flow of a method for realizing attribute normalization according to an embodiment of the present invention, such as figure 1 As shown, a method for realizing attribute normalization includes the following steps:

[0056] Step S101, for each piece of data to be normalized in the source data: use a word of the data to be no...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for realizing attribute normalization, and relates to the technical field of computers. A specific embodiment of the method includes: taking the word of the data to be normalized as KEY and the data to be normalized as VALUE to obtain the first data; aggregating the first data with the same KEY into a first data group, in the first data group The VALUE of each piece of first data is aggregated into the second data, and one of the words in the first data group is selected as the normalized word of the second data; each original word in the second data is KEY, and the second data is VALUE Obtain the third data; aggregate the third data with the same KEY into the second data group, aggregate the VALUE of each third data in the second data group into the fourth data, from all the normalized words in the second data group Select a normalized word as the fourth data and add it to the extended word set to form the result data; determine the normalized attribute value according to the result data. This implementation mode can realize attribute normalization without manual maintenance, has high accuracy, good real-time performance, and saves manpower.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a method and device for realizing attribute normalization. Background technique [0002] The basis of various data mining algorithms is the characteristics of data. However, in today's Internet, in order to increase the exposure rate in various search engines, users often reflect the various ways of writing the same attribute in the text as much as possible. That is, a certain attribute value of a piece of data is often represented by splicing multiple redundant words that can represent the same meaning. Using different writing methods to write the attribute value of the same attribute will bring a lot of trouble to data processing. Taking the e-commerce field as an example, if a certain brand A has four writing styles: A1, A2, A3, and A4, these four writing styles all represent brand A. The number, sequence, etc. are not necessarily the same. If "A1 A2 A3" is the o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/215
CPCG06F16/215
Inventor 赵墨农
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products