Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Entity normalization processing method, device and equipment and storage medium

A processing method and normalization technology, applied in the field of data processing, can solve problems such as difficulty in applying business scenarios, lack of standardized guarantees, and high labor costs, so as to facilitate the formulation and modification of entity normalization strategies and reduce human development costs , The effect of reducing the cost of learning

Pending Publication Date: 2020-05-15
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF1 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing entity normalization method, R&D engineers need to program by themselves, which consumes a lot of labor costs, is difficult to learn, and lacks standardization guarantee; while using the model for entity normalization, the model training process requires a large amount of labeled data, and It requires professional algorithm engineers to iterate, it is difficult to apply to commercial scenarios, the industry is not universal, and it lacks applicability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity normalization processing method, device and equipment and storage medium
  • Entity normalization processing method, device and equipment and storage medium
  • Entity normalization processing method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0069] In the existing entity normalization method, R&D engineers need to program by themselves, which consumes a lot of labor costs, is difficult to learn, and lacks standardization guarantee; while using the model for entity normalization, the model training process requires a large amount of labeled data, and It requires professional algorithm engineers to iterate...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an entity normalization processing method, device and equipment and a storage medium, and relates to an entity normalization processing technology. According to the specific implementation scheme, the method comprises the steps of receiving rule parameters related to an entity normalization strategy input by a user; generating a program code corresponding to the entity normalization strategy according to the rule parameter and a preset code generation rule; and running a program code corresponding to the entity normalization strategy, and performing normalization judgment on entities in a preset entity data set so as to cluster the same entities. The user only needs to input rule parameters related to an entity normalization strategy; program codes corresponding toentity normalization strategies can be automatically generated according to rule parameters and preset code generation rules. User programming is not needed, the manpower development cost and the learning cost are reduced, the threshold of data production is reduced, the entity normalization strategy is convenient to modify, the entity normalization processing efficiency is improved, and the method can be applied to entity normalization processing of data in any field.

Description

technical field [0001] The present application relates to the technical field of data processing, and specifically relates to entity normalization processing technology. Background technique [0002] In the construction of knowledge map data, since the construction of knowledge map often needs to use a variety of different data sources, it is an important task to normalize and fuse the same entities in different data sources. For example, the data of the movie "Children of the Weather" comes from three different websites, and its related attributes such as release time are 2019-11-01 (China), 2019-07-19 (Japan), 2019-11-01 (China) , the directors are Makoto Shinkai, etc., so they refer to the same entity, which needs to be disambiguated. The entity disambiguation process is divided into two steps: entity normalization and fusion. Entity normalization is to normalize the same entities into the same set; while in fusion, entities in the same set are fused, and strategies are ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F8/30
CPCG06F8/30
Inventor 王冠朝方舟江涛仲夏
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products