Coding-based entity ID creation method and system

A coding-based entity ID creation method and system in the field of data processing. It addresses the problem that data holders use different unique ID generation rules, which makes data fusion and joint analysis across data holders difficult, and it achieves the effect of ensuring data security.

Pending Publication Date: 2020-08-28
成都数联铭品科技有限公司

AI Technical Summary

Problems solved by technology

[0005] However, different data users have different unique ID generation rules and may change at a


Examples

Embodiment 1

[0047] Assume that there are currently two data holders. Data holder 1 owns the first field, the second field, the third field, the fourth field, the fifth field ... the Nth field; data holder 2 owns the first field, the second field, the third field, the fourth field, the fifth field, the sixth field ... the Nth field. The first, second, and third fields are common to both data holders; the remaining fields differ. In a joint analysis scenario, if the data holders use different entity IDs, a fused knowledge graph cannot be established quickly. (This embodiment assumes that the number of data records and the common-field information for the same entity are identical across the data holders.)
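The setup in this paragraph can be illustrated with a minimal sketch that finds the fields two holders share. The field names below are hypothetical placeholders, since the source only speaks of "the first field", "the second field", and so on.

```python
# Hypothetical schemas: the source does not name the fields.
holder1_fields = ["field_1", "field_2", "field_3", "h1_field_4", "h1_field_5"]
holder2_fields = ["field_1", "field_2", "field_3", "h2_field_4", "h2_field_5", "h2_field_6"]


def common_fields(a, b):
    """Return the fields present in both schemas, in the order they appear in `a`."""
    b_set = set(b)
    return [f for f in a if f in b_set]


shared = common_fields(holder1_fields, holder2_fields)
print(shared)
```

Only the shared fields are candidates for building an ID that both holders can compute independently.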

[0048] A method for creating an entity ID is schematically provided in this embodiment. Assume that data holder 1 now owns the data shown in Table 1:

[0049] Table 1

[0050...

Embodiment 2

[0071] Assume that there are currently two data holders. Data holder 1 owns the first field, the second field, the third field, the fourth field ... the Nth field; data holder 2 owns the first field, the second field, the third field, the fourth field, the fifth field ... the Nth field. The first and second fields are common to both data holders; the remaining fields differ. In a joint analysis scenario, if the data holders use different entity IDs, a fused knowledge graph cannot be established quickly.

[0072] A method for creating an entity ID is schematically provided in this embodiment. Assume that data holder 1 now owns the data shown in Table 7:

[0073] Table 7

[0074]

[0075] Data holder 2 owns the data shown in Table 8:

[0076] Table 8

[0077]

[0078] Using the method of the present invention, ID encoding is performed on the common fields with a hash algorithm...
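As a rough illustration of hashing common fields into an ID, the following sketch lets two holders derive the same ID for the same entity without exchanging their other fields. The choice of SHA-256, the normalisation step, the separator, and all field names and values are assumptions made for this example; the source does not specify them.

```python
import hashlib


def entity_id(record, fields):
    """Hash the values of the given common fields into a hex ID.

    Assumption: values are trimmed and lower-cased before hashing so that
    trivial formatting differences do not change the ID.
    """
    parts = [str(record[f]).strip().lower() for f in fields]
    # Join with the ASCII unit separator so ("ab", "c") and ("a", "bc") differ.
    payload = "\x1f".join(parts).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


COMMON = ["field_1", "field_2"]  # hypothetical common fields

rec_holder1 = {"field_1": "Acme Ltd", "field_2": "Chengdu", "field_3": "x"}
rec_holder2 = {"field_1": "acme ltd ", "field_2": "Chengdu", "field_5": 42}
rec_other = {"field_1": "Other Co", "field_2": "Chengdu"}

id1 = entity_id(rec_holder1, COMMON)
id2 = entity_id(rec_holder2, COMMON)
id3 = entity_id(rec_other, COMMON)
print(id1 == id2, id1 == id3)
```

Because the ID is a digest, the ID string itself does not expose the raw field values.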

Abstract

The invention relates to a coding-based entity ID creation method and system. Entities are coded using the common fields of different data holders, the constructed codes are sorted according to set rules, and the codes that conform to a given data rule are selected as the common IDs of the same entities. This enables accurate, non-redundant knowledge graph construction and supports data fusion across different data holders. The IDs constructed by this method are unique and involve no private or sensitive information, so data security is ensured.
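The three steps named in the abstract (code, sort, select) might be sketched as follows. This is only an illustration under stated assumptions: the per-field SHA-256 coding, the lexicographic sort, and the "take the smallest code" selection rule are placeholders, because the source does not disclose the actual rules, and the field names are hypothetical.

```python
import hashlib


def field_code(name, value):
    """Code one common field (assumed scheme: SHA-256 over 'name=value')."""
    return hashlib.sha256(f"{name}={value}".encode("utf-8")).hexdigest()


def common_id(record, common_fields):
    # Step 1: build one code per common field.
    codes = [field_code(f, record[f]) for f in common_fields]
    # Step 2: sort the codes by a set rule (assumed: ascending lexicographic).
    codes.sort()
    # Step 3: select the code matching a rule (assumed: the smallest one).
    return codes[0]


holder1 = {"name": "Acme", "reg_no": "123", "city": "Chengdu", "revenue": 10}
holder2 = {"city": "Chengdu", "reg_no": "123", "name": "Acme", "staff": 7}

# Each holder may list the common fields in a different order.
id_a = common_id(holder1, ["name", "reg_no", "city"])
id_b = common_id(holder2, ["city", "name", "reg_no"])
print(id_a == id_b)
```

Sorting before selection makes the result independent of the order in which each holder stores or lists its fields. A production rule would also need to guarantee that the selected code is unique per entity, which this placeholder rule does not do on its own.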

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a coding-based entity ID creation method and system.

Background

[0002] At present, when creating big data knowledge graphs, a high-quality knowledge graph depends on high-quality data. Ideally, a graph should have clear logic, rich categories, accurate data, and no redundant information. In practice these goals are not easy to achieve. The most prominent problem is that it is very difficult to make the data of a knowledge graph accurate and free of redundancy. The reason lies in the sources from which the knowledge graph is created and in the integration of its data, which mainly consists of entity data and relationship data; entities are generally presented as nodes in the graph, and relationships are generally presented as connections between entities. In reality, the same entity name often appears in different data, especially for natural...

Application Information

IPC(8): G06F16/36, G06F16/35, G06F40/126, G06K9/62
CPC: G06F16/367, G06F16/353, G06F16/355, G06F40/126, G06F18/251
Inventor 韩远吴桐刘世林李焕周凡吟任渝车雨蒙
Owner 成都数联铭品科技有限公司