Knowledge base entity normalization method, system, terminal and computer-readable storage medium
A knowledge base and entity technology, applied in the field of database construction, can solve problems such as the inability of classification scheme to solve the problem of normalization, large differences in data form, complex and difficult knowledge base construction, etc. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0060] The embodiment of the present invention provides a knowledge base entity normalization method, such as figure 1 As shown, the method mainly includes the following steps:
[0061] Step S100: Obtain the entity set in the knowledge base.
[0062] Wherein, the knowledge base may be a knowledge base with a scale of millions, tens of millions, or hundreds of millions. The above-mentioned knowledge bases of various scales can be Chinese knowledge graphs, single-category or multi-category hybrid knowledge bases.
[0063] Step S200: Pre-partitioning the entity set by combining multiple partitioning methods.
[0064] It should be noted that multiple partitioning methods refer to two or more partitioning methods. Pre-partitioning is to divide the entity collection into multiple groups (or multiple zones), and the entity collection in each group is several entities that are suspected to be the same. The combination of multiple partitioning methods can be understood as each part...
Embodiment 2
[0119] The embodiment of the present invention provides a knowledge base entity normalization system, such as Figure 4 shown, including:
[0120] Obtaining module 10, used for obtaining the entity set in the knowledge base;
[0121] The multi-dimensional partition module 20 is used to pre-partition the entity set by combining multiple partition methods;
[0122] Sample construction module 30, for carrying out sample construction according to the result of pre-partitioning, extracting key samples;
[0123] Feature construction module 40, is used for carrying out feature construction according to the result of pre-partition, extracts similar features;
[0124] The normalization determination module 50 is used to combine key samples and similar features through at least one normalization model, and perform a normalization determination on each entity pair in the pre-partitioned result, and determine whether each entity pair is the same entity;
[0125] A set division module 6...
Embodiment 3
[0137] The embodiment of the present invention provides a knowledge base entity normalization terminal, such as Figure 5 shown, including:
[0138] A memory 400 and a processor 500 , the memory 400 stores computer programs that can run on the processor 500 . When the processor 500 executes the computer program, the knowledge base entity normalization method in the foregoing embodiments is implemented. The number of memory 400 and processor 500 may be one or more.
[0139] The communication interface 600 is used for the memory 400 and the processor 500 to communicate with the outside.
[0140] The memory 400 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), such as at least one magnetic disk memory.
[0141] If the memory 400, the processor 500, and the communication interface 600 are implemented independently, the memory 400, the processor 500, and the communication interface 600 may be connected to each other through a ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com