Mining method and device of single entity instance
An entity, a single technology, applied in the field of data processing, can solve problems such as inaccurate knowledge base, inaccurate entity instances, inaccurate query results, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] refer to figure 1 , shows a flow chart of steps of a mining method for a single entity instance in Embodiment 1 of the present invention.
[0045] The mining method of a single entity instance in this embodiment may include the following steps:
[0046] Step 101, grabbing pages from multiple data sources that contain entity instances corresponding to entities of a specific type.
[0047] Among them, an entity is a specific thing or concept, and entities are generally divided into types, such as person-type entities, movie-type entities, and so on. The same entity can correspond to multiple entity instances. An entity instance is a descriptive page (content) for an entity in the network (or other media). For example, various encyclopedia pages contain the entity instance corresponding to the entity.
[0048] In the embodiment of the present invention, pages from multiple data sources including entity instances corresponding to entities of a specific type are firstly craw...
Embodiment 2
[0059] refer to image 3 , shows a flow chart of steps of a method for mining a single entity instance according to Embodiment 2 of the present invention.
[0060] The mining method of a single entity instance in this embodiment may include the following steps:
[0061] Step 301, grabbing pages from multiple data sources that contain entity instances corresponding to entities of a specific type.
[0062] In the embodiment of the present invention, processing is performed on a specific type of entity. The specific type is a person class as an example for description below. For the processing process of other types of entities, refer to the processing process of the person class entity.
[0063] Step 302, respectively extracting entity names, attribute names and attribute values of entity instances included in the page.
[0064] Crawl pages from various webpages, such as Baidu Encyclopedia, Sogou Encyclopedia, Haosou Encyclopedia, etc., and contain multiple pages of entity i...
Embodiment 3
[0104] refer to Figure 4 , shows a flow chart of steps of a method for building a knowledge base in Embodiment 3 of the present invention.
[0105] The method for building a knowledge base in this embodiment may include the following steps:
[0106] Step 401, grabbing pages from multiple data sources that contain entity instances corresponding to entities of a specific type.
[0107] Step 402, respectively extracting entity names, attribute names and attribute values of entity instances included in the page.
[0108] Step 403, for the set of entity instances of entities with the same name, according to the distribution entropy index of the attribute value under the attribute name with a single degree of discrimination corresponding to the entity with the same name, combine the entity instances describing the same entity in the set into the same A single entity instance for an entity.
[0109] For the specific process of the above step 401, step 402, and step 403, it is t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com