Linked data oriented entity classification method and system

A technology oriented to linking and classification methods, applied in the field of information processing, can solve the problems of inability to accurately obtain entity categories, inability to obtain entity categories, insufficient recognition accuracy, etc., and achieve the effects of easy debugging, easy implementation, and good accuracy.

Inactive Publication Date: 2016-08-31
PEKING UNIV
View PDF6 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the statistical classification method based solely on text features has insufficient recognition accuracy and cannot accurately obtain entity categories.
[0006] (2) Many entity pages do not have enough text description information. In this case, simply using text description information to classify entities will inevitably lead to classification errors, and entity categories cannot be obtained through text descriptions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Linked data oriented entity classification method and system
  • Linked data oriented entity classification method and system
  • Linked data oriented entity classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Below in conjunction with accompanying drawing, further describe the present invention through embodiment, but do not limit the scope of the present invention in any way.

[0042] The present invention provides a link data-oriented entity classification method and system. Aiming at the link data entity classification problem, the purpose of high-precision entity classification is achieved through a statistical classification process and a post-processing process; The model is used to classify; the post-processing process uses rich resources (such as affix information, link data, etc.) to correct the results of entity statistical classification, figure 1 It is a flow chart of the entity classification method for linked data provided by the present invention. Such as figure 1 As shown, the method of the present invention includes a preprocessing process, a statistical classification process and a postprocessing process; firstly, word segmentation feature extraction is pe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a linked data oriented entity classification method and system which are aimed at the problem of entity classification of linked data. The entity classification method includes pretreatment, statistical classification and post treatment. Pretreatment: word segmentation is performed on text description information in an entity page; and an attribute name of an information box and word information obtained by segmentation form an entity page character. Statistical classification: a statistical classification model is trained through various segmentation granularities to classify the entity page, and then a primary prediction result of entity class can be obtained. Post treatment: the entity statistical classification result is corrected; and a combined entity class is corrected through model combination, language knowledge, linkage information and class associate attribute information. The method and the system is easy to implement and debug, is high in efficiency, is good in accuracy, can be used for performing knowledge management on the linked data, and achieve high-precision classification of the entity.

Description

technical field [0001] The invention belongs to the field of information processing, relates to link data classification and search, in particular to a method and system for high-precision classification of entity pages in link data. Background technique [0002] Currently in the era of big data, how to maximize the use of data to help computers perform information processing has become the hottest research topic in the field of information processing. In recent years, with the advent of the Web 2.0 era, linked data (such as semantic web, knowledge graph, etc.) has attracted widespread attention because of its powerful relationship description capabilities. Linked data refers to data organization forms such as Baidu Encyclopedia and Wikipedia. In this kind of data, each page corresponds to an entity, and there are mutual links between entities, so it is called linked data. With the continuous increase of data scale, it is unrealistic to use manual methods to manage linked d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/353G06F16/355G06F16/367
Inventor 葛涛穗志方
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products